Conference Publication Details
Mandatory Fields
Peter Corcoran, Shabab Bazrafkan, Joe Lemley
Embedded Vision Summit 2018
Getting More from Your Datasets: Data Augmentation, Annotation and Generative Techniques
Optional Fields
Deep Neural Networks, Data Augmentation, Smart Augmentation, Generative Adversarial Networks, Data Generation, Data Annotation
Jeff Bier
Santa Clara Convention Center, Santa Clara, California
Deep Learning for embedded vision requires large datasets. Indeed, the more varied the training data, the more accurate the trained network. But acquiring and accurately annotating datasets costs time and money. This talk will show how to get more from existing datasets. Firstly, state-of-the-art data augmentation techniques are reviewed, and a new approach, smart augmentation, is explained: CNN network-A is trained to learn optimal augmentation strategies for CNN network-B. Secondly, Generative Adversarial Networks (GANs) learn the structure of an existing dataset, and several example use cases show how GANs can generate “new” data corresponding to the original dataset. The example of creating a very large dataset of facial training data is presented. But building a dataset is not the whole problem: data must be annotated in a way that is meaningful for the training process. An example of training a GAN from a dataset that incorporates ‘annotations’ is given. This enables ‘pre-annotated data’ to be generated, providing an exciting way to create large datasets at significantly reduced costs.
Xperi Inc. & Science Foundation Ireland
Grant Details
13/SPP/12868 Next Generation Imaging for Smartphones & Embedded Imaging Devices
Publication Themes
Informatics, Physical and Computational Sciences