Variational Autoencoders for Pedestrian Synthetic Data Augmentation of Existing Datasets - a Preliminary Investigation

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

20 Downloads (Pure)

Abstract

The requirements for more and more data for training deep learning surveillance and object detection models have resulted in slower deployment and more costs connected to dataset gathering, annotation, and testing. One way to help with this is the use of synthetic data giving more varied scenarios and not requiring manual annotation. We present our initial exploratory work in generating synthetic pedestrian augmentations for an existing dataset through the use of variational autoencoders. Our method consists of creating a large number of backgrounds and training a variational autoencoder on a small subset of annotated pedestrians. We then interpolate the latent space of the autoencoder to generate variations of these pedestrians, calculate their positions on the backgrounds, and blend them to create new images. We show that even though we do not achieve as good results as just adding more real images, we can boost the performance and robustness of a YoloV5 model trained on a mix of real and small amounts of synthetic images. As part of this paper, we also propose the next steps to expand this approach and make it much more useful for a wider array of datasets.
Original languageEnglish
Title of host publication19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Number of pages8
Volume2
PublisherSCITEPRESS Digital Library
Publication date13 Mar 2024
Pages829-836
ISBN (Electronic)978-989-758-679-8
DOIs
Publication statusPublished - 13 Mar 2024

Keywords

  • Dataset Augmentation
  • Object Detection
  • Surveillance
  • Synthetic Data
  • Variational Autoencoders

Fingerprint

Dive into the research topics of 'Variational Autoencoders for Pedestrian Synthetic Data Augmentation of Existing Datasets - a Preliminary Investigation'. Together they form a unique fingerprint.

Cite this