Variational Autoencoders for Pedestrian Synthetic Data Augmentation of Existing Datasets: A Preliminary Investigation

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

1 Citationer (Scopus)
48 Downloads (Pure)

Abstract

The requirements for more and more data for training deep learning surveillance and object detection models have resulted in slower deployment and more costs connected to dataset gathering, annotation, and testing. One way to help with this is the use of synthetic data giving more varied scenarios and not requiring manual annotation. We present our initial exploratory work in generating synthetic pedestrian augmentations for an existing dataset through the use of variational autoencoders. Our method consists of creating a large number of backgrounds and training a variational autoencoder on a small subset of annotated pedestrians. We then interpolate the latent space of the autoencoder to generate variations of these pedestrians, calculate their positions on the backgrounds, and blend them to create new images. We show that even though we do not achieve as good results as just adding more real images, we can boost the performance and robustness of a YoloV5 model trained on a mix of real and small amounts of synthetic images. As part of this paper, we also propose the next steps to expand this approach and make it much more useful for a wider array of datasets.
OriginalsprogEngelsk
TitelProceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Antal sider8
Vol/bind2
ForlagSCITEPRESS Digital Library
Publikationsdato13 mar. 2024
Sider829-836
ISBN (Elektronisk)978-989-758-679-8
DOI
StatusUdgivet - 13 mar. 2024
Begivenhed19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISAPP 2024 - Rom, Italien
Varighed: 27 feb. 202429 feb. 2024
Konferencens nummer: 19
https://visapp.scitevents.org/?y=2024

Konference

Konference19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISAPP 2024
Nummer19
Land/OmrådeItalien
ByRom
Periode27/02/202429/02/2024
Internetadresse

Fingeraftryk

Dyk ned i forskningsemnerne om 'Variational Autoencoders for Pedestrian Synthetic Data Augmentation of Existing Datasets: A Preliminary Investigation'. Sammen danner de et unikt fingeraftryk.

Citationsformater