Latent Birds: A Bird's-Eye View Exploration of the Latent Space

Juan Alonso Moreno, Francesco Bigoni, George Palamas

Publication: Contribution to book/anthology/report/conference proceeding › Conference article in proceeding › Research › peer-review

2 Citations (Scopus)
138 Downloads (Pure)

Abstract

Generative approaches to sound synthesis overcome limitations of traditional approaches and offer novel ways to explore creative ideas. This paper demonstrates a method for generating original bird vocalizations using a Variational Convolutional Autoencoder trained on mel-spectrograms of bird song and call recordings. The vocalizations are reconstructed by sampling the latent space and decompressing the resulting mel-spectrogram. The results are promising: the system generates a variety of plausible bird songs and calls, either by interpolating between existing vocalizations or by sampling the latent space. A Twitter bot that publishes a unique bird vocalization daily is also implemented.
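
The two generation modes named in the abstract, sampling the latent space and interpolating between existing vocalizations, can be illustrated with a minimal sketch in plain Python. The latent dimension, the function names, the standard normal prior, and the straight-line interpolation are assumptions made for illustration, not details taken from the paper. Decoding a latent vector into a mel-spectrogram and audio is left out because it requires the trained model.

```python
import random


def sample_prior(latent_dim, rng):
    """Draw one latent vector from a standard normal prior N(0, I) (assumed)."""
    return [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]


def interpolate(z_a, z_b, steps):
    """Return `steps` latent vectors evenly spaced on the line from z_a to z_b."""
    if steps < 2:
        raise ValueError("steps must be at least 2")
    if len(z_a) != len(z_b):
        raise ValueError("latent vectors must have the same length")
    path = []
    for i in range(steps):
        t = i / (steps - 1)  # t runs from 0.0 (z_a) to 1.0 (z_b)
        path.append([(1 - t) * a + t * b for a, b in zip(z_a, z_b)])
    return path


rng = random.Random(0)  # fixed seed so the sketch is repeatable
LATENT_DIM = 8          # hypothetical size; the abstract does not state it

# Stand-ins for the latent codes of two encoded recordings.
z_a = sample_prior(LATENT_DIM, rng)
z_b = sample_prior(LATENT_DIM, rng)

# Five latent points from the first code to the second, endpoints included.
path = interpolate(z_a, z_b, steps=5)
```

In a full pipeline, each vector in `path` would be passed to the trained decoder to obtain a mel-spectrogram, which would then be converted back to audio.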
Original language: English
Title: Proceedings of the 17th Sound and Music Computing Conference
Editors: Simone Spagnol, Andrea Valle
Number of pages: 7
Publisher: Axea sas/SMC Network
Publication date: 2020
Pages: 364-370
ISBN (Electronic): 978-88-945415-0-2
Status: Published - 2020
Event: 17th Sound and Music Computing Conference - Torino, Italy
Duration: 24 Jun 2020 to 26 Jun 2020
Conference number: 17
https://smc2020torino.it/uk/

Conference

Conference: 17th Sound and Music Computing Conference
Number: 17
Country/Territory: Italy
City: Torino
Period: 24/06/2020 to 26/06/2020
Name: Proceedings of the Sound and Music Computing Conference
ISSN: 2518-3672
