No Need to Scream: Robust Sound-based Speaker Localisation in Challenging Scenarios

Tze Ho Elden Tse, Daniele De Martini, Letizia Marchegiani

Publication: Contribution to book/anthology/report/conference proceeding; conference article in proceedings; research; peer-reviewed

4 Citations (Scopus)
60 Downloads (Pure)

Abstract

This paper is about speaker verification and horizontal localisation in the presence of conspicuous noise. Specifically, we are interested in enabling a mobile robot to robustly and accurately spot the presence of a target speaker and estimate his/her position in challenging acoustic scenarios. While several solutions to both tasks have been proposed in the literature, little attention has been devoted to the development of systems able to function in harsh noisy conditions. To address these shortcomings, in this work we follow a purely data-driven approach based on deep learning architectures which, by not requiring any knowledge of either the nature of the masking noise or the structure and acoustics of the operating environment, is able to act reliably in previously unexplored acoustic scenes. Our experimental evaluation, relying on data collected in real environments with a robotic platform, demonstrates that our framework is able to achieve high performance in both the verification and localisation tasks, despite the presence of copious noise.

Original language: English
Title: Social Robotics - 11th International Conference, ICSR 2019, Proceedings
Editors: Miguel A. Salichs, Shuzhi Sam Ge, Emilia Ivanova Barakova, John-John Cabibihan, Alan R. Wagner, Álvaro Castro-González, Hongsheng He
Number of pages: 10
Volume: 11876
Publisher: Springer
Publication date: 2019
Pages: 176-185
ISBN (Print): 978-3-030-35887-7
ISBN (Electronic): 978-3-030-35888-4
DOI
Status: Published - 2019
Event: International Conference on Social Robotics - Madrid, Spain
Duration: 26 Nov 2019 to 29 Nov 2019

Conference

Conference: International Conference on Social Robotics
Country/Territory: Spain
City: Madrid
Period: 26/11/2019 to 29/11/2019
Name: Lecture Notes in Computer Science
ISSN: 0302-9743
