Abstract
This paper aims at comparing and reproducing the predictions of two public available computational auditory models for speaker localization in different simulated environments. The direction-of-arrival (DOA) of sound sources in the horizontal plane can be extracted by using binaural spatial cues from room and user acoustics. Since our predictions consider the specificity of both models at the level of peripheral processing, the proposed solution for DOA extraction also provides a common multi-conditional training for the Gaussian Mixture Model (GMM) approach. A set of acoustic simulations of adverse conditions (i.e. multi speakers or high reverberant scenarios) supports the evaluation phase on robustness of the synthetic auditory process. Our analysis reproduces two case studies from the scientific literature in order to investigate the reliability of localization predictions in the frontal horizontal plane. Finally, a newly defined acoustic scenario allows to identify differences between auditory models outcome in the entire horizontal plane. The results show a good agreement with previous literature and our machine learning approach emphasizes peculiarities of each approach for auditory peripheral processing.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the 23rd International Congress on Acoustics |
Antal sider | 7 |
Forlag | Deutsche Gesellschaft für Akustik e.V. (DEGA) |
Publikationsdato | sep. 2019 |
Udgave | 2019 |
Sider | 7651-7658 |
ISBN (Trykt) | 978-3-939296-15-7 |
Status | Udgivet - sep. 2019 |
Begivenhed | 23rd International Congress on Acoustics - ICA 2019 - Aachen, Tyskland Varighed: 9 sep. 2019 → 13 sep. 2019 Konferencens nummer: 23 |
Konference
Konference | 23rd International Congress on Acoustics - ICA 2019 |
---|---|
Nummer | 23 |
Land/Område | Tyskland |
By | Aachen |
Periode | 09/09/2019 → 13/09/2019 |
Navn | International Congress on Acoustics - Proceedings |
---|---|
ISSN | 2226-7808 |