Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios

Roberto Barumerli; Andrea Almenari; Michele Geronazzo; Giorgio Maria Di Nunzio; Federico Avanzini

Publication: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Abstract

This paper aims to compare and reproduce the predictions of two publicly available computational auditory models for speaker localization in different simulated environments. The direction-of-arrival (DOA) of sound sources in the horizontal plane can be extracted by using binaural spatial cues from room and user acoustics. Since our predictions consider the specificity of both models at the level of peripheral processing, the proposed solution for DOA extraction also provides a common multi-conditional training for the Gaussian Mixture Model (GMM) approach. A set of acoustic simulations of adverse conditions (i.e., multiple speakers or highly reverberant scenarios) supports the evaluation of the robustness of the synthetic auditory process. Our analysis reproduces two case studies from the scientific literature in order to investigate the reliability of localization predictions in the frontal horizontal plane. Finally, a newly defined acoustic scenario makes it possible to identify differences between the auditory models' outcomes in the entire horizontal plane. The results show good agreement with previous literature, and our machine learning approach highlights the peculiarities of each approach to auditory peripheral processing.

Original language	English
Title	Proceedings of the 23rd International Congress on Acoustics
Number of pages	7
Publisher	Deutsche Gesellschaft für Akustik e.V. (DEGA)
Publication date	Sep 2019
Edition	2019
Pages	7651-7658
ISBN (Print)	978-3-939296-15-7
Status	Published - Sep 2019
Event	23rd International Congress on Acoustics - ICA 2019 - Aachen, Germany. Duration: 9 Sep 2019 → 13 Sep 2019. Conference number: 23

Conference

Conference	23rd International Congress on Acoustics - ICA 2019
Number	23
Country/Territory	Germany
City	Aachen
Period	09/09/2019 → 13/09/2019

Name	International Congress on Acoustics - Proceedings
ISSN	2226-7808

Access to document

Open Access article: publisher's published version, 391 KB. License: CC BY-NC-SA 4.0

http://pub.dega-akustik.de/ICA2019/data/articles/001138.pdf (License: CC BY-NC-SA 4.0)

Other files and links

Link to program

Citation formats

Barumerli, R., Almenari, A., Geronazzo, M., Di Nunzio, G. M., & Avanzini, F. (2019). Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios. In Proceedings of the 23rd International Congress on Acoustics (2019 ed., pp. 7651-7658). Deutsche Gesellschaft für Akustik e.V. (DEGA). http://pub.dega-akustik.de/ICA2019/data/articles/001138.pdf

@inproceedings{1bb563bd87c04e08877e0a6eaab064b7,

title = "Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios",

abstract = "This paper aims to compare and reproduce the predictions of two publicly available computational auditory models for speaker localization in different simulated environments. The direction-of-arrival (DOA) of sound sources in the horizontal plane can be extracted by using binaural spatial cues from room and user acoustics. Since our predictions consider the specificity of both models at the level of peripheral processing, the proposed solution for DOA extraction also provides a common multi-conditional training for the Gaussian Mixture Model (GMM) approach. A set of acoustic simulations of adverse conditions (i.e., multiple speakers or highly reverberant scenarios) supports the evaluation of the robustness of the synthetic auditory process. Our analysis reproduces two case studies from the scientific literature in order to investigate the reliability of localization predictions in the frontal horizontal plane. Finally, a newly defined acoustic scenario makes it possible to identify differences between the auditory models' outcomes in the entire horizontal plane. The results show good agreement with previous literature, and our machine learning approach highlights the peculiarities of each approach to auditory peripheral processing.",

author = "Roberto Barumerli and Andrea Almenari and Michele Geronazzo and {Di Nunzio}, {Giorgio Maria} and Federico Avanzini",

year = "2019",

month = sep,

language = "English",

isbn = "978-3-939296-15-7",

series = "International Congress on Acoustics - Proceedings",

pages = "7651--7658",

booktitle = "Proceedings of the 23rd International Congress on Acoustics",

publisher = "Deutsche Gesellschaft f{\"u}r Akustik e.V. (DEGA)",

edition = "2019",

note = "23rd International Congress on Acoustics - ICA 2019; Conference date: 09-09-2019 Through 13-09-2019",

}

Barumerli, R, Almenari, A, Geronazzo, M, Di Nunzio, GM & Avanzini, F 2019, Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios. in Proceedings of the 23rd International Congress on Acoustics. 2019 edn, Deutsche Gesellschaft für Akustik e.V. (DEGA), International Congress on Acoustics - Proceedings, pp. 7651-7658, 23rd International Congress on Acoustics - ICA 2019, Aachen, Germany, 09/09/2019. <http://pub.dega-akustik.de/ICA2019/data/articles/001138.pdf>

Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios. / Barumerli, Roberto; Almenari, Andrea; Geronazzo, Michele et al.
Proceedings of the 23rd International Congress on Acoustics. 2019 ed. Deutsche Gesellschaft für Akustik e.V. (DEGA), 2019. pp. 7651-7658 (International Congress on Acoustics - Proceedings).

TY - GEN

T1 - Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios

AU - Barumerli, Roberto

AU - Almenari, Andrea

AU - Geronazzo, Michele

AU - Di Nunzio, Giorgio Maria

AU - Avanzini, Federico

N1 - Conference code: 23

PY - 2019/9

Y1 - 2019/9

AB - This paper aims to compare and reproduce the predictions of two publicly available computational auditory models for speaker localization in different simulated environments. The direction-of-arrival (DOA) of sound sources in the horizontal plane can be extracted by using binaural spatial cues from room and user acoustics. Since our predictions consider the specificity of both models at the level of peripheral processing, the proposed solution for DOA extraction also provides a common multi-conditional training for the Gaussian Mixture Model (GMM) approach. A set of acoustic simulations of adverse conditions (i.e., multiple speakers or highly reverberant scenarios) supports the evaluation of the robustness of the synthetic auditory process. Our analysis reproduces two case studies from the scientific literature in order to investigate the reliability of localization predictions in the frontal horizontal plane. Finally, a newly defined acoustic scenario makes it possible to identify differences between the auditory models' outcomes in the entire horizontal plane. The results show good agreement with previous literature, and our machine learning approach highlights the peculiarities of each approach to auditory peripheral processing.

UR - http://www.ica2019.org/fileadmin/ica2019.org/program/ICA19_program_web2.pdf

M3 - Article in proceeding

SN - 978-3-939296-15-7

T3 - International Congress on Acoustics - Proceedings

SP - 7651

EP - 7658

BT - Proceedings of the 23rd International Congress on Acoustics

PB - Deutsche Gesellschaft für Akustik e.V. (DEGA)

T2 - 23rd International Congress on Acoustics - ICA 2019

Y2 - 9 September 2019 through 13 September 2019

ER -
