A Joint Audio-Visual Approach to Audio Localization

Research output: Contribution to journalConference article in JournalResearchpeer-review

Abstract

Localization of audio sources is an important research problem, e.g., to facilitate noise reduction. In the recent years, the problem has been tackled using distributed microphone arrays (DMA). A common approach is to apply direction-of-arrival (DOA) estimation on each array (denoted as nodes), and then map the DOA estimates to a location. In practice, however, the individual nodes contain few microphones, limiting the DOA estimation accuracy and, thereby, also the localization performance. We investigate a new approach, where range estimates are also obtained and utilized from each node, e.g., using time-of-flight cameras. Moreover, we propose an optimal method for weighting such DOA and range information for audio localization. Our experiments on both synthetic and real data show that there is a clear, potential advantage of using the joint audiovisual localization framework.
Close

Details

Localization of audio sources is an important research problem, e.g., to facilitate noise reduction. In the recent years, the problem has been tackled using distributed microphone arrays (DMA). A common approach is to apply direction-of-arrival (DOA) estimation on each array (denoted as nodes), and then map the DOA estimates to a location. In practice, however, the individual nodes contain few microphones, limiting the DOA estimation accuracy and, thereby, also the localization performance. We investigate a new approach, where range estimates are also obtained and utilized from each node, e.g., using time-of-flight cameras. Moreover, we propose an optimal method for weighting such DOA and range information for audio localization. Our experiments on both synthetic and real data show that there is a clear, potential advantage of using the joint audiovisual localization framework.
Original languageEnglish
JournalI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Pages (from-to)454-458
Number of pages5
ISSN1520-6149
DOI
StatePublished - 2015
Publication categoryResearch
Peer-reviewedYes
Event40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015 - Brisbane, Australia
Duration: 19 Apr 201524 Apr 2015
Conference number: 2015

Conference

Conference40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015
Number2015
CountryAustralia
CityBrisbane
Period19/04/201524/04/2015

    Research areas

  • localization, range, DOA, optimal weighting, distributed microphone arrays, time-of-flight camera

Download statistics

No data available
ID: 207890226