A Joint Audio-Visual Approach to Audio Localization

Research output: Contribution to journalConference article in JournalResearchpeer-review

558 Downloads (Pure)

Abstract

Localization of audio sources is an important research problem, e.g., to facilitate noise reduction. In the recent years, the problem has been tackled using distributed microphone arrays (DMA). A common approach is to apply direction-of-arrival (DOA) estimation on each array (denoted as nodes), and then map the DOA estimates to a location. In practice, however, the individual nodes contain few microphones, limiting the DOA estimation accuracy and, thereby, also the localization performance. We investigate a new approach, where range estimates are also obtained and utilized from each node, e.g., using time-of-flight cameras. Moreover, we propose an optimal method for weighting such DOA and range information for audio localization. Our experiments on both synthetic and real data show that there is a clear, potential advantage of using the joint audiovisual localization framework.
Original languageEnglish
JournalI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Pages (from-to)454-458
Number of pages5
ISSN1520-6149
DOIs
Publication statusPublished - 2015
Event40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015 - Brisbane, Australia
Duration: 19 Apr 201524 Apr 2015
Conference number: 2015

Conference

Conference40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015
Number2015
Country/TerritoryAustralia
CityBrisbane
Period19/04/201524/04/2015

Keywords

  • localization
  • range
  • DOA
  • optimal weighting
  • distributed microphone arrays
  • time-of-flight camera

Fingerprint

Dive into the research topics of 'A Joint Audio-Visual Approach to Audio Localization'. Together they form a unique fingerprint.

Cite this