Localization and Tracking of Speech - a Joint Audio-Visual Approach

Jensen, Jesper Rindom (Project Participant)

Description

Several emerging applications operate on human speech, including smart homes, automated camera steering systems, and surveillance systems. All of these require that the position of the speaker relative to an array of microphones is known, which is most often not the case in practice. It is therefore necessary to estimate the position of the speaker. However, this is difficult in practice due to phenomena such as reverberation, background noise, microphone array miscalibration, and interfering sources. These phenomena appear in almost every practical scenario, complicating or even precluding the estimation of the speaker location. This project therefore tackles the estimation problem in a novel way: visual information about the speaker, obtained using one or more cameras, is used jointly with microphone recordings for speaker localization. This procedure is beneficial because the audio and visual information are complementary: many of the aforementioned phenomena do not appear in camera recordings. The robust estimates obtained in this way will help improve the performance of the applications listed above.

Status	Finished
Effective start/end date	01/10/2013 → 30/09/2016

Links

http://www.create.aau.dk/audio/audiovisual/

7 Journal article
6 Article in proceeding
5 Conference article in Journal
1 Book
1 Review article

Ad Hoc Microphone Array Beamforming Using the Primal-Dual Method of Multipliers
Tavakoli, V. M., Jensen, J. R., Heusdens, R., Benesty, J. & Christensen, M. G., 29 Aug 2016, Proceedings of the 2016 24th European Signal Processing Conference (EUSIPCO). IEEE, p. 1088-1092 (Proceedings of the European Signal Processing Conference (EUSIPCO)).
Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Open Access
10 Citations (Scopus)

519 Downloads (Pure)

A Framework for Speech Enhancement with Ad Hoc Microphone Arrays
Tavakoli, V. M., Jensen, J. R., Christensen, M. G. & Benesty, J., 2 Mar 2016, In: IEEE Transactions on Audio, Speech and Language Processing. 24, 16, p. 1038-1051 14 p., 07423739.
Research output: Contribution to journal › Journal article › Research › peer-review

36 Citations (Scopus)

141 Downloads (Pure)

A Partitioned Approach to Signal Separation with Microphone Ad Hoc Arrays
Tavakoli, V. M., Jensen, J. R., Benesty, J. & Christensen, M. G., 20 Mar 2016, In: IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings. p. 3221-3225 07472272.
Research output: Contribution to journal › Conference article in Journal › Research › peer-review

5 Citations (Scopus)

58 Downloads (Pure)