Localization and Tracking of Speech - a Joint Audio-Visual Approach

Jensen, Jesper Rindom (Projektdeltager)

Beskrivelse

Several emerging applications operate on human speech. A few examples are smart homes, systems for automated camera steering, and surveillance systems. All of these requires that the position of the speaker in relation to an array of microphones is known, which is most often not the case in practice. It is therefore necessary to estimate the position of the speaker. However, this is problematic in practice, due to phenomena such as reverberation, background noise, wrong microphone array calibration, and interfering sources. These phenomena appears in almost every practical scenario, complicating or even precluding the estimation of the speaker location. This project therefore tackles the estimation problem in a novel way, where visual information about the speaker obtained using one or more cameras is used jointly with microphone recordings for speaker localization. This procedure is beneficial as the audio and visual information are complementary, as many of the aforementioned phenomena do not appear in camera recordings. The robust estimates obtained in this way, will help in improving the performance of the initially listed applications.

Status	Afsluttet
Effektiv start/slut dato	01/10/2013 → 30/09/2016

Emneord

localization
speech
audio
video
audio-visual
tracking

Links

http://www.create.aau.dk/audio/audiovisual/

7 Tidsskriftartikel
6 Konferenceartikel i proceeding
5 Konferenceartikel i tidsskrift
1 Bog
Mere
- 1 Review (oversigtsartikel)

Ad Hoc Microphone Array Beamforming Using the Primal-Dual Method of Multipliers
Tavakoli, V. M., Jensen, J. R., Heusdens, R., Benesty, J. & Christensen, M. G., 29 aug. 2016, Proceedings of the 2016 24th European Signal Processing Conference (EUSIPCO) . IEEE, s. 1088-1092 (Proceedings of the European Signal Processing Conference (EUSIPCO)).
Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

Åben adgang
Fil
10 Citationer (Scopus)

519 Downloads (Pure)
A Framework for Speech Enhancement with Ad Hoc Microphone Arrays
Tavakoli, V. M., Jensen, J. R., Christensen, M. G. & Benesty, J., 2 mar. 2016, I: I E E E Transactions on Audio, Speech and Language Processing. 24, 16, s. 1038-1051 14 s., 07423739.
Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

Fil
36 Citationer (Scopus)

141 Downloads (Pure)
A Partitioned Approach to Signal Separation with Microphone Ad Hoc Arrays
Tavakoli, V. M., Jensen, J. R., Benesty, J. & Christensen, M. G., 20 mar. 2016, I: I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings. s. 3221-3225 07472272.
Publikation: Bidrag til tidsskrift › Konferenceartikel i tidsskrift › Forskning › peer review

Fil
5 Citationer (Scopus)

58 Downloads (Pure)

Localization and Tracking of Speech - a Joint Audio-Visual Approach

Projektdetaljer

Beskrivelse

Emneord

Links

Fingerprint

Ad Hoc Microphone Array Beamforming Using the Primal-Dual Method of Multipliers

A Framework for Speech Enhancement with Ad Hoc Microphone Arrays

A Partitioned Approach to Signal Separation with Microphone Ad Hoc Arrays

Localization and Tracking of Speech - a Joint Audio-Visual Approach

Projektdetaljer

Beskrivelse

Emneord

Links

Fingerprint

Publikation

Ad Hoc Microphone Array Beamforming Using the Primal-Dual Method of Multipliers

A Framework for Speech Enhancement with Ad Hoc Microphone Arrays

A Partitioned Approach to Signal Separation with Microphone Ad Hoc Arrays