Abstract
An algorithm suitable for voice activity detection under reverberant conditions is proposed in this paper. Due to the use of far-filed microphones the proposed solution processes speech signals of highly-varying intensity and signal to noise ratio, that are contaminated with several echoes. The core of the system is a pair of Hidden Markov Models, that effectively model the speech presence and speech absence situations. To minimise mis-detections an adaptive threshold is used, while a hang-over scheme caters for the intra-frame correlation of speech signals. Experimental results conducted in a typical office room using a single far field microphone to support the analysis.
Originalsprog | Engelsk |
---|---|
Titel | DSP 2009: 16th International Conference on Digital Signal Processing, Proceedings |
Antal sider | 5 |
Forlag | IEEE Press |
Publikationsdato | 2009 |
Sider | 1-5 |
ISBN (Trykt) | 978-142443298-1, 978-1-4244-3297-4 |
DOI | |
Status | Udgivet - 2009 |
Begivenhed | DSP 2009: 16th International Conference on Digital Signal Processing - Santorini, Grækenland Varighed: 5 jul. 2009 → 7 jul. 2009 |
Konference
Konference | DSP 2009: 16th International Conference on Digital Signal Processing |
---|---|
Land/Område | Grækenland |
By | Santorini |
Periode | 05/07/2009 → 07/07/2009 |