SII-Based Speech Preprocessing for Intelligibility Improvement in Noise

Cees H. Taal, Jesper Jensen

Publication: Contribution to journal; Conference article in journal; Research; peer-reviewed

Abstract

A linear time-invariant filter is designed to improve speech understanding when the speech is played back in a noisy environment. To accomplish this, the speech intelligibility index (SII) is maximized under the constraint that the speech energy is held constant. A nonlinear approximation is used for the SII such that a closed-form solution exists to the constrained optimization problem. The resulting filter depends on the long-term average noise and speech spectra and on the global SNR and, in general, has a high-pass characteristic. In contrast to existing methods, the proposed filter sets certain frequency bands to zero when they no longer contribute to intelligibility. Experiments show large intelligibility improvements with the proposed method when used in stationary speech-shaped noise. However, the method does not perform well for speech corrupted by a competing speaker, because the SII is not a reliable intelligibility predictor for fluctuating noise sources. MATLAB code is provided.
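
The optimization described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' MATLAB code. The abstract does not state the form of the SII approximation, so the sketch assumes each band contributes importance * xi / (1 + xi), where xi is the band SNR after filtering. The function name `redistribute_band_energy` and the toy band values are hypothetical. Under that assumption, the Lagrangian conditions give each active band an energy of mu * sqrt(importance * noise) - noise, and any band for which this is not positive is set to zero.

```python
import numpy as np

def redistribute_band_energy(speech_energy, noise_energy, importance):
    """Return per-band amplitude gains that maximize
    sum(importance * e / (e + noise)) subject to sum(e) == sum(speech_energy),
    with e >= 0 the filtered speech energy per band.

    Assumes all speech and noise band energies are strictly positive and
    at least one importance weight is positive.
    """
    s = np.asarray(speech_energy, dtype=float)
    n = np.asarray(noise_energy, dtype=float)
    g = np.asarray(importance, dtype=float)
    total = s.sum()                       # energy budget (held constant)
    active = np.ones(s.shape, dtype=bool)
    while True:
        root = np.sqrt(g[active] * n[active])
        # Closed-form multiplier for the current set of active bands.
        mu = (total + n[active].sum()) / root.sum()
        e = np.zeros_like(s)
        e[active] = mu * root - n[active]
        if np.all(e[active] > 0):
            break
        # Bands that would need non-positive energy are switched off.
        active &= e > 0
    return np.sqrt(e / s)

# Toy example: three equally important bands, the third buried in noise.
speech = np.array([1.0, 1.0, 1.0])
noise = np.array([0.1, 0.1, 100.0])
weights = np.array([1.0, 1.0, 1.0]) / 3.0
gains = redistribute_band_energy(speech, noise, weights)
```

In this toy case the third band receives zero gain and its energy is moved to the two audible bands, while the total speech energy is unchanged. A real implementation would work on the long-term band spectra of actual speech and noise and use the band importance values from the SII standard.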
Original language: English
Journal: Proceedings of the International Conference on Spoken Language Processing
Pages (from-to): 3582-3586
Number of pages: 6
ISSN: 1990-9772
Status: Published - 2013
Event: Interspeech 2013 - Lyon, France
Duration: 25 Aug 2013 to 29 Aug 2013
http://www.interspeech2013.org/

Conference

Conference: Interspeech 2013
Country/Territory: France
City: Lyon
Period: 25/08/2013 to 29/08/2013