Abstract
This paper addresses the problem of predicting the average intelligibility of noisy and potentially processed speech signals, as observed by a group of normal-hearing listeners. We propose a prediction model based on the hypothesis that intelligibility is monotonically related to the amount of Shannon information that the critical-band amplitude envelopes of the noisy/processed signal convey about the corresponding clean signal envelopes. The resulting intelligibility predictor turns out to be a simple function of the correlation between noisy/processed and clean amplitude envelopes. The proposed predictor performs well (ρ > 0.95) in predicting the intelligibility of speech signals contaminated by additive noise and potentially non-linearly processed using time-frequency weighting.
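To illustrate how a correlation between envelopes can stand in for an information measure, the following is a minimal sketch and not the paper's predictor. It assumes the clean and noisy/processed envelopes of one critical band are jointly Gaussian, in which case the mutual information is -0.5 * log2(1 - r^2) bits, where r is their correlation coefficient. The function name `envelope_information`, the `eps` guard, and the synthetic envelopes are hypothetical; the abstract does not specify the exact mapping, the band decomposition, or any segmentation.

```python
import numpy as np


def envelope_information(clean_env, proc_env, eps=1e-12):
    """Return (r, bits) for two amplitude envelopes of one band.

    r is the sample correlation coefficient. bits is the mutual
    information implied by r under a joint-Gaussian ASSUMPTION,
    which is an illustration and not the paper's stated formula.
    """
    x = np.asarray(clean_env, dtype=float)
    y = np.asarray(proc_env, dtype=float)
    # Remove the means so the normalized inner product is a correlation.
    x = x - x.mean()
    y = y - y.mean()
    denom = np.sqrt(np.sum(x ** 2) * np.sum(y ** 2)) + eps
    r = float(np.sum(x * y) / denom)
    # Clamp r^2 below 1 so the logarithm stays finite.
    r2 = min(r * r, 1.0 - eps)
    bits = float(-0.5 * np.log2(1.0 - r2))
    return r, bits


# Synthetic stand-ins for envelopes (hypothetical data, not speech).
rng = np.random.default_rng(0)
clean = np.abs(rng.standard_normal(2000))
noise = rng.standard_normal(2000)

r_mild, bits_mild = envelope_information(clean, clean + 0.1 * noise)
r_heavy, bits_heavy = envelope_information(clean, clean + 2.0 * noise)
print(f"mild noise:  r={r_mild:.3f}, bits={bits_mild:.3f}")
print(f"heavy noise: r={r_heavy:.3f}, bits={bits_heavy:.3f}")
```

Under this assumption the information estimate increases monotonically with the magnitude of the correlation, which is consistent with the abstract's statement that the predictor reduces to a simple function of the envelope correlation.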
Original language | English |
---|---|
Title of host publication | 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013) : Speech in Life Sciences and Human Societies |
Number of pages | 5 |
Publisher | International Speech Communication Association |
Publication date | 2013 |
Pages | 1174-1178 |
ISBN (Print) | 978-1-62993-443-3 |
Publication status | Published - 2013 |
Event | Interspeech 2013, Lyon, France. Duration: 25 Aug 2013 → 29 Aug 2013. http://www.interspeech2013.org/ |
Conference
Conference | Interspeech 2013 |
---|---|
Country/Territory | France |
City | Lyon |
Period | 25/08/2013 → 29/08/2013 |
Internet address | http://www.interspeech2013.org/ |
Series | Proceedings of the International Conference on Spoken Language Processing |
---|---|
ISSN | 1990-9772 |