Speech Intelligibility Prediction Based on Mutual Information

Jesper Jensen, Cees H. Taal

Research output: Contribution to journalJournal articleResearchpeer-review

44 Citations (Scopus)
1423 Downloads (Pure)

Abstract

This paper deals with the problem of predicting the average intelligibility of noisy and potentially processed speech signals, as observed by a group of normal hearing listeners. We propose a model which performs this prediction based on the hypothesis that intelligibility is monotonically related to the mutual information between critical-band amplitude envelopes of the clean signal and the corresponding noisy/processed signal. The resulting intelligibility predictor turns out to be a simple function of the mean-square error (mse) that arises when estimating a clean critical-band amplitude using a minimum mean-square error (mmse) estimator based on the noisy/processed amplitude. The proposed model predicts that speech intelligibility cannot be improved by any processing of noisy critical-band amplitudes. Furthermore, the proposed intelligibility predictor performs well ( ρ > 0.95) in predicting the intelligibility of speech signals contaminated by additive noise and potentially non-linearly processed using time-frequency weighting.
Original languageEnglish
JournalI E E E Transactions on Audio, Speech and Language Processing
Volume22
Issue number2
Pages (from-to)430-440
ISSN1558-7916
DOIs
Publication statusPublished - Feb 2014

Fingerprint

Dive into the research topics of 'Speech Intelligibility Prediction Based on Mutual Information'. Together they form a unique fingerprint.

Cite this