A Neural Network for Monaural Intrusive Speech Intelligibility Prediction

Mathias Pedersen, Asger Heidemann Andersen, Søren Holdt Jensen, Jesper Jensen

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

17 Citations (Scopus)

Abstract

Monaural intrusive speech intelligibility prediction (SIP) methods aim to predict the speech intelligibility (SI) of a single-microphone noisy and/or processed speech signal using the underlying clean speech signal. In the present work, we propose a neural network for monaural intrusive SIP. The proposed network is trained on data from multiple listening tests to predict SI. In the interest of using the available listening test data as efficiently as possible and to facilitate SI prediction of short duration speech signals, training is based on a local-time intelligibility curve derived from the listening test data. The trained neural network is evaluated, in terms of rank order correlation, against the classical monaural intrusive predictors STOI and ESTOI. The network is found to perform the best overall with a Kendall's tau of 0.825 measured over long duration, i.e. speech signals up to several minutes in duration. For short-term prediction using short speech signals of 1-10 seconds the network also shows better performance and smaller prediction variance.

Original languageEnglish
Title of host publicationICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Number of pages5
PublisherIEEE
Publication dateMay 2020
Pages336-340
Article number9052949
ISBN (Print)978-1-5090-6632-2
ISBN (Electronic)978-1-5090-6631-5
DOIs
Publication statusPublished - May 2020
EventICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Barcelona, Spain
Duration: 4 May 20208 May 2020

Conference

ConferenceICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country/TerritorySpain
CityBarcelona
Period04/05/202008/05/2020
SeriesInternational Conference on Acoustics Speech and Signal Processing (ICASSP)
ISSN1520-6149

Keywords

  • intelligibility
  • intrusive
  • monaural
  • prediction
  • speech

Fingerprint

Dive into the research topics of 'A Neural Network for Monaural Intrusive Speech Intelligibility Prediction'. Together they form a unique fingerprint.

Cite this