A Neural Network for Monaural Intrusive Speech Intelligibility Prediction

Mathias Pedersen, Asger Heidemann Andersen, Søren Holdt Jensen, Jesper Jensen

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

1 Citationer (Scopus)

Abstrakt

Monaural intrusive speech intelligibility prediction (SIP) methods aim to predict the speech intelligibility (SI) of a single-microphone noisy and/or processed speech signal using the underlying clean speech signal. In the present work, we propose a neural network for monaural intrusive SIP. The proposed network is trained on data from multiple listening tests to predict SI. In the interest of using the available listening test data as efficiently as possible and to facilitate SI prediction of short duration speech signals, training is based on a local-time intelligibility curve derived from the listening test data. The trained neural network is evaluated, in terms of rank order correlation, against the classical monaural intrusive predictors STOI and ESTOI. The network is found to perform the best overall with a Kendall's tau of 0.825 measured over long duration, i.e. speech signals up to several minutes in duration. For short-term prediction using short speech signals of 1-10 seconds the network also shows better performance and smaller prediction variance.

OriginalsprogEngelsk
TitelICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Antal sider5
ForlagIEEE
Publikationsdatomaj 2020
Sider336-340
Artikelnummer9052949
ISBN (Trykt)978-1-5090-6632-2
ISBN (Elektronisk)978-1-5090-6631-5
DOI
StatusUdgivet - maj 2020
BegivenhedICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Barcelona, Spanien
Varighed: 4 maj 20208 maj 2020

Konference

KonferenceICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
LandSpanien
ByBarcelona
Periode04/05/202008/05/2020
NavnInternational Conference on Acoustics Speech and Signal Processing (ICASSP)
ISSN1520-6149

Fingeraftryk

Dyk ned i forskningsemnerne om 'A Neural Network for Monaural Intrusive Speech Intelligibility Prediction'. Sammen danner de et unikt fingeraftryk.

Citationsformater