A Binaural Short Time Objective Intelligibility Measure for Noisy and Enhanced Speech

Asger Heidemann Andersen, Jan Mark De Haan, Zheng Hua Tan, Jesper Jensen

Publication: Contribution to book/anthology/report/conference proceeding; Conference article in proceeding; Research; Peer-reviewed

12 Citations (Scopus)

Abstract

Objective intelligibility measures are increasingly being used to assess the performance of speech processing algorithms, e.g. for hearing aids. It has been shown that the short time objective intelligibility (STOI) measure yields good results in this respect. In this paper we propose a binaural extension of the STOI measure, which predicts binaural advantage using a modified equalization cancellation (EC) stage. The proposed method is evaluated for a range of acoustic conditions. Firstly, the method is able to predict the advantage of spatial separation between a speech target and a speech shaped noise (SSN) interferer. Secondly, the method yields results comparable to the monaural STOI measure when presented with noisy speech processed by ideal time-frequency segregation (ITFS). Finally, the method also performs well when presented with a selection of different acoustic conditions combined with beamforming as used in hearing aids.
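
As a rough illustration of the kind of score the STOI family builds on: the monaural STOI measure compares short-time temporal envelopes of clean and degraded speech through a sample correlation coefficient, averaged over segments. The sketch below shows only that correlation step in plain Python. It is not the binaural method proposed in this paper: the EC stage, the frequency band decomposition, and the normalization and clipping of the degraded envelope are all omitted, and the function names `envelope_correlation` and `mean_segment_score` are hypothetical.

```python
import math


def envelope_correlation(clean, degraded):
    """Sample correlation coefficient between two equal-length envelope segments."""
    if len(clean) != len(degraded) or len(clean) < 2:
        raise ValueError("segments must have equal length of at least 2")
    n = len(clean)
    mean_c = sum(clean) / n
    mean_d = sum(degraded) / n
    # Remove the mean of each segment before correlating.
    dc = [c - mean_c for c in clean]
    dd = [d - mean_d for d in degraded]
    num = sum(a * b for a, b in zip(dc, dd))
    den = math.sqrt(sum(a * a for a in dc) * sum(b * b for b in dd))
    # A constant segment has no defined correlation; return 0.0 by convention here.
    return num / den if den > 0 else 0.0


def mean_segment_score(clean_segments, degraded_segments):
    """Average the per-segment correlations into a single score (assumed pooling)."""
    scores = [
        envelope_correlation(c, d)
        for c, d in zip(clean_segments, degraded_segments)
    ]
    if not scores:
        raise ValueError("at least one segment pair is required")
    return sum(scores) / len(scores)
```

A degraded envelope that is a scaled copy of the clean one gives a correlation of 1, while an inverted envelope gives -1. The inputs here are assumed to be envelope segments that have already been extracted; how they are obtained is outside the scope of this sketch.
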
Original language: English
Title: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume: 2015-January
Publisher: International Speech and Communication Association
Publication date: Sep. 2015
Pages: 2563-2567
Status: Published - Sep. 2015
Event: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
Duration: 6 Sep. 2015 to 10 Sep. 2015

Conference

Conference: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015
Country/Territory: Germany
City: Dresden
Period: 06/09/2015 to 10/09/2015
Sponsor: Alibaba Group, Amazon, et al., Facebook, Google, Telekom Innovation Laboratories
Name: INTERSPEECH
ISSN: 1990-9770
