A Binaural Short Time Objective Intelligibility Measure for Noisy and Enhanced Speech

Asger Heidemann Andersen, Jan Mark De Haan, Zheng Hua Tan, Jesper Jensen

Research output: Contribution to book/anthology/report/conference proceeding, Article in proceeding, Research, peer-review

12 Citations (Scopus)

Abstract

Objective intelligibility measures are increasingly being used to assess the performance of speech processing algorithms, e.g. for hearing aids. It has been shown that the short time objective intelligibility (STOI) measure yields good results in this respect. In this paper we propose a binaural extension of the STOI measure, which predicts binaural advantage using a modified equalization cancellation (EC) stage. The proposed method is evaluated for a range of acoustic conditions. Firstly, the method is able to predict the advantage of spatial separation between a speech target and a speech shaped noise (SSN) interferer. Secondly, the method yields results comparable to the monaural STOI measure when presented with noisy speech processed by ideal time-frequency segregation (ITFS). Finally, the method also performs well when presented with a selection of different acoustic conditions combined with beamforming as used in hearing aids.
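To make the equalization cancellation (EC) idea mentioned in the abstract concrete, here is a minimal toy sketch. It is not the paper's modified EC stage. It assumes a frontal target that is identical at both ears and a lateral interferer that reaches the right ear delayed and attenuated. The signals, the helper names (`ec_combine`, `power`), the parameter grid, and the choice of picking parameters by minimizing residual interferer power are all assumptions made for illustration.

```python
import math

def ec_combine(left, right, delay, gain):
    """Equalize the left channel (delay by `delay` samples, scale by `gain`)
    and subtract it from the right channel (the cancellation step)."""
    out = []
    for n in range(len(right)):
        m = n - delay
        l = left[m] if 0 <= m < len(left) else 0.0  # zero-pad outside the signal
        out.append(right[n] - gain * l)
    return out

def power(x):
    """Mean squared value of a sequence."""
    return sum(v * v for v in x) / len(x)

# Hypothetical toy scene (not data from the paper).
N = 400
target = [math.sin(2 * math.pi * 0.05 * n) for n in range(N)]
noise = [math.sin(2 * math.pi * 0.13 * n) + 0.5 * math.sin(2 * math.pi * 0.31 * n)
         for n in range(N)]

TRUE_DELAY, TRUE_GAIN = 3, 0.8  # assumed interaural delay (samples) and gain
noise_left = noise
noise_right = [TRUE_GAIN * noise[n - TRUE_DELAY] if n >= TRUE_DELAY else 0.0
               for n in range(N)]
target_left = target
target_right = target  # frontal target: same signal at both ears

# EC is linear, so it can be applied to the interferer and target separately.
# Search a small grid for the parameters that leave the least interferer power.
best = None
for delay in range(-5, 6):
    for gain in (0.6, 0.7, 0.8, 0.9, 1.0):
        residual = power(ec_combine(noise_left, noise_right, delay, gain))
        if best is None or residual < best[0]:
            best = (residual, delay, gain)

best_residual, best_delay, best_gain = best
target_out = ec_combine(target_left, target_right, best_delay, best_gain)

print("selected delay and gain:", best_delay, best_gain)
print("interferer power before and after EC:", power(noise_right), best_residual)
print("target power after EC:", power(target_out))
```

In this idealized scene the interferer cancels completely while part of the target survives, which is the kind of binaural advantage from spatial separation that the abstract refers to. A real EC model also has to cope with imperfect cancellation and must choose its parameters per frequency band, which this sketch does not attempt.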
Original language: English
Title of host publication: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume: 2015-January
Publisher: International Speech and Communication Association
Publication date: Sept 2015
Pages: 2563-2567
Publication status: Published - Sept 2015
Event: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
Duration: 6 Sept 2015 to 10 Sept 2015

Conference

Conference: 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015
Country/Territory: Germany
City: Dresden
Period: 06/09/2015 to 10/09/2015
Sponsor: Alibaba Group, Amazon, et al., Facebook, Google, Telekom Innovation Laboratories
Series: INTERSPEECH
ISSN: 1990-9770
