Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions

Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen

Research output: Contribution to journal › Journal article › Research › peer-review

32 Citations (Scopus)

Abstract

Speech intelligibility prediction methods have recently gained popularity in the speech processing community as supplements to time-consuming and costly listening experiments. Such methods can be used to objectively quantify and compare the advantage of different speech enhancement algorithms in a way that correlates well with actual speech intelligibility. One such method is the short-time objective intelligibility (STOI) measure. In a recent publication, we proposed a binaural version of the STOI measure, based on a modified version of the equalization-cancellation (EC) model. This measure was shown to retain many of the advantageous properties of the STOI measure, while at the same time being able to predict intelligibility correctly in conditions involving both binaural advantage and non-linear signal processing. The biggest prediction errors were found for conditions involving multiple spatially distributed interferers. In this paper, we report results for a new listening experiment including different mixtures of isotropic and point source noise. The experiment reveals that the binaural STOI measure tends to overestimate intelligibility in conditions with spatially distributed interferers at low signal-to-noise ratios (SNRs). This condition-dependent error can make it difficult to compare intelligibility across different acoustical conditions. We investigate the cause of this upward bias and propose a correction which alleviates the problem. The modified method is evaluated with five datasets of measured intelligibility, spanning a wide range of realistic acoustic conditions. Within the tested conditions, the modified method yields very accurate predictions and entirely alleviates the aforementioned tendency to overestimate intelligibility in conditions with spatially distributed interferers.

Original language: English
Journal: Speech Communication
Volume: 102
Pages (from-to): 1-13
Number of pages: 13
ISSN: 0167-6393
Publication status: Published - Sept 2018

Keywords

  • Binaural hearing
  • Speech enhancement
  • Speech intelligibility prediction
