Dual-channel eKF-RTF Framework for Speech Enhancement with DNN-based Speech Presence Estimation

Juan M. Martín-Doñas*, Antonio Peinado, Ivan Lopez Espejo, Angel Gomez

*Corresponding author for this work

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

This paper presents a dual-channel speech enhancement framework that effectively integrates deep neural network (DNN) mask estimators. Our framework follows a beamforming-plus-postfiltering approach intended for noise reduction on dual-microphone smartphones. An extended Kalman filter is used for the estimation of the relative acoustic channel between microphones, while the noise estimation is performed using a speech presence probability estimator. We propose the use of a DNN estimator to improve the prediction of the speech presence probabilities without making any assumption about the statistics of the signals. We evaluate and compare different dual-channel features to improve the accuracy of this estimator, including the power and phase difference between the speech signals at the two microphones. The proposed integrated scheme is evaluated in different reverberant and noisy environments when the smartphone is used in both close- and far-talk positions. The experimental results show that our approach achieves significant improvements in terms of speech quality, intelligibility, and distortion when compared to other approaches based only on statistical signal processing.
Original languageEnglish
Title of host publicationIberSPEECH 2021
Number of pages5
PublisherISCA
Publication dateMar 2021
Pages31-35
DOIs
Publication statusPublished - Mar 2021
EventIberSPEECH 2020 - Valladolid, Spain
Duration: 24 Mar 202125 Dec 2021

Conference

ConferenceIberSPEECH 2020
Country/TerritorySpain
CityValladolid
Period24/03/202125/12/2021

Fingerprint

Dive into the research topics of 'Dual-channel eKF-RTF Framework for Speech Enhancement with DNN-based Speech Presence Estimation'. Together they form a unique fingerprint.

Cite this