Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions

Adam Kuklasinski, Simon Doclo, Jesper Jensen

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

6 Citations (Scopus)

Abstract

We propose a novel Power Spectral Density (PSD) estimator for multi-microphone systems operating in reverberant and noisy conditions. The estimator is derived using the maximum likelihood approach and is based on a blocked and pre-whitened additive signal model. The intended application of the estimator is in speech enhancement algorithms, such as the Multi-channel Wiener Filter (MWF) and the Minimum Variance Distortionless Response
(MVDR) beamformer. We evaluate these two algorithms in a speech dereverberation task and compare the performance obtained using the proposed and a competing PSD estimator. Instrumental performance measures indicate an advantage of the proposed estimator over the competing one. In a speech intelligibility test all algorithms significantly improved the word intelligibility score. While the results suggest a minor advantage of using the proposed PSD estimator, the difference between algorithms was found to be statistically significant only in some of the experimental conditions.
Original languageEnglish
Title of host publicationIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016
PublisherIEEE
Publication date25 Mar 2016
Pages599 - 603
ISBN (Electronic)978-1-4799-9988-0
DOIs
Publication statusPublished - 25 Mar 2016
EventThe 41st IEEE International Conference on Acoustics, Speech and Signal Processing - Shanghai, China
Duration: 20 Mar 201625 Mar 2016
http://www.icassp2016.org/

Conference

ConferenceThe 41st IEEE International Conference on Acoustics, Speech and Signal Processing
CountryChina
CityShanghai
Period20/03/201625/03/2016
Internet address

Fingerprint Dive into the research topics of 'Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions'. Together they form a unique fingerprint.

Cite this