In this contribution we focus on the problem of power spectral density (PSD) estimation from multiple microphone signals in reverberant and noisy environments. The PSD estimation method proposed in this paper is based on the maximum likelihood (ML) methodology. In particular, we derive a novel ML PSD estimation scheme that is suitable for sound scenes which besides speech and reverberation consist of an additional noise component whose second-order statistics are known. The proposed algorithm is shown to outperform an existing similar algorithm in terms of PSD estimation accuracy. Moreover, it is shown numerically that the mean squared estimation error achieved by the proposed method is near the limit set by the corresponding Cram´er-Rao lower bound. The speech dereverberation performance of a multi-channel Wiener filter (MWF) based on the proposed PSD estimators is measured using several instrumental measures and is shown to be higher than when the competing estimator is used. Moreover, we perform a speech intelligibility test where we demonstrate that both the proposed and the competing PSD estimators lead to similar intelligibility improvements.
|Journal||I E E E Transactions on Audio, Speech and Language Processing|
|Number of pages||14|
|Publication status||Published - 1 Sep 2016|
Kuklasinski, A., Doclo, S., Jensen, S. H., & Jensen, J. (2016). Maximum Likelihood PSD Estimation for Speech Enhancement in Reverberation and Noise. I E E E Transactions on Audio, Speech and Language Processing, 24(9), 1599-1612. https://doi.org/10.1109/TASLP.2016.2573591