Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions

Adam Kuklasinski; Simon Doclo; Jesper Jensen

doi:10.1109/ICASSP.2016.7471745

Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions

Adam Kuklasinski, Simon Doclo, Jesper Jensen

Institut for Elektroniske Systemer

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

6 Citationer (Scopus)

Abstract

We propose a novel Power Spectral Density (PSD) estimator for multi-microphone systems operating in reverberant and noisy conditions. The estimator is derived using the maximum likelihood approach and is based on a blocked and pre-whitened additive signal model. The intended application of the estimator is in speech enhancement algorithms, such as the Multi-channel Wiener Filter (MWF) and the Minimum Variance Distortionless Response
(MVDR) beamformer. We evaluate these two algorithms in a speech dereverberation task and compare the performance obtained using the proposed and a competing PSD estimator. Instrumental performance measures indicate an advantage of the proposed estimator over the competing one. In a speech intelligibility test all algorithms significantly improved the word intelligibility score. While the results suggest a minor advantage of using the proposed PSD estimator, the difference between algorithms was found to be statistically significant only in some of the experimental conditions.

Originalsprog	Engelsk
Titel	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016
Forlag	IEEE
Publikationsdato	25 mar. 2016
Sider	599 - 603
ISBN (Elektronisk)	978-1-4799-9988-0
DOI	https://doi.org/10.1109/ICASSP.2016.7471745
Status	Udgivet - 25 mar. 2016
Begivenhed	The 41st IEEE International Conference on Acoustics, Speech and Signal Processing - Shanghai, Kina Varighed: 20 mar. 2016 → 25 mar. 2016 http://www.icassp2016.org/

Konference

Konference	The 41st IEEE International Conference on Acoustics, Speech and Signal Processing
Land/Område	Kina
By	Shanghai
Periode	20/03/2016 → 25/03/2016
Internetadresse	http://www.icassp2016.org/

Adgang til dokumentet

10.1109/ICASSP.2016.7471745

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Citationsformater

@inproceedings{1c2ab6b5a89d46c5b04d10f5dfd8987a,

title = "Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions",

abstract = "We propose a novel Power Spectral Density (PSD) estimator for multi-microphone systems operating in reverberant and noisy conditions. The estimator is derived using the maximum likelihood approach and is based on a blocked and pre-whitened additive signal model. The intended application of the estimator is in speech enhancement algorithms, such as the Multi-channel Wiener Filter (MWF) and the Minimum Variance Distortionless Response(MVDR) beamformer. We evaluate these two algorithms in a speech dereverberation task and compare the performance obtained using the proposed and a competing PSD estimator. Instrumental performance measures indicate an advantage of the proposed estimator over the competing one. In a speech intelligibility test all algorithms significantly improved the word intelligibility score. While the results suggest a minor advantage of using the proposed PSD estimator, the difference between algorithms was found to be statistically significant only in some of the experimental conditions.",

author = "Adam Kuklasinski and Simon Doclo and Jesper Jensen",

year = "2016",

month = mar,

day = "25",

doi = "10.1109/ICASSP.2016.7471745",

language = "English",

pages = "599 -- 603",

booktitle = "IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016",

publisher = "IEEE",

address = "United States",

note = "The 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 ; Conference date: 20-03-2016 Through 25-03-2016",

url = "http://www.icassp2016.org/",

}

Kuklasinski, A, Doclo, S & Jensen, J 2016, Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions. i IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016. IEEE, s. 599 - 603, The 41st IEEE International Conference on Acoustics, Speech and Signal Processing, Shanghai, Kina, 20/03/2016. https://doi.org/10.1109/ICASSP.2016.7471745

Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions. / Kuklasinski, Adam; Doclo, Simon; Jensen, Jesper.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016. IEEE, 2016. s. 599 - 603.

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

TY - GEN

T1 - Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions

AU - Kuklasinski, Adam

AU - Doclo, Simon

AU - Jensen, Jesper

PY - 2016/3/25

Y1 - 2016/3/25

N2 - We propose a novel Power Spectral Density (PSD) estimator for multi-microphone systems operating in reverberant and noisy conditions. The estimator is derived using the maximum likelihood approach and is based on a blocked and pre-whitened additive signal model. The intended application of the estimator is in speech enhancement algorithms, such as the Multi-channel Wiener Filter (MWF) and the Minimum Variance Distortionless Response(MVDR) beamformer. We evaluate these two algorithms in a speech dereverberation task and compare the performance obtained using the proposed and a competing PSD estimator. Instrumental performance measures indicate an advantage of the proposed estimator over the competing one. In a speech intelligibility test all algorithms significantly improved the word intelligibility score. While the results suggest a minor advantage of using the proposed PSD estimator, the difference between algorithms was found to be statistically significant only in some of the experimental conditions.

AB - We propose a novel Power Spectral Density (PSD) estimator for multi-microphone systems operating in reverberant and noisy conditions. The estimator is derived using the maximum likelihood approach and is based on a blocked and pre-whitened additive signal model. The intended application of the estimator is in speech enhancement algorithms, such as the Multi-channel Wiener Filter (MWF) and the Minimum Variance Distortionless Response(MVDR) beamformer. We evaluate these two algorithms in a speech dereverberation task and compare the performance obtained using the proposed and a competing PSD estimator. Instrumental performance measures indicate an advantage of the proposed estimator over the competing one. In a speech intelligibility test all algorithms significantly improved the word intelligibility score. While the results suggest a minor advantage of using the proposed PSD estimator, the difference between algorithms was found to be statistically significant only in some of the experimental conditions.

U2 - 10.1109/ICASSP.2016.7471745

DO - 10.1109/ICASSP.2016.7471745

M3 - Article in proceeding

SP - 599

EP - 603

BT - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016

PB - IEEE

T2 - The 41st IEEE International Conference on Acoustics, Speech and Signal Processing

Y2 - 20 March 2016 through 25 March 2016

ER -

Maximum likelihood PSD estimation for speech enhancement in reverberant and noisy conditions

Abstract

Konference

Adgang til dokumentet

AUB Link

Fingeraftryk

Citationsformater