Kalman filter for speech enhancement in cocktail party scenarios using a codebook-based approach

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

9 Citationer (Scopus)

Resumé

Enhancement of speech in non-stationary background noise is a challenging task, and conventional single channel speech enhancement algorithms have not been able to improve the speech intelligibility in such scenarios. The work proposed in this paper investigates a single channel Kalman filter based speech enhancement algorithm, whose parameters are estimated using a codebook based approach. The results indicate that the enhancement algorithm is able to improve the speech intelligibility and quality according to objective measures. Moreover, we investigate the effects of utilizing a speaker specific trained codebook over a generic speech codebook in relation to the performance of the speech enhancement system.

Konference

KonferenceThe 41st IEEE International Conference on Acoustics, Speech and Signal Processing
LandKina
ByShanghai
Periode20/03/201625/03/2016
Internetadresse

Fingerprint

Speech enhancement
Kalman filters
Speech intelligibility

Citer dette

@inproceedings{dfa20b1e2a9e44809efc0dd6d2f9cb90,
title = "Kalman filter for speech enhancement in cocktail party scenarios using a codebook-based approach",
abstract = "Enhancement of speech in non-stationary background noise is a challenging task, and conventional single channel speech enhancement algorithms have not been able to improve the speech intelligibility in such scenarios. The work proposed in this paper investigates a single channel Kalman filter based speech enhancement algorithm, whose parameters are estimated using a codebook based approach. The results indicate that the enhancement algorithm is able to improve the speech intelligibility and quality according to objective measures. Moreover, we investigate the effects of utilizing a speaker specific trained codebook over a generic speech codebook in relation to the performance of the speech enhancement system.",
author = "Kavalekalam, {Mathew Shaji} and Christensen, {Mads Gr{\ae}sb{\o}ll} and Fredrik Gran and Boldt, {Jesper B.}",
year = "2016",
month = "4",
doi = "10.1109/ICASSP.2016.7471663",
language = "English",
journal = "I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings",
issn = "1520-6149",
publisher = "IEEE Signal Processing Society",

}

TY - GEN

T1 - Kalman filter for speech enhancement in cocktail party scenarios using a codebook-based approach

AU - Kavalekalam, Mathew Shaji

AU - Christensen, Mads Græsbøll

AU - Gran, Fredrik

AU - Boldt, Jesper B.

PY - 2016/4

Y1 - 2016/4

N2 - Enhancement of speech in non-stationary background noise is a challenging task, and conventional single channel speech enhancement algorithms have not been able to improve the speech intelligibility in such scenarios. The work proposed in this paper investigates a single channel Kalman filter based speech enhancement algorithm, whose parameters are estimated using a codebook based approach. The results indicate that the enhancement algorithm is able to improve the speech intelligibility and quality according to objective measures. Moreover, we investigate the effects of utilizing a speaker specific trained codebook over a generic speech codebook in relation to the performance of the speech enhancement system.

AB - Enhancement of speech in non-stationary background noise is a challenging task, and conventional single channel speech enhancement algorithms have not been able to improve the speech intelligibility in such scenarios. The work proposed in this paper investigates a single channel Kalman filter based speech enhancement algorithm, whose parameters are estimated using a codebook based approach. The results indicate that the enhancement algorithm is able to improve the speech intelligibility and quality according to objective measures. Moreover, we investigate the effects of utilizing a speaker specific trained codebook over a generic speech codebook in relation to the performance of the speech enhancement system.

U2 - 10.1109/ICASSP.2016.7471663

DO - 10.1109/ICASSP.2016.7471663

M3 - Conference article in Journal

JO - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

JF - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

SN - 1520-6149

ER -