Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement

Sidsel Marie Nørholm, Martin Krawczyk-Becker, Timo Gerkmann, Steven van de Par, Jesper Rindom Jensen, Mads Græsbøll Christensen

Publication: Contribution to book/anthology/report/conference proceeding; Article in proceeding; Research; peer-reviewed

1 Citation (Scopus)
113 Downloads (Pure)

Abstract

In this paper, we consider single-channel speech enhancement in the short-time Fourier transform (STFT) domain. We propose improving an STFT phase estimate by estimating the initial phases. The method is based on the harmonic model and a model for the phase evolution over time. The initial phases are estimated by setting up a least squares problem between the noisy phase and the model for phase evolution. Simulations on synthetic and speech signals show a decreased phase error when an estimate of the initial phase is included, compared to using the noisy phase as an initialisation. The phase error is decreased at input SNRs from -10 to 10 dB. When the signal is reconstructed using the clean amplitude, the mean squared error is decreased and the PESQ score is increased.
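
The least squares idea in the abstract can be illustrated with a small sketch. It is not the authors' formulation, which the abstract does not spell out. It assumes a single harmonic whose phase advances linearly from frame to frame, phi(l) = phi0 + omega * hop * l, and fits phi0 to the noisy phases in a circular least squares sense. The function name estimate_initial_phase and the values chosen for omega, hop, the number of frames, and the noise level are all hypothetical.

```python
import numpy as np

def estimate_initial_phase(noisy_phase, omega, hop):
    """Fit phi0 in the assumed model phi(l) = phi0 + omega * hop * l.

    Minimising sum_l |exp(j*noisy(l)) - exp(j*(phi0 + omega*hop*l))|^2
    over phi0 has a closed form: the angle of the summed unit phasors
    of the residual phase (noisy phase minus the model's phase advance).
    """
    noisy_phase = np.asarray(noisy_phase, dtype=float)
    frames = np.arange(len(noisy_phase))
    model_advance = omega * hop * frames
    residual = np.exp(1j * (noisy_phase - model_advance))
    return np.angle(residual.sum())

# Synthetic check with assumed (hypothetical) parameters.
rng = np.random.default_rng(0)
omega = 2 * np.pi * 200 / 8000   # 200 Hz harmonic at 8 kHz sampling
hop = 64                         # frame shift in samples
true_phi0 = 0.7
frames = np.arange(50)

clean = true_phi0 + omega * hop * frames
# Perturb the clean phase and wrap it to (-pi, pi], as an observed phase would be.
noisy = np.angle(np.exp(1j * (clean + 0.3 * rng.standard_normal(frames.size))))

phi0_hat = estimate_initial_phase(noisy, omega, hop)
# Compare on the circle so that wrap-around does not inflate the error.
error = np.angle(np.exp(1j * (phi0_hat - true_phi0)))
print(f"estimated phi0 = {phi0_hat:.3f}, wrapped error = {error:.3f}")
```

Working with unit phasors rather than raw phase values avoids explicit phase unwrapping, which is why the fit is posed on the complex exponentials in this sketch.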
Original language: English
Title: INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015
Publisher: International Speech Communications Association
Publication date: 2015
Pages: 1750-1754
Status: Published - 2015
Event: INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association - Dresden, Germany
Duration: 6 Sep 2015 to 10 Sep 2015

Conference

Conference: INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association
Country: Germany
City: Dresden
Period: 06/09/2015 to 10/09/2015
Name: INTERSPEECH
ISSN: 1990-9770

Fingerprint

Speech enhancement
Fourier transforms

Cite this

Nørholm, S. M., Krawczyk-Becker, M., Gerkmann, T., van de Par, S., Jensen, J. R., & Christensen, M. G. (2015). Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement. In INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015 (pp. 1750-1754). International Speech Communications Association. INTERSPEECH
Nørholm, Sidsel Marie; Krawczyk-Becker, Martin; Gerkmann, Timo; van de Par, Steven; Jensen, Jesper Rindom; Christensen, Mads Græsbøll. / Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement. INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. International Speech Communications Association, 2015. pp. 1750-1754 (INTERSPEECH).
@inproceedings{16d3c2ea846d4411b270558d2edce806,
title = "Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement",
abstract = "In this paper, we consider single-channel speech enhancement in the short time Fourier transform (STFT) domain. We suggest to improve an STFT phase estimate by estimating the initial phases. The method is based on the harmonic model and a model for the phase evolution over time. The initial phases are estimated by setting up a least squares problem between the noisy phase and the model for phase evolution. Simulations on synthetic and speech signals show a decreased error on the phase when an estimate of the initial phase is included compared to using the noisy phase as an initialisation. The error on the phase is decreased at input SNRs from -10 to 10 dB. Reconstructing the signal using the clean amplitude, the mean squared error is decreased and the PESQ score is increased.",
author = "N{\o}rholm, {Sidsel Marie} and Martin Krawczyk-Becker and Timo Gerkmann and {van de Par}, Steven and Jensen, {Jesper Rindom} and Christensen, {Mads Gr{\ae}sb{\o}ll}",
year = "2015",
language = "English",
pages = "1750--1754",
booktitle = "INTERSPEECH 2015",
publisher = "International Speech Communications Association",

}

Nørholm, SM, Krawczyk-Becker, M, Gerkmann, T, van de Par, S, Jensen, JR & Christensen, MG 2015, Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement. in INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. International Speech Communications Association, INTERSPEECH, pp. 1750-1754, Dresden, Germany, 06/09/2015.

Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement. / Nørholm, Sidsel Marie; Krawczyk-Becker, Martin; Gerkmann, Timo; van de Par, Steven; Jensen, Jesper Rindom; Christensen, Mads Græsbøll.

INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. International Speech Communications Association, 2015. pp. 1750-1754 (INTERSPEECH).

TY - GEN

T1 - Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement

AU - Nørholm, Sidsel Marie

AU - Krawczyk-Becker, Martin

AU - Gerkmann, Timo

AU - van de Par, Steven

AU - Jensen, Jesper Rindom

AU - Christensen, Mads Græsbøll

PY - 2015

Y1 - 2015

N2 - In this paper, we consider single-channel speech enhancement in the short time Fourier transform (STFT) domain. We suggest to improve an STFT phase estimate by estimating the initial phases. The method is based on the harmonic model and a model for the phase evolution over time. The initial phases are estimated by setting up a least squares problem between the noisy phase and the model for phase evolution. Simulations on synthetic and speech signals show a decreased error on the phase when an estimate of the initial phase is included compared to using the noisy phase as an initialisation. The error on the phase is decreased at input SNRs from -10 to 10 dB. Reconstructing the signal using the clean amplitude, the mean squared error is decreased and the PESQ score is increased.

UR - http://interspeech2015.org//wp-content/uploads/direct/INTERSPEECH_2015_AbstractBook.pdf

UR - http://www.isca-speech.org/archive/interspeech_2015/

M3 - Article in proceeding

SP - 1750

EP - 1754

BT - INTERSPEECH 2015

PB - International Speech Communications Association

ER -

Nørholm SM, Krawczyk-Becker M, Gerkmann T, van de Par S, Jensen JR, Christensen MG. Least Squares Estimate of the Initial Phases in STFT based Speech Enhancement. In INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. International Speech Communications Association. 2015. p. 1750-1754. (INTERSPEECH).