Enhancement of Non-Stationary Speech using Harmonic Chirp Filters

Sidsel Marie Nørholm; Jesper Rindom Jensen; Mads Græsbøll Christensen

Publication: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

4 Citations (Scopus)

262 Downloads (Pure)

Abstract

This paper addresses single-channel enhancement of non-stationary voiced speech. The non-stationarity of speech is well known, but state-of-the-art speech enhancement methods assume stationarity within frames of 20–30 ms. We derive optimal distortionless filters that take the non-stationary nature of voiced speech into account via linear constraints. This is facilitated by imposing a harmonic chirp model on the speech signal. As an implicit part of the filter design, the noise statistics are also estimated from the observed signal and the parameters of the harmonic chirp model. Simulations on real speech show that the chirp-based filters perform better than their harmonic counterparts. Further, the gain from using the chirp model increases when the estimated chirp parameter is large, corresponding to periods in the signal where the instantaneous fundamental frequency changes rapidly.
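The abstract does not reproduce the model or filter equations, so the following is only an illustrative sketch under stated assumptions, not the authors' implementation. It assumes the common harmonic chirp parameterisation in which harmonic l has phase l(ω0·m + ½·k·m²), a frame-based filter that estimates the first sample of the frame, and white noise (covariance R = I); the paper instead estimates the noise statistics from the observed signal. The names `chirp_matrix` and `distortionless_filter` are hypothetical.

```python
import numpy as np

def chirp_matrix(omega0, k, L, M):
    """Assumed harmonic chirp basis: M samples, harmonics +-1..+-L.

    omega0 is the fundamental frequency (rad/sample) at the start of the
    frame and k is the chirp rate (rad/sample^2). k = 0 gives the ordinary
    stationary harmonic model.
    """
    m = np.arange(M)
    phase = omega0 * m + 0.5 * k * m**2            # phase of the fundamental
    Z_pos = np.exp(1j * np.outer(phase, np.arange(1, L + 1)))
    return np.hstack([Z_pos, Z_pos.conj()])        # shape (M, 2L)

def distortionless_filter(R, Z):
    """Minimise h^H R h subject to the linear constraints Z^H h = 1.

    Closed form: h = R^{-1} Z (Z^H R^{-1} Z)^{-1} 1.
    """
    Rinv_Z = np.linalg.solve(R, Z)
    gram = Z.conj().T @ Rinv_Z
    return Rinv_Z @ np.linalg.solve(gram, np.ones(Z.shape[1]))

# Hypothetical frame: 60 samples, 3 harmonics, rising pitch.
M, L, omega0, k = 60, 3, 0.2, 5e-4
Z = chirp_matrix(omega0, k, L, M)
amps = np.array([1.0, 0.5, 0.25, 1.0, 0.5, 0.25])  # conjugate-symmetric amplitudes
clean = (Z @ amps).real                            # noise-free chirped voiced frame

R = np.eye(M)                                      # assumption: white noise
h_chirp = distortionless_filter(R, Z)
h_harm = distortionless_filter(R, chirp_matrix(omega0, 0.0, L, M))

# Error in reproducing the first sample of the clean frame.
err_chirp = abs(np.vdot(h_chirp, clean).real - clean[0])
err_harm = abs(np.vdot(h_harm, clean).real - clean[0])
print(f"chirp filter error: {err_chirp:.2e}, harmonic filter error: {err_harm:.2e}")
```

Because the constraints force the filter to pass every chirped harmonic with unit gain, the chirp filter reproduces the noise-free frame sample up to rounding, whereas a filter built with k = 0 distorts the same signal. Swapping `R` for an estimated noise covariance leaves the constraint handling unchanged.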

Original language	English
Title of host publication	INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015
Publisher	International Speech Communication Association
Publication date	2015
Pages	1755-1759
Status	Published - 2015
Event	INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association - Dresden, Germany. Duration: 6 Sep 2015 → 10 Sep 2015

Conference

Conference	INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association
Country/Territory	Germany
City	Dresden
Period	6 Sep 2015 → 10 Sep 2015

Name	INTERSPEECH
ISSN	1990-9770

Access to document

interspeech_chirp_submitted090615

http://www.isca-speech.org/archive/interspeech_2015/papers/i15_1755.pdf

Citation formats

Nørholm, S. M., Jensen, J. R., & Christensen, M. G. (2015). Enhancement of Non-Stationary Speech using Harmonic Chirp Filters. In INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015 (pp. 1755-1759). International Speech Communication Association. http://www.isca-speech.org/archive/interspeech_2015/papers/i15_1755.pdf

@inproceedings{65af72cc3bd34138bf38749950a568ee,

title = "Enhancement of Non-Stationary Speech using Harmonic Chirp Filters",

abstract = "This paper addresses single-channel enhancement of non-stationary voiced speech. The non-stationarity of speech is well known, but state-of-the-art speech enhancement methods assume stationarity within frames of 20–30 ms. We derive optimal distortionless filters that take the non-stationary nature of voiced speech into account via linear constraints. This is facilitated by imposing a harmonic chirp model on the speech signal. As an implicit part of the filter design, the noise statistics are also estimated from the observed signal and the parameters of the harmonic chirp model. Simulations on real speech show that the chirp-based filters perform better than their harmonic counterparts. Further, the gain from using the chirp model increases when the estimated chirp parameter is large, corresponding to periods in the signal where the instantaneous fundamental frequency changes rapidly.",

keywords = "speech enhancement, single-channel, non-stationary signals, harmonic chirp model",

author = "N{\o}rholm, {Sidsel Marie} and Jensen, {Jesper Rindom} and Christensen, {Mads Gr{\ae}sb{\o}ll}",

year = "2015",

language = "English",

series = "INTERSPEECH",

publisher = "International Speech Communication Association",

pages = "1755--1759",

booktitle = "INTERSPEECH 2015",

note = "INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association ; Conference date: 06-09-2015 Through 10-09-2015",

}

Nørholm, SM, Jensen, JR & Christensen, MG 2015, Enhancement of Non-Stationary Speech using Harmonic Chirp Filters. in INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. International Speech Communication Association, INTERSPEECH, pp. 1755-1759, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, 6 September 2015. <http://www.isca-speech.org/archive/interspeech_2015/papers/i15_1755.pdf>

Enhancement of Non-Stationary Speech using Harmonic Chirp Filters. / Nørholm, Sidsel Marie; Jensen, Jesper Rindom; Christensen, Mads Græsbøll.
INTERSPEECH 2015: 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. International Speech Communication Association, 2015. pp. 1755-1759 (INTERSPEECH).

TY - GEN

T1 - Enhancement of Non-Stationary Speech using Harmonic Chirp Filters

AU - Nørholm, Sidsel Marie

AU - Jensen, Jesper Rindom

AU - Christensen, Mads Græsbøll

PY - 2015

Y1 - 2015

N2 - This paper addresses single-channel enhancement of non-stationary voiced speech. The non-stationarity of speech is well known, but state-of-the-art speech enhancement methods assume stationarity within frames of 20–30 ms. We derive optimal distortionless filters that take the non-stationary nature of voiced speech into account via linear constraints. This is facilitated by imposing a harmonic chirp model on the speech signal. As an implicit part of the filter design, the noise statistics are also estimated from the observed signal and the parameters of the harmonic chirp model. Simulations on real speech show that the chirp-based filters perform better than their harmonic counterparts. Further, the gain from using the chirp model increases when the estimated chirp parameter is large, corresponding to periods in the signal where the instantaneous fundamental frequency changes rapidly.


KW - speech enhancement, single-channel, non-stationary signals, harmonic chirp model

UR - http://interspeech2015.org//wp-content/uploads/direct/INTERSPEECH_2015_AbstractBook.pdf

UR - http://www.isca-speech.org/archive/interspeech_2015/

M3 - Article in proceeding

T3 - INTERSPEECH

SP - 1755

EP - 1759

BT - INTERSPEECH 2015

PB - International Speech Communication Association

T2 - INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association

Y2 - 6 September 2015 through 10 September 2015

ER -
