Enhancement of Non-Stationary Speech using Harmonic Chirp Filters

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

In this paper, the issue of single channel speech enhancement of non-stationary voiced speech is addressed. The non-stationarity of speech is well known, but state of the art speech enhancement methods assume stationarity within frames of 20–30 ms. We derive optimal distortionless filters that take the non-stationarity nature of voiced speech into account via linear constraints. This is facilitated by imposing a harmonic chirp model on the speech signal. As an implicit part of the filter design, the noise statistics are also estimated based on the observed signal and parameters of the harmonic chirp model. Simulations on real speech show that the chirp based filters perform better than their harmonic counterparts. Further, it is seen that the gain of using the chirp model increases when the estimated chirp parameter is big corresponding to periods in the signal where the instantaneous fundamental frequency changes fast.
Close

Details

In this paper, the issue of single channel speech enhancement of non-stationary voiced speech is addressed. The non-stationarity of speech is well known, but state of the art speech enhancement methods assume stationarity within frames of 20–30 ms. We derive optimal distortionless filters that take the non-stationarity nature of voiced speech into account via linear constraints. This is facilitated by imposing a harmonic chirp model on the speech signal. As an implicit part of the filter design, the noise statistics are also estimated based on the observed signal and parameters of the harmonic chirp model. Simulations on real speech show that the chirp based filters perform better than their harmonic counterparts. Further, it is seen that the gain of using the chirp model increases when the estimated chirp parameter is big corresponding to periods in the signal where the instantaneous fundamental frequency changes fast.
Original languageEnglish
Title of host publicationINTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association Dresden, Germany September 6-10, 2015
PublisherInternational Speech Communications Association
Publication date2015
Pages1755-1759
StatePublished - 2015
Publication categoryResearch
Peer-reviewedYes
EventINTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association - Dresden, Germany
Duration: 6 Sep 201510 Sep 2015

Conference

ConferenceINTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association
LandGermany
ByDresden
Periode06/09/201510/09/2015
SeriesINTERSPEECH
ISSN1990-9770

    Research areas

  • speech enhancement, single-channel, non-stationary signals, harmonic chirp model

Download statistics

No data available
ID: 230658150