Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis

Research output: Contribution to journalConference article in Journal

Abstract

In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.
Close

Details

In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.
Original languageEnglish
JournalI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Pages (from-to)731-735
Number of pages5
ISSN1520-6149
DOI
StatePublished - 19 Jun 2017
Publication categoryResearch
Peer-reviewedYes
EventThe 42nd IEEE International Conference on Acoustics, Speech and Signal Processing - New Orleans, United States
Duration: 5 Mar 20179 Mar 2017
http://www.ieee-icassp2017.org/
http://www.ieee-icassp2017.org/

Conference

ConferenceThe 42nd IEEE International Conference on Acoustics, Speech and Signal Processing
CountryUnited States
CityNew Orleans
Period05/03/201709/03/2017
Internet address

    Research areas

  • Pole-zero model, least 1-norm cost function , sparse deconvolution, speech analysis

Map

ID: 245960883