Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.
Close

Details

In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.
Original languageEnglish
Title of host publicationIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017
Number of pages5
PublisherIEEE
Publication date19 Jun 2017
Pages731-735
ISBN (Electronic)978-1-5090-4117-6
DOI
StatePublished - 19 Jun 2017
Publication categoryResearch
Peer-reviewedYes
EventThe 42nd IEEE International Conference on Acoustics, Speech and Signal Processing: The Internet of Signals - New Orleans, United States
Duration: 5 Mar 20179 Mar 2017
http://www.ieee-icassp2017.org/
http://www.ieee-icassp2017.org/

Conference

ConferenceThe 42nd IEEE International Conference on Acoustics, Speech and Signal Processing
LandUnited States
ByNew Orleans
Periode05/03/201709/03/2017
Internetadresse
SeriesI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
ISSN1520-6149

    Research areas

  • Pole-zero model, least 1-norm cost function , sparse deconvolution, speech analysis

Map

ID: 270664016