Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

7 Citations (Scopus)
334 Downloads (Pure)

Abstract

In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.
Original languageEnglish
Title of host publicationIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017
Number of pages5
PublisherIEEE
Publication date19 Jun 2017
Pages731-735
ISBN (Electronic)978-1-5090-4117-6
DOIs
Publication statusPublished - 19 Jun 2017
EventThe 42nd IEEE International Conference on Acoustics, Speech and Signal Processing: The Internet of Signals - New Orleans, United States
Duration: 5 Mar 20179 Mar 2017
http://www.ieee-icassp2017.org/
http://www.ieee-icassp2017.org/

Conference

ConferenceThe 42nd IEEE International Conference on Acoustics, Speech and Signal Processing
Country/TerritoryUnited States
CityNew Orleans
Period05/03/201709/03/2017
Internet address
SeriesI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
ISSN1520-6149

Keywords

  • Pole-zero model
  • least 1-norm cost function
  • sparse deconvolution
  • speech analysis

Fingerprint

Dive into the research topics of 'Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis'. Together they form a unique fingerprint.

Cite this