Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis

Liming Shi; Jesper Rindom Jensen; Mads Græsbøll Christensen

doi:10.1109/ICASSP.2017.7952252

Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis

Liming Shi, Jesper Rindom Jensen, Mads Græsbøll Christensen

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

7 Citations (Scopus)

334 Downloads (Pure)

Abstract

In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.

Original language	English
Title of host publication	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017
Number of pages	5
Publisher	IEEE
Publication date	19 Jun 2017
Pages	731-735
ISBN (Electronic)	978-1-5090-4117-6
DOIs	https://doi.org/10.1109/ICASSP.2017.7952252
Publication status	Published - 19 Jun 2017
Event	The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing: The Internet of Signals - New Orleans, United States Duration: 5 Mar 2017 → 9 Mar 2017 http://www.ieee-icassp2017.org/ http://www.ieee-icassp2017.org/

Conference

Conference	The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing
Country/Territory	United States
City	New Orleans
Period	05/03/2017 → 09/03/2017
Internet address	http://www.ieee-icassp2017.org/ http://www.ieee-icassp2017.org/

Series	I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
ISSN	1520-6149

Keywords

Pole-zero model
least 1-norm cost function
sparse deconvolution
speech analysis

Access to Document

10.1109/ICASSP.2017.7952252

pdf_fileFinal published version, 279 KB

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@inproceedings{538fb7a44f354528a0964715be16242e,

title = "Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis",

abstract = "In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.",

keywords = "Pole-zero model, least 1-norm cost function , sparse deconvolution, speech analysis",

author = "Liming Shi and Jensen, {Jesper Rindom} and Christensen, {Mads Gr{\ae}sb{\o}ll}",

year = "2017",

month = jun,

day = "19",

doi = "10.1109/ICASSP.2017.7952252",

language = "English",

series = "I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings",

publisher = "IEEE",

pages = "731--735",

booktitle = "IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017",

address = "United States",

note = "The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing : The Internet of Signals, ICASSP2017 ; Conference date: 05-03-2017 Through 09-03-2017",

url = "http://www.ieee-icassp2017.org/, http://www.ieee-icassp2017.org/",

}

Shi, L, Jensen, JR & Christensen, MG 2017, Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis. in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. IEEE, I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings, pp. 731-735, The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing, New Orleans, United States, 05/03/2017. https://doi.org/10.1109/ICASSP.2017.7952252

Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis. / Shi, Liming; Jensen, Jesper Rindom ; Christensen, Mads Græsbøll.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. IEEE, 2017. p. 731-735 (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

TY - GEN

T1 - Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis

AU - Shi, Liming

AU - Jensen, Jesper Rindom

AU - Christensen, Mads Græsbøll

PY - 2017/6/19

Y1 - 2017/6/19

N2 - In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.

AB - In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.

KW - Pole-zero model

KW - least 1-norm cost function

KW - sparse deconvolution

KW - speech analysis

U2 - 10.1109/ICASSP.2017.7952252

DO - 10.1109/ICASSP.2017.7952252

M3 - Article in proceeding

T3 - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

SP - 731

EP - 735

BT - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017

PB - IEEE

T2 - The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing

Y2 - 5 March 2017 through 9 March 2017

ER -

Least 1-Norm Pole-Zero Modeling with Sparse Deconvolution for Speech Analysis

Abstract

Conference

Keywords

Access to Document

AUB Link

Fingerprint

Cite this