Abstract
In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.
Original language | English |
---|---|
Title of host publication | IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 19 Jun 2017 |
Pages | 731-735 |
ISBN (Electronic) | 978-1-5090-4117-6 |
DOIs | |
Publication status | Published - 19 Jun 2017 |
Event | The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing: The Internet of Signals - New Orleans, United States Duration: 5 Mar 2017 → 9 Mar 2017 http://www.ieee-icassp2017.org/ http://www.ieee-icassp2017.org/ |
Conference
Conference | The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing |
---|---|
Country/Territory | United States |
City | New Orleans |
Period | 05/03/2017 → 09/03/2017 |
Internet address |
Series | I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings |
---|---|
ISSN | 1520-6149 |
Keywords
- Pole-zero model
- least 1-norm cost function
- sparse deconvolution
- speech analysis