Abstract
In this paper, we present a speech analysis method based on sparse pole-zero modeling of speech. Instead of using the all-pole model to approximate the speech production filter, a pole-zero model is used for the combined effect of the vocal tract; radiation at the lips and the glottal pulse shape. Moreover, to consider the spiky excitation form of the pulse train during voiced speech, the modeling parame- ters and sparse residuals are estimated in an iterative fashion using a least 1-norm pole-zero with sparse deconvolution algorithm. Com- pared with the conventional two-stage least squares pole-zero, linear prediction and sparse linear prediction methods, experimental results show that the proposed speech analysis method has lower spectral distortion, higher reconstruction SNR and sparser residuals.
Originalsprog | Engelsk |
---|---|
Titel | IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 |
Antal sider | 5 |
Forlag | IEEE |
Publikationsdato | 19 jun. 2017 |
Sider | 731-735 |
ISBN (Elektronisk) | 978-1-5090-4117-6 |
DOI | |
Status | Udgivet - 19 jun. 2017 |
Begivenhed | The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing: The Internet of Signals - New Orleans, USA Varighed: 5 mar. 2017 → 9 mar. 2017 http://www.ieee-icassp2017.org/ http://www.ieee-icassp2017.org/ |
Konference
Konference | The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing |
---|---|
Land/Område | USA |
By | New Orleans |
Periode | 05/03/2017 → 09/03/2017 |
Internetadresse |
Navn | I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings |
---|---|
ISSN | 1520-6149 |