Variable Frame Rate and Length Analysis for Data Compression in Distributed Speech Recognition

Ivan Kraljevski, Zheng-Hua Tan

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Abstract

This paper addresses the issue of data compression in distributed speech recognition on the basis of a variable frame rate and length analysis method. The method first conducts frame selection by using a posteriori signal-to-noise ratio weighted energy distance to find the right time resolution at the signal level, and then increases the length of the selected frame according to the number of non-selected preceding frames to find the right time-frequency resolution at the frame level. It produces high frame rate and small frame length in rapidly changing regions and low frame rate and large frame length for steady regions. The method is applied to scalable source coding in distributed speech recognition where the target bitrate is met by adjusting the frame rate. Speech recognition results show that the proposed approach outperforms other compression methods in terms of recognition accuracy for noisy speech while achieving higher compression rates.
OriginalsprogEngelsk
TitelNetwork Infrastructure and Digital Content (IC-NIDC), 2014 4th IEEE International Conference on
Antal sider5
ForlagIEEE Press
Publikationsdatosep. 2014
Sider453-457
ISBN (Trykt)978-1-4799-4736-2
ISBN (Elektronisk)978‐1‐4799‐5624‐1, 978-1-4799-4734-8
DOI
StatusUdgivet - sep. 2014
BegivenhedThe 4th IEEE International Conference on Network Infrastructure and Digital Content - Beijing, Kina
Varighed: 19 sep. 201421 sep. 2014

Konference

KonferenceThe 4th IEEE International Conference on Network Infrastructure and Digital Content
Land/OmrådeKina
ByBeijing
Periode19/09/201421/09/2014
NavnIEEE International Conference Network Infrastructure and Digital Content proceedings

Fingeraftryk

Dyk ned i forskningsemnerne om 'Variable Frame Rate and Length Analysis for Data Compression in Distributed Speech Recognition'. Sammen danner de et unikt fingeraftryk.

Citationsformater