Cluster-Based Adaptation Using Density Forest for HMM Phone Recognition

Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

205 Downloads (Pure)

Abstract

The dissimilarity between the training and test data in speech recognition systems is known to have a considerable effect on the recognition accuracy. To solve this problem, we use density forest to cluster the data and use maximum a posteriori (MAP) method to build a cluster-based adapted Gaussian mixture models (GMMs) in HMM speech recognition. Specifically, a set of bagged versions of the training data for each state in the HMM is generated, and each of these versions is used to generate one GMM and one tree in the density forest. Thereafter, an acoustic model forest is built by replacing the data of each leaf (cluster) in each tree with the corresponding GMM adapted by the leaf data using the MAP method. The results show that the proposed approach achieves 3:8% (absolute) lower phone error rate compared with the standard HMM/GMM and 0:8% (absolute) lower PER compared with bagged HMM/GMM.
Original languageEnglish
Title of host publicationSignal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
PublisherIEEE Press
Publication date2014
Pages2065-2069
ISBN (Print)978-0-9928626-1-9
Publication statusPublished - 2014
Event22nd European Signal Processing Conference - Lisbon, Portugal
Duration: 1 Sept 20145 Sept 2014

Conference

Conference22nd European Signal Processing Conference
Country/TerritoryPortugal
CityLisbon
Period01/09/201405/09/2014
SeriesProceedings of the European Signal Processing Conference
ISSN2076-1465

Fingerprint

Dive into the research topics of 'Cluster-Based Adaptation Using Density Forest for HMM Phone Recognition'. Together they form a unique fingerprint.

Cite this