Text-Independent Speaker Identification Using the Histogram Transform Model

Zhanyu Ma, Hong Yu, Zheng-Hua Tan, Jun Guo

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

24 Citationer (Scopus)
322 Downloads (Pure)

Abstract

In this paper, we propose a novel probabilistic method for the task of text-independent speaker identification (SI). In order to capture the dynamic information during SI, we design a super-MFCCs features by cascading three neighboring Mel-frequency Cepstral coefficients (MFCCs) frames together. These super-MFCC vectors are utilized for probabilistic model training such that the speaker’s characteristics can be sufficiently captured. The probability density function (PDF) of the aforementioned super-MFCCs features is estimated by the recently proposed histogram transform (HT) method. To recedes the commonly occurred discontinuity problem in multivariate histograms computing, more training data are generated by the HT method. Using these generated data, a smooth PDF of the super-MFCCs vectors is obtained. Comparing with the typical PDF estimation methods, such as Gaussian mixture model, promising improvements have been obatined by employing the HT-based model in SI.
OriginalsprogEngelsk
Artikelnummer7803586
TidsskriftIEEE Access
Vol/bind4
Sider (fra-til)9733-9739
Antal sider6
ISSN2169-3536
DOI
StatusUdgivet - 2016

Fingeraftryk

Dyk ned i forskningsemnerne om 'Text-Independent Speaker Identification Using the Histogram Transform Model'. Sammen danner de et unikt fingeraftryk.

Citationsformater