Primitive audio genre classification: An investigation of feature vector optimization

Mohammad A. Haque, Sangjin Cho, Jongmyon Kim*

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

3 Citations (Scopus)

Abstract

In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select me optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.

Original languageEnglish
JournalInformation
Volume15
Issue number5
Pages (from-to)1875-1887
Number of pages13
ISSN1343-4500
Publication statusPublished - 1 May 2012

Keywords

  • Audio classification
  • Fuzzy c-means algorithm
  • K-nearest neighbor
  • Multimedia retrieval

Fingerprint

Dive into the research topics of 'Primitive audio genre classification: An investigation of feature vector optimization'. Together they form a unique fingerprint.

Cite this