Primitive audio genre classification: An investigation of feature vector optimization

Mohammad A. Haque; Sangjin Cho; Jongmyon Kim

Corresponding author: Jongmyon Kim

Research output: Contribution to journal › Journal article › Research › peer-review

3 Citations (Scopus)

Abstract

In this paper, we propose a content-based audio classification approach that classifies audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in the time, frequency, and coefficient domains and select the optimal feature vector by applying an analytical method to each feature. To ensure automatic classification by a fuzzy c-means (FCM) algorithm, we use a hybrid classification framework that combines FCM with the k-nearest neighbor (KNN) algorithm and apply it to the extracted, normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms existing state-of-the-art audio classification approaches by more than 10% in audio genre classification accuracy.
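The abstract does not say how FCM and KNN are combined, so the following is only a minimal sketch of one plausible arrangement, not the authors' method. It assumes that FCM clusters the normalized feature vectors without labels, and that KNN over a small labeled reference set then assigns a genre label to each cluster center. All function names, parameters (c, m, k), and the min-max normalization are illustrative assumptions.

```python
import numpy as np

def min_max_normalize(X):
    """Scale each feature column to [0, 1] (assumed normalization)."""
    lo, hi = X.min(axis=0), X.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # avoid division by zero
    return (X - lo) / span

def fuzzy_c_means(X, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Standard FCM: returns cluster centers and the membership matrix U."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)  # memberships of each sample sum to 1
    for _ in range(iters):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)  # guard against zero distance
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U

def knn_predict(train_X, train_y, query_X, k=3):
    """Majority vote among the k nearest labeled points (integer labels >= 0)."""
    dist = np.linalg.norm(query_X[:, None, :] - train_X[None, :, :], axis=2)
    nearest = np.argsort(dist, axis=1)[:, :k]
    return np.array([np.bincount(row).argmax() for row in train_y[nearest]])

def hybrid_classify(X, labeled_X, labeled_y, c, k=3):
    """Assumed hybrid: FCM finds clusters, KNN names each cluster center."""
    centers, U = fuzzy_c_means(X, c)
    center_labels = knn_predict(labeled_X, labeled_y, centers, k=k)
    # Each sample takes the label of its highest-membership cluster.
    return center_labels[U.argmax(axis=1)]

if __name__ == "__main__":
    # Synthetic two-class data standing in for extracted audio features.
    rng = np.random.default_rng(1)
    data = min_max_normalize(np.vstack([
        rng.normal(0.0, 0.3, size=(20, 2)),
        rng.normal(5.0, 0.3, size=(20, 2)),
    ]))
    reference_X = data[[0, 1, 2, 20, 21, 22]]
    reference_y = np.array([0, 0, 0, 1, 1, 1])
    print(hybrid_classify(data, reference_X, reference_y, c=2, k=3))
```

In this sketch the labeled reference set only has to be large enough to name the clusters, since the grouping itself comes from FCM. The features actually selected in the paper are not listed in the abstract and are not reproduced here.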

Original language	English
Journal	Information
Volume	15
Issue number	5
Pages (from-to)	1875-1887
Number of pages	13
ISSN	1343-4500
Publication status	Published - 1 May 2012

Keywords

Audio classification
Fuzzy c-means algorithm
K-nearest neighbor
Multimedia retrieval

Cite this

@article{4011ead2e47840539f4f7cc600f2a44d,

title = "Primitive audio genre classification: An investigation of feature vector optimization",

abstract = "In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.",

keywords = "Audio classification, Fuzzy c-means algorithm, K-nearest neighbor, Multimedia retrieval",

author = "Haque, {Mohammad A.} and Sangjin Cho and Jongmyon Kim",

year = "2012",

month = may,

day = "1",

language = "English",

volume = "15",

pages = "1875--1887",

journal = "Information",

issn = "1343-4500",

publisher = "International Information Institute",

number = "5",

}

TY - JOUR

T1 - Primitive audio genre classification

T2 - An investigation of feature vector optimization

AU - Haque, Mohammad A.

AU - Cho, Sangjin

AU - Kim, Jongmyon

PY - 2012/5/1

Y1 - 2012/5/1

N2 - In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.

AB - In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.

KW - Audio classification

KW - Fuzzy c-means algorithm

KW - K-nearest neighbor

KW - Multimedia retrieval

UR - http://www.scopus.com/inward/record.url?scp=84863206231&partnerID=8YFLogxK

M3 - Journal article

AN - SCOPUS:84863206231

SN - 1343-4500

VL - 15

SP - 1875

EP - 1887

JO - Information

JF - Information

IS - 5

ER -
