Primitive audio genre classification: An investigation of feature vector optimization

Mohammad A. Haque; Sangjin Cho; Jongmyon Kim

Primitive audio genre classification: An investigation of feature vector optimization

Mohammad A. Haque, Sangjin Cho, Jongmyon Kim^*

^*Kontaktforfatter

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

3 Citationer (Scopus)

Abstract

In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select me optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.

Originalsprog	Engelsk
Tidsskrift	Information
Vol/bind	15
Udgave nummer	5
Sider (fra-til)	1875-1887
Antal sider	13
ISSN	1343-4500
Status	Udgivet - 1 maj 2012

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

http://www.scopus.com/inward/record.url?scp=84863206231&partnerID=8YFLogxK

Citationsformater

@article{4011ead2e47840539f4f7cc600f2a44d,

title = "Primitive audio genre classification: An investigation of feature vector optimization",

abstract = "In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select me optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.",

keywords = "Audio classification, Fuzzy c-means algorithm, K-nearest neighbor, Multimedia retrieval",

author = "Haque, {Mohammad A.} and Sangjin Cho and Jongmyon Kim",

year = "2012",

month = may,

day = "1",

language = "English",

volume = "15",

pages = "1875--1887",

journal = "Information",

issn = "1343-4500",

publisher = "International Information Institute",

number = "5",

}

TY - JOUR

T1 - Primitive audio genre classification

T2 - An investigation of feature vector optimization

AU - Haque, Mohammad A.

AU - Cho, Sangjin

AU - Kim, Jongmyon

PY - 2012/5/1

Y1 - 2012/5/1

N2 - In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select me optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.

AB - In this paper, we propose a content-based audio classification approach to classify audio signals into primitive genres such as speech, music, speech with music, and speech with noise. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select me optimal feature vector by employing an analytical method to each feature. In order to ensure automatic classification by a fuzzy c-means (FCM) algorithm, we utilize a hybrid classification framework by combining FCM with k-nearest neighbor (KNN) algorithm and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results on a robust dataset demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification approaches by more than 10% of accuracy improvement in audio genre classification.

KW - Audio classification

KW - Fuzzy c-means algorithm

KW - K-nearest neighbor

KW - Multimedia retrieval

UR - http://www.scopus.com/inward/record.url?scp=84863206231&partnerID=8YFLogxK

M3 - Journal article

AN - SCOPUS:84863206231

SN - 1343-4500

VL - 15

SP - 1875

EP - 1887

JO - Information

JF - Information

IS - 5

ER -

Primitive audio genre classification: An investigation of feature vector optimization

Abstract

AUB Link

Andre filer og links

Fingeraftryk

Citationsformater