An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

Mohammad A. Haque; Jong Myon Kim

doi:10.1007/s11042-012-1019-y

An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

Mohammad A. Haque, Jong Myon Kim^*

^*Kontaktforfatter

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

15 Citationer (Scopus)

Abstract

Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing a noble analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.

Originalsprog	Engelsk
Tidsskrift	Multimedia Tools and Applications
Vol/bind	63
Udgave nummer	1
Sider (fra-til)	77-92
Antal sider	16
ISSN	1380-7501
DOI	https://doi.org/10.1007/s11042-012-1019-y
Status	Udgivet - 1 mar. 2013

Adgang til dokumentet

10.1007/s11042-012-1019-y

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

http://www.scopus.com/inward/record.url?scp=84874937245&partnerID=8YFLogxK

Citationsformater

@article{c9ffb601d448465ab12fa3087247b206,

title = "An analysis of content-based classification of audio signals using a fuzzy c-means algorithm",

abstract = "Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing a noble analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.",

keywords = "Audio segmentation and classification, Database retrieval, Fuzzy c-means algorithm, Multimedia",

author = "Haque, {Mohammad A.} and Kim, {Jong Myon}",

year = "2013",

month = mar,

day = "1",

doi = "10.1007/s11042-012-1019-y",

language = "English",

volume = "63",

pages = "77--92",

journal = "Multimedia Tools and Applications",

issn = "1380-7501",

publisher = "Springer",

number = "1",

}

TY - JOUR

T1 - An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

AU - Haque, Mohammad A.

AU - Kim, Jong Myon

PY - 2013/3/1

Y1 - 2013/3/1

N2 - Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing a noble analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.

AB - Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing a noble analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.

KW - Audio segmentation and classification

KW - Database retrieval

KW - Fuzzy c-means algorithm

KW - Multimedia

UR - http://www.scopus.com/inward/record.url?scp=84874937245&partnerID=8YFLogxK

U2 - 10.1007/s11042-012-1019-y

DO - 10.1007/s11042-012-1019-y

M3 - Journal article

AN - SCOPUS:84874937245

SN - 1380-7501

VL - 63

SP - 77

EP - 92

JO - Multimedia Tools and Applications

JF - Multimedia Tools and Applications

IS - 1

ER -

An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

Abstract

Adgang til dokumentet

AUB Link

Andre filer og links

Fingeraftryk

Citationsformater