An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

Mohammad A. Haque; Jong Myon Kim

doi:10.1007/s11042-012-1019-y

Mohammad A. Haque, Jong Myon Kim^*

^*Corresponding author for this work

Research output: Contribution to journal › Journal article › Research › peer-review

15 Citations (Scopus)

Abstract

Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by applying a novel analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it to the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.
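The pipeline the abstract outlines (normalize a feature vector, then cluster it with fuzzy c-means) can be illustrated with a minimal Python sketch. This is the textbook FCM update rule, not the authors' implementation: the function names, the fuzzifier m = 2, the min-max normalization, and the toy two-feature data are all assumptions made for illustration, and the paper's feature extraction and scoring method are not reproduced here.

```python
import random


def min_max_normalize(rows):
    """Scale every feature column to the range [0, 1]."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [(max(c) - min(c)) or 1.0 for c in cols]  # avoid division by zero
    return [[(v - lo) / sp for v, lo, sp in zip(r, lows, spans)] for r in rows]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def fuzzy_c_means(points, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Textbook FCM: returns (centers, memberships).

    memberships[i][k] is the degree to which point i belongs to cluster k;
    each row sums to 1. The fuzzifier m must be greater than 1.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Start from random memberships, normalized so each row sums to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(iters):
        # Center k is the mean of all points weighted by u[i][k] ** m.
        centers = []
        for k in range(c):
            w = [u[i][k] ** m for i in range(n)]
            total = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / total
                 for d in range(dim)]
            )

        # Membership is proportional to (squared distance) ** (-1 / (m - 1)).
        new_u = []
        for p in points:
            d2 = [sq_dist(p, ctr) for ctr in centers]
            if min(d2) == 0.0:
                # The point sits exactly on one or more centers.
                hits = [1.0 if d == 0.0 else 0.0 for d in d2]
                new_u.append([h / sum(hits) for h in hits])
            else:
                inv = [d ** (-1.0 / (m - 1.0)) for d in d2]
                total = sum(inv)
                new_u.append([v / total for v in inv])

        # Stop once no membership value changes by more than tol.
        shift = max(abs(a - b) for ra, rb in zip(u, new_u) for a, b in zip(ra, rb))
        u = new_u
        if shift < tol:
            break
    return centers, u


# Hypothetical two-feature vectors for six clips (illustrative values only).
features = [
    [0.10, 0.20], [0.15, 0.22], [0.12, 0.18],
    [0.90, 0.85], [0.88, 0.90], [0.92, 0.87],
]
centers, memberships = fuzzy_c_means(min_max_normalize(features), c=2)

# Hard label per clip: the cluster with the highest membership.
labels = [row.index(max(row)) for row in memberships]
```

Unlike hard k-means, each clip keeps a graded membership in every cluster, so an ambiguous segment (for example, speech over music) can be flagged by a membership row with no dominant value before a hard label is assigned.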

Original language	English
Journal	Multimedia Tools and Applications
Volume	63
Issue number	1
Pages (from-to)	77-92
Number of pages	16
ISSN	1380-7501
DOIs	https://doi.org/10.1007/s11042-012-1019-y
Publication status	Published - 1 Mar 2013

Keywords

Audio segmentation and classification
Database retrieval
Fuzzy c-means algorithm
Multimedia

Cite this

@article{c9ffb601d448465ab12fa3087247b206,

title = "An analysis of content-based classification of audio signals using a fuzzy c-means algorithm",

abstract = "Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by applying a novel analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it to the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.",

keywords = "Audio segmentation and classification, Database retrieval, Fuzzy c-means algorithm, Multimedia",

author = "Haque, {Mohammad A.} and Kim, {Jong Myon}",

year = "2013",

month = mar,

day = "1",

doi = "10.1007/s11042-012-1019-y",

language = "English",

volume = "63",

pages = "77--92",

journal = "Multimedia Tools and Applications",

issn = "1380-7501",

publisher = "Springer",

number = "1",

}

TY - JOUR

T1 - An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

AU - Haque, Mohammad A.

AU - Kim, Jong Myon

PY - 2013/3/1

Y1 - 2013/3/1

N2 - Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by applying a novel analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it to the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.

AB - Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by applying a novel analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it to the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.

KW - Audio segmentation and classification

KW - Database retrieval

KW - Fuzzy c-means algorithm

KW - Multimedia

UR - http://www.scopus.com/inward/record.url?scp=84874937245&partnerID=8YFLogxK

U2 - 10.1007/s11042-012-1019-y

DO - 10.1007/s11042-012-1019-y

M3 - Journal article

AN - SCOPUS:84874937245

SN - 1380-7501

VL - 63

SP - 77

EP - 92

JO - Multimedia Tools and Applications

JF - Multimedia Tools and Applications

IS - 1

ER -