TY - JOUR
T1 - An analysis of content-based classification of audio signals using a fuzzy c-means algorithm
AU - Haque, Mohammad A.
AU - Kim, Jong Myon
PY - 2013/3/1
Y1 - 2013/3/1
N2 - Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing a noble analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.
AB - Content-based audio signal classification into broad categories such as speech, music, or speech with noise is the first step before any further processing such as speech recognition, content-based indexing, or surveillance systems. In this paper, we propose an efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm. We analyze different characteristic features of audio signals in time, frequency, and coefficient domains and select the optimal feature vector by employing a noble analytical scoring method to each feature. We utilize an FCM-based classification scheme and apply it on the extracted normalized optimal feature vector to achieve an efficient classification result. Experimental results demonstrate that the proposed approach outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.
KW - Audio segmentation and classification
KW - Database retrieval
KW - Fuzzy c-means algorithm
KW - Multimedia
UR - http://www.scopus.com/inward/record.url?scp=84874937245&partnerID=8YFLogxK
U2 - 10.1007/s11042-012-1019-y
DO - 10.1007/s11042-012-1019-y
M3 - Journal article
AN - SCOPUS:84874937245
SN - 1380-7501
VL - 63
SP - 77
EP - 92
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 1
ER -