On Automatic Music Genre Recognition by Sparse Representation Classification using Auditory Temporal Modulations

Bob L. Sturm, Pardis Noorzad

Publication: Contribution to book/anthology/report/conference proceeding › Conference article in proceedings › Research › peer review


Abstract

A recent system combining sparse representation classification (SRC) with a perceptually motivated acoustic feature, auditory temporal modulations (ATM) \cite{Panagakis2009,Panagakis2009b,Panagakis2010c}, outperforms the state of the art in music genre recognition by a significant margin, e.g., \cite{Bergstra2006}. Since genre is so difficult to define, and seems to depend on factors broader than acoustics alone, this remarkable result motivates investigation into, among other things, why the system works and what it means for how humans organize music. In this paper, we review the application of SRC and ATM to genre recognition, and attempt to reproduce the results of \cite{Panagakis2009}. First, we find that classification results are consistent for features extracted from different analyses. Second, we find that SRC accuracy improves when we pose the sparse representation problem with inequality constraints. Finally, we find that only when we reduce the number of classes by half do we see the high accuracies reported in \cite{Panagakis2009}.
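For orientation, here is a minimal sketch of the classification rule at issue; the notation is assumed here for illustration and is not necessarily that of \cite{Panagakis2009}. SRC stacks labeled training feature vectors as the columns of a dictionary $\mathbf{D}$, finds a sparse representation of a test feature $\mathbf{y}$, and assigns the class whose training vectors best reconstruct $\mathbf{y}$. The equality-constrained formulation and the inequality-constrained variant referred to above are

\begin{equation}
\hat{\mathbf{x}} = \arg\min_{\mathbf{x}} \|\mathbf{x}\|_1 \;\text{s.t.}\; \mathbf{D}\mathbf{x} = \mathbf{y}
\qquad\text{or}\qquad
\hat{\mathbf{x}} = \arg\min_{\mathbf{x}} \|\mathbf{x}\|_1 \;\text{s.t.}\; \|\mathbf{D}\mathbf{x} - \mathbf{y}\|_2 \le \epsilon,
\end{equation}

followed by the minimum-residual decision

\begin{equation}
\hat{c} = \arg\min_{c} \|\mathbf{y} - \mathbf{D}\,\delta_c(\hat{\mathbf{x}})\|_2,
\end{equation}

where $\delta_c(\hat{\mathbf{x}})$ zeros all coefficients of $\hat{\mathbf{x}}$ except those associated with class $c$, and $\epsilon \ge 0$ is an error tolerance on the reconstruction.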
Original language: English
Title of host publication: Proceedings of the 9th International Symposium on Computer Music Modeling and Retrieval
Place of publication: London
Publication date: 2012
Pages: 379-394
Status: Published - 2012
Event: Computer Music Modeling and Retrieval - London, United Kingdom
Duration: 19 Jun 2012 - 22 Jun 2012

