Convolution-based classification of audio and symbolic representations of music

Gissel Velarde; Carlos Cancino Chacón; David Meredith; Tillman Weyde; Maarten Grachten

doi:10.1080/09298215.2018.1458885

Convolution-based classification of audio and symbolic representations of music

Gissel Velarde, Carlos Cancino Chacón, David Meredith, Tillman Weyde, Maarten Grachten

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

10 Citationer (Scopus)

121 Downloads (Pure)

Abstract

We present a novel convolution-based method for classification of audio and symbolic representations of music, which we apply to classification of music by style. Pieces of music are first sampled to pitch–time representations (piano-rolls or spectrograms) and then convolved with a Gaussian filter, before being classified by a support vector machine or by k-nearest neighbours in an ensemble of classifiers. On the well-studied task of discriminating between string quartet movements by Haydn and Mozart, we obtain accuracies that equal the state of the art on two data-sets. However, in multi-class composer identification, methods specialised for classifying symbolic representations of music are more effective. We also performed experiments on symbolic representations, synthetic audio and two different recordings of The Well-Tempered Clavier by J. S. Bach to study the method’s capacity to distinguish preludes from fugues. Our experimental results show that our approach performs similarly on symbolic representations, synthetic audio and audio recordings, setting our method apart from most previous studies that have been designed for use with either audio or symbolic data, but not both.

Originalsprog	Engelsk
Tidsskrift	Journal of New Music Research
Vol/bind	47
Udgave nummer	3
Sider (fra-til)	191-205
Antal sider	15
ISSN	0929-8215
DOI	https://doi.org/10.1080/09298215.2018.1458885
Status	Udgivet - 27 maj 2018

Adgang til dokumentet

10.1080/09298215.2018.1458885

Velarde-etal-manuscript2018Accepteret manuskript, 1,66 MB

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

http://www.scopus.com/inward/record.url?scp=85046440088&partnerID=8YFLogxK

Lrn2Cre8: Learning to Create
Meredith, D. & Bemman, B.
EU Seventh Framework Programme (FP7)
01/10/2013 → 30/09/2016
Projekter: Projekt › Forskning

Citationsformater

@article{91cdb7d1f3bb4e00831e59121e88b03b,

title = "Convolution-based classification of audio and symbolic representations of music",

abstract = "We present a novel convolution-based method for classification of audio and symbolic representations of music, which we apply to classification of music by style. Pieces of music are first sampled to pitch–time representations (piano-rolls or spectrograms) and then convolved with a Gaussian filter, before being classified by a support vector machine or by k-nearest neighbours in an ensemble of classifiers. On the well-studied task of discriminating between string quartet movements by Haydn and Mozart, we obtain accuracies that equal the state of the art on two data-sets. However, in multi-class composer identification, methods specialised for classifying symbolic representations of music are more effective. We also performed experiments on symbolic representations, synthetic audio and two different recordings of The Well-Tempered Clavier by J. S. Bach to study the method{\textquoteright}s capacity to distinguish preludes from fugues. Our experimental results show that our approach performs similarly on symbolic representations, synthetic audio and audio recordings, setting our method apart from most previous studies that have been designed for use with either audio or symbolic data, but not both.",

keywords = "Music analysis, machine learning, convolution, composer recognition, genre recognition, composer classification, symbolic music classification, Classification algorithms, filtering, audio music classification, genre classification",

author = "Gissel Velarde and {Cancino Chac{\'o}n}, Carlos and David Meredith and Tillman Weyde and Maarten Grachten",

year = "2018",

month = may,

day = "27",

doi = "10.1080/09298215.2018.1458885",

language = "English",

volume = "47",

pages = "191--205",

journal = "Journal of New Music Research",

issn = "0929-8215",

publisher = "Routledge",

number = "3",

}

TY - JOUR

T1 - Convolution-based classification of audio and symbolic representations of music

AU - Velarde, Gissel

AU - Cancino Chacón, Carlos

AU - Meredith, David

AU - Weyde, Tillman

AU - Grachten, Maarten

PY - 2018/5/27

Y1 - 2018/5/27

N2 - We present a novel convolution-based method for classification of audio and symbolic representations of music, which we apply to classification of music by style. Pieces of music are first sampled to pitch–time representations (piano-rolls or spectrograms) and then convolved with a Gaussian filter, before being classified by a support vector machine or by k-nearest neighbours in an ensemble of classifiers. On the well-studied task of discriminating between string quartet movements by Haydn and Mozart, we obtain accuracies that equal the state of the art on two data-sets. However, in multi-class composer identification, methods specialised for classifying symbolic representations of music are more effective. We also performed experiments on symbolic representations, synthetic audio and two different recordings of The Well-Tempered Clavier by J. S. Bach to study the method’s capacity to distinguish preludes from fugues. Our experimental results show that our approach performs similarly on symbolic representations, synthetic audio and audio recordings, setting our method apart from most previous studies that have been designed for use with either audio or symbolic data, but not both.

AB - We present a novel convolution-based method for classification of audio and symbolic representations of music, which we apply to classification of music by style. Pieces of music are first sampled to pitch–time representations (piano-rolls or spectrograms) and then convolved with a Gaussian filter, before being classified by a support vector machine or by k-nearest neighbours in an ensemble of classifiers. On the well-studied task of discriminating between string quartet movements by Haydn and Mozart, we obtain accuracies that equal the state of the art on two data-sets. However, in multi-class composer identification, methods specialised for classifying symbolic representations of music are more effective. We also performed experiments on symbolic representations, synthetic audio and two different recordings of The Well-Tempered Clavier by J. S. Bach to study the method’s capacity to distinguish preludes from fugues. Our experimental results show that our approach performs similarly on symbolic representations, synthetic audio and audio recordings, setting our method apart from most previous studies that have been designed for use with either audio or symbolic data, but not both.

KW - Music analysis

KW - machine learning

KW - convolution

KW - composer recognition

KW - genre recognition

KW - composer classification

KW - symbolic music classification

KW - Classification algorithms

KW - filtering

KW - audio music classification

KW - genre classification

UR - http://www.scopus.com/inward/record.url?scp=85046440088&partnerID=8YFLogxK

U2 - 10.1080/09298215.2018.1458885

DO - 10.1080/09298215.2018.1458885

M3 - Journal article

SN - 0929-8215

VL - 47

SP - 191

EP - 205

JO - Journal of New Music Research

JF - Journal of New Music Research

IS - 3

ER -

Convolution-based classification of audio and symbolic representations of music

Abstract

Adgang til dokumentet

AUB Link

Andre filer og links

Fingeraftryk

Projekter

Lrn2Cre8: Learning to Create

Citationsformater