Using general-purpose compression algorithms for music analysis

Corentin Louboutin; David Meredith

doi:10.1080/09298215.2015.1133656

Using general-purpose compression algorithms for music analysis

Corentin Louboutin, David Meredith

Institut for Arkitektur og Medieteknologi

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

24 Citationer (Scopus)

660 Downloads (Pure)

Abstract

General-purpose compression algorithms encode files as dictionaries of substrings with the positions of these strings’ occurrences. We hypothesized that such algorithms could be used for pattern discovery in music. We compared LZ77, LZ78, Burrows–Wheeler and COSIATEC on classifying folk song melodies. A novel method was used, combining multiple viewpoints, the k-nearest-neighbour algorithm and a novel distance metric, corpus compression distance. Using single viewpoints, COSIATEC outperformed the general-purpose compressors, with a classification success rate of 85% on this task. However, by combining 8 of the 10 best-performing viewpoints, including seven that used LZ77, the classification success rate rose to over 94%. In a second experiment, we compared LZ77 with COSIATEC on the task of discovering subject and countersubject entries in fugues by J. S. Bach. When voice information was absent in the input data, COSIATEC outperformed LZ77 with a mean F1 score of 0.123, compared with 0.053 for LZ77. However, when the music was processed a voice at a time, the F1 score for LZ77 more than doubled to 0.124. We also discovered a significant correlation between compression factor and F1 score for all the algorithms, supporting the hypothesis that the best analyses are those represented by the shortest descriptions.

Originalsprog	Engelsk
Tidsskrift	Journal of New Music Research
Vol/bind	45
Udgave nummer	1
Sider (fra-til)	1-16
ISSN	0929-8215
DOI	https://doi.org/10.1080/09298215.2015.1133656
Status	Udgivet - 2016

Adgang til dokumentet

10.1080/09298215.2015.1133656

Author pre-printAccepteret manuskript, 612 KB

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Lrn2Cre8: Learning to Create
Meredith, D. & Bemman, B.
EU Seventh Framework Programme (FP7)
01/10/2013 → 30/09/2016
Projekter: Projekt › Forskning

Citationsformater

@article{0f2db37e238149f6b79b5073cb6f278f,

title = "Using general-purpose compression algorithms for music analysis",

abstract = "General-purpose compression algorithms encode files as dictionaries of substrings with the positions of these strings{\textquoteright} occurrences. We hypothesized that such algorithms could be used for pattern discovery in music. We compared LZ77, LZ78, Burrows–Wheeler and COSIATEC on classifying folk song melodies. A novel method was used, combining multiple viewpoints, the k-nearest-neighbour algorithm and a novel distance metric, corpus compression distance. Using single viewpoints, COSIATEC outperformed the general-purpose compressors, with a classification success rate of 85% on this task. However, by combining 8 of the 10 best-performing viewpoints, including seven that used LZ77, the classification success rate rose to over 94%. In a second experiment, we compared LZ77 with COSIATEC on the task of discovering subject and countersubject entries in fugues by J. S. Bach. When voice information was absent in the input data, COSIATEC outperformed LZ77 with a mean F1 score of 0.123, compared with 0.053 for LZ77. However, when the music was processed a voice at a time, the F1 score for LZ77 more than doubled to 0.124. We also discovered a significant correlation between compression factor and F1 score for all the algorithms, supporting the hypothesis that the best analyses are those represented by the shortest descriptions.",

keywords = "music analysis, data compression, lempel-ziv, COSIATEC, classification, machine learning, pattern discovery, Kolmogorov complexity, minimum description length",

author = "Corentin Louboutin and David Meredith",

year = "2016",

doi = "10.1080/09298215.2015.1133656",

language = "English",

volume = "45",

pages = "1--16",

journal = "Journal of New Music Research",

issn = "0929-8215",

publisher = "Routledge",

number = "1",

}

TY - JOUR

T1 - Using general-purpose compression algorithms for music analysis

AU - Louboutin, Corentin

AU - Meredith, David

PY - 2016

Y1 - 2016

N2 - General-purpose compression algorithms encode files as dictionaries of substrings with the positions of these strings’ occurrences. We hypothesized that such algorithms could be used for pattern discovery in music. We compared LZ77, LZ78, Burrows–Wheeler and COSIATEC on classifying folk song melodies. A novel method was used, combining multiple viewpoints, the k-nearest-neighbour algorithm and a novel distance metric, corpus compression distance. Using single viewpoints, COSIATEC outperformed the general-purpose compressors, with a classification success rate of 85% on this task. However, by combining 8 of the 10 best-performing viewpoints, including seven that used LZ77, the classification success rate rose to over 94%. In a second experiment, we compared LZ77 with COSIATEC on the task of discovering subject and countersubject entries in fugues by J. S. Bach. When voice information was absent in the input data, COSIATEC outperformed LZ77 with a mean F1 score of 0.123, compared with 0.053 for LZ77. However, when the music was processed a voice at a time, the F1 score for LZ77 more than doubled to 0.124. We also discovered a significant correlation between compression factor and F1 score for all the algorithms, supporting the hypothesis that the best analyses are those represented by the shortest descriptions.

AB - General-purpose compression algorithms encode files as dictionaries of substrings with the positions of these strings’ occurrences. We hypothesized that such algorithms could be used for pattern discovery in music. We compared LZ77, LZ78, Burrows–Wheeler and COSIATEC on classifying folk song melodies. A novel method was used, combining multiple viewpoints, the k-nearest-neighbour algorithm and a novel distance metric, corpus compression distance. Using single viewpoints, COSIATEC outperformed the general-purpose compressors, with a classification success rate of 85% on this task. However, by combining 8 of the 10 best-performing viewpoints, including seven that used LZ77, the classification success rate rose to over 94%. In a second experiment, we compared LZ77 with COSIATEC on the task of discovering subject and countersubject entries in fugues by J. S. Bach. When voice information was absent in the input data, COSIATEC outperformed LZ77 with a mean F1 score of 0.123, compared with 0.053 for LZ77. However, when the music was processed a voice at a time, the F1 score for LZ77 more than doubled to 0.124. We also discovered a significant correlation between compression factor and F1 score for all the algorithms, supporting the hypothesis that the best analyses are those represented by the shortest descriptions.

KW - music analysis

KW - data compression

KW - lempel-ziv

KW - COSIATEC

KW - classification

KW - machine learning

KW - pattern discovery

KW - Kolmogorov complexity

KW - minimum description length

U2 - 10.1080/09298215.2015.1133656

DO - 10.1080/09298215.2015.1133656

M3 - Journal article

SN - 0929-8215

VL - 45

SP - 1

EP - 16

JO - Journal of New Music Research

JF - Journal of New Music Research

IS - 1

ER -

Using general-purpose compression algorithms for music analysis

Abstract

Adgang til dokumentet

AUB Link

Fingeraftryk

Projekter

Lrn2Cre8: Learning to Create

Citationsformater