A compression-based model of musical learning

David Meredith

A compression-based model of musical learning

Research output: Contribution to conference without publisher/journal › Paper without publisher/journal › Research › peer-review

112 Downloads (Pure)

Abstract

A long tradition of psychological research considers perception to be governed by the simplicity principle [1,2]. Both music perception and music analysis involve searching for the simplest explanations for musical objects, where a musical object could be any quantity of music from a motive to a whole corpus. Musical objects are usually interpreted within the context of larger objects that contain them. For example, a work might be interpreted in the context of a corpus containing all the works by the same composer. Drawing on Kolmogorov complexity theory [3], I propose that an interpretation or reading of a musical object can be modeled as a program that computes an in extenso representation of the object. On this view, when interpreting a musical work for the first time, both the analyst and the listener’s brain are seen to be attempting to find the shortest programs that compute corpora containing the new work. In the case of the listener, the new work is interpreted largely non-consciously and in real-time in the context of a corpus which is a subset of all the music that the listener has previously heard. The analyst, on the other hand, is free to deliberately select a corpus that allows for the most satisfying (usually the most economical) interpretation of the new work. This is modelled as the modification of a pre-existing program, P, that computes some corpus (i.e., a compact encoding of the corpus), so that it can additionally compute the object to be interpreted. In other words, it is proposed that both the analyst and the listener’s brain try to find the shortest description of the new object that reuses as much of P as possible. The perceived structure of the object is then represented by the way in which the modified program, P', computes the object. On this view, both music perception and music analysis reduce to the compression of musical corpora. However, because a greedy encoding algorithm is used that resists changes to existing encodings of previously experienced music, the compression is not optimal. This model therefore predicts that each individual will have a unique interpretation of any given piece of music that depends not only on what music the listener has previously heard but also the order in which this previously heard music was presented. The model therefore suggests a pure compression-based explanation for musical memory, musical learning and individual differences in musical perception. The feasibility of this view is demonstrated in a computational model which is applied to the first book of J. S. Bach’s Das Wohltemperirte Clavier. This model pre-processes the data using the author’s PS13s1 pitch spelling algorithm [4,5], then applies a modified version of the author’s COSIATEC algorithm [6] to derive compact encodings of works that maximise reuse of previous encodings. The resulting analyses will be presented and discussed.

References
[1] Pomerantz, J. R. and Kubovy, M. (1986). Theoretical approaches to perceptual organization: Simplicity and likelihood principles. In: K. R. Boff, L. Kaufman, & J. P. Thomas (Eds.), Handbook of perception and human performance: Volume II. (pp. 36:1 -45). New York: Wiley.
[2] Koffka, K. (1963). Principles of Gestalt Psychology. New York: Harcourt, Brace & World. (Original work published 1935).
[3] Li, M. and Vitányi, P. (2008). An Introduction to Kolmogorov Complexity and Its Applications. (3rd Edition). Berlin: Springer.
[4] Meredith, D. (2006). The ps13 pitch spelling algorithm. Journal of New Music Research, 35(2):121-159.
[5] Meredith, D. (2007). Computing Pitch Names in Tonal Music: A Comparative Analysis of Pitch Spelling Algorithms. DPhil dissertation, University of Oxford.
[6] Meredith, D. (2006). Point-set algorithms for pattern discovery and pattern matching in music. Dagstuhl Seminar on Content-Based Retrieval (No. 06171) (23-28 April, 2006).

Original language	English
Publication date	18 Dec 2012
Publication status	Published - 18 Dec 2012
Event	DMRN+7: Digital Music Research Network One-day Workshop 2012 - Queen Mary University of London, London, United Kingdom Duration: 18 Dec 2012 → 18 Dec 2012

Workshop

Workshop	DMRN+7: Digital Music Research Network One-day Workshop 2012
Location	Queen Mary University of London
Country/Territory	United Kingdom
City	London
Period	18/12/2012 → 18/12/2012

Keywords

computational musicology
machine learning
pattern discovery
learning algorithms
music analysis

Access to Document

Meredith DMRN+7 SlidesAccepted author manuscript, 13.6 MB

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@conference{6ada1f3884214da3b02d422a2a309c4d,

title = "A compression-based model of musical learning",

abstract = "A long tradition of psychological research considers perception to be governed by the simplicity principle [1,2]. Both music perception and music analysis involve searching for the simplest explanations for musical objects, where a musical object could be any quantity of music from a motive to a whole corpus. Musical objects are usually interpreted within the context of larger objects that contain them. For example, a work might be interpreted in the context of a corpus containing all the works by the same composer. Drawing on Kolmogorov complexity theory [3], I propose that an interpretation or reading of a musical object can be modeled as a program that computes an in extenso representation of the object. On this view, when interpreting a musical work for the first time, both the analyst and the listener{\textquoteright}s brain are seen to be attempting to find the shortest programs that compute corpora containing the new work. In the case of the listener, the new work is interpreted largely non-consciously and in real-time in the context of a corpus which is a subset of all the music that the listener has previously heard. The analyst, on the other hand, is free to deliberately select a corpus that allows for the most satisfying (usually the most economical) interpretation of the new work. This is modelled as the modification of a pre-existing program, P, that computes some corpus (i.e., a compact encoding of the corpus), so that it can additionally compute the object to be interpreted. In other words, it is proposed that both the analyst and the listener{\textquoteright}s brain try to find the shortest description of the new object that reuses as much of P as possible. The perceived structure of the object is then represented by the way in which the modified program, P', computes the object. On this view, both music perception and music analysis reduce to the compression of musical corpora. However, because a greedy encoding algorithm is used that resists changes to existing encodings of previously experienced music, the compression is not optimal. This model therefore predicts that each individual will have a unique interpretation of any given piece of music that depends not only on what music the listener has previously heard but also the order in which this previously heard music was presented. The model therefore suggests a pure compression-based explanation for musical memory, musical learning and individual differences in musical perception. The feasibility of this view is demonstrated in a computational model which is applied to the first book of J. S. Bach{\textquoteright}s Das Wohltemperirte Clavier. This model pre-processes the data using the author{\textquoteright}s PS13s1 pitch spelling algorithm [4,5], then applies a modified version of the author{\textquoteright}s COSIATEC algorithm [6] to derive compact encodings of works that maximise reuse of previous encodings. The resulting analyses will be presented and discussed.References[1] Pomerantz, J. R. and Kubovy, M. (1986). Theoretical approaches to perceptual organization: Simplicity and likelihood principles. In: K. R. Boff, L. Kaufman, & J. P. Thomas (Eds.), Handbook of perception and human performance: Volume II. (pp. 36:1 -45). New York: Wiley.[2] Koffka, K. (1963). Principles of Gestalt Psychology. New York: Harcourt, Brace & World. (Original work published 1935).[3] Li, M. and Vit{\'a}nyi, P. (2008). An Introduction to Kolmogorov Complexity and Its Applications. (3rd Edition). Berlin: Springer.[4] Meredith, D. (2006). The ps13 pitch spelling algorithm. Journal of New Music Research, 35(2):121-159.[5] Meredith, D. (2007). Computing Pitch Names in Tonal Music: A Comparative Analysis of Pitch Spelling Algorithms. DPhil dissertation, University of Oxford.[6] Meredith, D. (2006). Point-set algorithms for pattern discovery and pattern matching in music. Dagstuhl Seminar on Content-Based Retrieval (No. 06171) (23-28 April, 2006).",

keywords = "computational musicology, machine learning, pattern discovery, learning algorithms, music analysis",

author = "David Meredith",

year = "2012",

month = dec,

day = "18",

language = "English",

note = "DMRN+7: Digital Music Research Network One-day Workshop 2012, DMRN+7 ; Conference date: 18-12-2012 Through 18-12-2012",

}

TY - CONF

T1 - A compression-based model of musical learning

AU - Meredith, David

PY - 2012/12/18

Y1 - 2012/12/18

N2 - A long tradition of psychological research considers perception to be governed by the simplicity principle [1,2]. Both music perception and music analysis involve searching for the simplest explanations for musical objects, where a musical object could be any quantity of music from a motive to a whole corpus. Musical objects are usually interpreted within the context of larger objects that contain them. For example, a work might be interpreted in the context of a corpus containing all the works by the same composer. Drawing on Kolmogorov complexity theory [3], I propose that an interpretation or reading of a musical object can be modeled as a program that computes an in extenso representation of the object. On this view, when interpreting a musical work for the first time, both the analyst and the listener’s brain are seen to be attempting to find the shortest programs that compute corpora containing the new work. In the case of the listener, the new work is interpreted largely non-consciously and in real-time in the context of a corpus which is a subset of all the music that the listener has previously heard. The analyst, on the other hand, is free to deliberately select a corpus that allows for the most satisfying (usually the most economical) interpretation of the new work. This is modelled as the modification of a pre-existing program, P, that computes some corpus (i.e., a compact encoding of the corpus), so that it can additionally compute the object to be interpreted. In other words, it is proposed that both the analyst and the listener’s brain try to find the shortest description of the new object that reuses as much of P as possible. The perceived structure of the object is then represented by the way in which the modified program, P', computes the object. On this view, both music perception and music analysis reduce to the compression of musical corpora. However, because a greedy encoding algorithm is used that resists changes to existing encodings of previously experienced music, the compression is not optimal. This model therefore predicts that each individual will have a unique interpretation of any given piece of music that depends not only on what music the listener has previously heard but also the order in which this previously heard music was presented. The model therefore suggests a pure compression-based explanation for musical memory, musical learning and individual differences in musical perception. The feasibility of this view is demonstrated in a computational model which is applied to the first book of J. S. Bach’s Das Wohltemperirte Clavier. This model pre-processes the data using the author’s PS13s1 pitch spelling algorithm [4,5], then applies a modified version of the author’s COSIATEC algorithm [6] to derive compact encodings of works that maximise reuse of previous encodings. The resulting analyses will be presented and discussed.References[1] Pomerantz, J. R. and Kubovy, M. (1986). Theoretical approaches to perceptual organization: Simplicity and likelihood principles. In: K. R. Boff, L. Kaufman, & J. P. Thomas (Eds.), Handbook of perception and human performance: Volume II. (pp. 36:1 -45). New York: Wiley.[2] Koffka, K. (1963). Principles of Gestalt Psychology. New York: Harcourt, Brace & World. (Original work published 1935).[3] Li, M. and Vitányi, P. (2008). An Introduction to Kolmogorov Complexity and Its Applications. (3rd Edition). Berlin: Springer.[4] Meredith, D. (2006). The ps13 pitch spelling algorithm. Journal of New Music Research, 35(2):121-159.[5] Meredith, D. (2007). Computing Pitch Names in Tonal Music: A Comparative Analysis of Pitch Spelling Algorithms. DPhil dissertation, University of Oxford.[6] Meredith, D. (2006). Point-set algorithms for pattern discovery and pattern matching in music. Dagstuhl Seminar on Content-Based Retrieval (No. 06171) (23-28 April, 2006).

AB - A long tradition of psychological research considers perception to be governed by the simplicity principle [1,2]. Both music perception and music analysis involve searching for the simplest explanations for musical objects, where a musical object could be any quantity of music from a motive to a whole corpus. Musical objects are usually interpreted within the context of larger objects that contain them. For example, a work might be interpreted in the context of a corpus containing all the works by the same composer. Drawing on Kolmogorov complexity theory [3], I propose that an interpretation or reading of a musical object can be modeled as a program that computes an in extenso representation of the object. On this view, when interpreting a musical work for the first time, both the analyst and the listener’s brain are seen to be attempting to find the shortest programs that compute corpora containing the new work. In the case of the listener, the new work is interpreted largely non-consciously and in real-time in the context of a corpus which is a subset of all the music that the listener has previously heard. The analyst, on the other hand, is free to deliberately select a corpus that allows for the most satisfying (usually the most economical) interpretation of the new work. This is modelled as the modification of a pre-existing program, P, that computes some corpus (i.e., a compact encoding of the corpus), so that it can additionally compute the object to be interpreted. In other words, it is proposed that both the analyst and the listener’s brain try to find the shortest description of the new object that reuses as much of P as possible. The perceived structure of the object is then represented by the way in which the modified program, P', computes the object. On this view, both music perception and music analysis reduce to the compression of musical corpora. However, because a greedy encoding algorithm is used that resists changes to existing encodings of previously experienced music, the compression is not optimal. This model therefore predicts that each individual will have a unique interpretation of any given piece of music that depends not only on what music the listener has previously heard but also the order in which this previously heard music was presented. The model therefore suggests a pure compression-based explanation for musical memory, musical learning and individual differences in musical perception. The feasibility of this view is demonstrated in a computational model which is applied to the first book of J. S. Bach’s Das Wohltemperirte Clavier. This model pre-processes the data using the author’s PS13s1 pitch spelling algorithm [4,5], then applies a modified version of the author’s COSIATEC algorithm [6] to derive compact encodings of works that maximise reuse of previous encodings. The resulting analyses will be presented and discussed.References[1] Pomerantz, J. R. and Kubovy, M. (1986). Theoretical approaches to perceptual organization: Simplicity and likelihood principles. In: K. R. Boff, L. Kaufman, & J. P. Thomas (Eds.), Handbook of perception and human performance: Volume II. (pp. 36:1 -45). New York: Wiley.[2] Koffka, K. (1963). Principles of Gestalt Psychology. New York: Harcourt, Brace & World. (Original work published 1935).[3] Li, M. and Vitányi, P. (2008). An Introduction to Kolmogorov Complexity and Its Applications. (3rd Edition). Berlin: Springer.[4] Meredith, D. (2006). The ps13 pitch spelling algorithm. Journal of New Music Research, 35(2):121-159.[5] Meredith, D. (2007). Computing Pitch Names in Tonal Music: A Comparative Analysis of Pitch Spelling Algorithms. DPhil dissertation, University of Oxford.[6] Meredith, D. (2006). Point-set algorithms for pattern discovery and pattern matching in music. Dagstuhl Seminar on Content-Based Retrieval (No. 06171) (23-28 April, 2006).

KW - computational musicology

KW - machine learning

KW - pattern discovery

KW - learning algorithms

KW - music analysis

M3 - Paper without publisher/journal

T2 - DMRN+7: Digital Music Research Network One-day Workshop 2012

Y2 - 18 December 2012 through 18 December 2012

ER -

A compression-based model of musical learning

Abstract

Workshop

Keywords

Access to Document

AUB Link

Fingerprint

Cite this