A compression-based model of musical learning

Research output: Contribution to conference without publisher/journalPaper without publisher/journalResearchpeer-review

112 Downloads (Pure)

Abstract

A long tradition of psychological research considers perception to be governed by the simplicity principle [1,2]. Both music perception and music analysis involve searching for the simplest explanations for musical objects, where a musical object could be any quantity of music from a motive to a whole corpus. Musical objects are usually interpreted within the context of larger objects that contain them. For example, a work might be interpreted in the context of a corpus containing all the works by the same composer. Drawing on Kolmogorov complexity theory [3], I propose that an interpretation or reading of a musical object can be modeled as a program that computes an in extenso representation of the object. On this view, when interpreting a musical work for the first time, both the analyst and the listener’s brain are seen to be attempting to find the shortest programs that compute corpora containing the new work. In the case of the listener, the new work is interpreted largely non-consciously and in real-time in the context of a corpus which is a subset of all the music that the listener has previously heard. The analyst, on the other hand, is free to deliberately select a corpus that allows for the most satisfying (usually the most economical) interpretation of the new work. This is modelled as the modification of a pre-existing program, P, that computes some corpus (i.e., a compact encoding of the corpus), so that it can additionally compute the object to be interpreted. In other words, it is proposed that both the analyst and the listener’s brain try to find the shortest description of the new object that reuses as much of P as possible. The perceived structure of the object is then represented by the way in which the modified program, P', computes the object. On this view, both music perception and music analysis reduce to the compression of musical corpora. However, because a greedy encoding algorithm is used that resists changes to existing encodings of previously experienced music, the compression is not optimal. This model therefore predicts that each individual will have a unique interpretation of any given piece of music that depends not only on what music the listener has previously heard but also the order in which this previously heard music was presented. The model therefore suggests a pure compression-based explanation for musical memory, musical learning and individual differences in musical perception. The feasibility of this view is demonstrated in a computational model which is applied to the first book of J. S. Bach’s Das Wohltemperirte Clavier. This model pre-processes the data using the author’s PS13s1 pitch spelling algorithm [4,5], then applies a modified version of the author’s COSIATEC algorithm [6] to derive compact encodings of works that maximise reuse of previous encodings. The resulting analyses will be presented and discussed.

References
[1] Pomerantz, J. R. and Kubovy, M. (1986). Theoretical approaches to perceptual organization: Simplicity and likelihood principles. In: K. R. Boff, L. Kaufman, & J. P. Thomas (Eds.), Handbook of perception and human performance: Volume II. (pp. 36:1 -45). New York: Wiley.
[2] Koffka, K. (1963). Principles of Gestalt Psychology. New York: Harcourt, Brace & World. (Original work published 1935).
[3] Li, M. and Vitányi, P. (2008). An Introduction to Kolmogorov Complexity and Its Applications. (3rd Edition). Berlin: Springer.
[4] Meredith, D. (2006). The ps13 pitch spelling algorithm. Journal of New Music Research, 35(2):121-159.
[5] Meredith, D. (2007). Computing Pitch Names in Tonal Music: A Comparative Analysis of Pitch Spelling Algorithms. DPhil dissertation, University of Oxford.
[6] Meredith, D. (2006). Point-set algorithms for pattern discovery and pattern matching in music. Dagstuhl Seminar on Content-Based Retrieval (No. 06171) (23-28 April, 2006).
Original languageEnglish
Publication date18 Dec 2012
Publication statusPublished - 18 Dec 2012
EventDMRN+7: Digital Music Research Network One-day Workshop 2012 - Queen Mary University of London, London, United Kingdom
Duration: 18 Dec 201218 Dec 2012

Workshop

WorkshopDMRN+7: Digital Music Research Network One-day Workshop 2012
LocationQueen Mary University of London
Country/TerritoryUnited Kingdom
CityLondon
Period18/12/201218/12/2012

Keywords

  • computational musicology
  • machine learning
  • pattern discovery
  • learning algorithms
  • music analysis

Fingerprint

Dive into the research topics of 'A compression-based model of musical learning'. Together they form a unique fingerprint.

Cite this