Abstract
We introduce two algorithms, RecurSIA and RRT, designed to increase the compression factor achievable using point-set cover algorithms based on the SIA and SIATEC pattern discovery algorithms. SIA computes the maximal translatable patterns (MTPs) in a point set, while SIATEC computes the translational equivalence class (TEC) of every MTP in a point set, where the TEC of an MTP is the set of translationally invariant occurrences of that MTP in the point set. In its output, SIATEC encodes each MTP TEC as a pair, ⟨P,V⟩, where P is the first occurrence of the MTP and V is the set of non-zero vectors that map P onto its other occurrences. RecurSIA recursively applies a TEC cover algorithm to the pattern P, in each TEC, ⟨P,V⟩, that it discovers. RRT attempts to remove translators from V in each TEC without reducing the total set of points covered by the TEC. When evaluated with COSIATEC, SIATECCompress and Forth’s algorithm on the JKU Patterns Development Database, using RecurSIA with or without RRT increased compression factor and recall but reduced precision. Using RRT alone increased compression factor and reduced recall and precision, but had a smaller effect than RecurSIA.
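The definitions in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`sia`, `translators`, `covered`, `remove_redundant_translators`) are hypothetical, the translator removal is a simple greedy pass that may differ from the RRT procedure described in the paper, and `translators` does not normalise a pattern to its first occurrence as SIATEC does.

```python
from collections import defaultdict


def sia(points):
    """Map each positive difference vector v to its MTP, {p : p + v is in the point set}."""
    pts = sorted(set(points))  # lexicographic order, so every q - p below is positive
    mtps = defaultdict(list)
    for i, p in enumerate(pts):
        for q in pts[i + 1:]:
            v = tuple(b - a for a, b in zip(p, q))
            mtps[v].append(p)
    return dict(mtps)


def translators(pattern, points):
    """Non-zero vectors v for which the translated pattern lies entirely in the point set."""
    pts = set(points)
    first = pattern[0]
    result = set()
    for q in pts:
        # Any valid translator must map the first pattern point onto some point q.
        v = tuple(b - a for a, b in zip(first, q))
        if any(v) and all(tuple(a + d for a, d in zip(p, v)) in pts for p in pattern):
            result.add(v)
    return result


def covered(pattern, vectors):
    """Points covered by the TEC <pattern, vectors>, including the pattern itself."""
    cov = set(pattern)
    for v in vectors:
        cov.update(tuple(a + d for a, d in zip(p, v)) for p in pattern)
    return cov


def remove_redundant_translators(pattern, vectors):
    """Greedy sketch (an assumption, not the paper's RRT): drop each translator
    whose removal leaves the covered set unchanged."""
    kept = sorted(vectors)
    target = covered(pattern, kept)
    for v in sorted(vectors):
        trial = [u for u in kept if u != v]
        if covered(pattern, trial) == target:
            kept = trial
    return set(kept)
```

For the point set `[(0, 0), (1, 0), (2, 0), (3, 0)]` and the pattern `[(0, 0), (1, 0)]`, `translators` finds the vectors `(1, 0)` and `(2, 0)`. The greedy pass then keeps only `(2, 0)`, because the pattern together with its translation by `(2, 0)` already covers all four points.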
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases: International Workshops of ECML PKDD 2019, Würzburg, Germany, September 16–20, 2019, Proceedings, Part II |
Editors | Peggy Cellier, Kurt Driessens |
Number of pages | 9 |
Volume | 1168 |
Place of Publication | Cham, Switzerland |
Publisher | Springer |
Publication date | 2020 |
Pages | 485-493 |
ISBN (Print) | 978-3-030-43886-9 |
ISBN (Electronic) | 978-3-030-43887-6 |
Publication status | Published - 2020 |
Event | International Workshop on Machine Learning and Music, Würzburg, Germany; Duration: 16 Sept 2019 → 16 Sept 2019; Conference number: 12; https://musml2019.weebly.com/ |
Conference
Conference | International Workshop on Machine Learning and Music |
---|---|
Number | 12 |
Country/Territory | Germany |
City | Würzburg |
Period | 16/09/2019 → 16/09/2019 |
Series | Communications in Computer and Information Science |
---|---|
Volume | 1168 |
ISSN | 1865-0929 |
Keywords
- COSIATEC
- Data compression
- Forth’s algorithm
- Geometric pattern discovery in music
- Music analysis
- Pattern discovery
- Point sets
- SIATEC
- SIATECCompress