Abstract
We introduce HiPaR, a novel pattern-aided regression method for data with both categorical and numerical attributes. HiPaR mines hybrid rules of the form p⇒ y= f(X) where p is the characterization of a data region and f(X) is a linear regression model on a variable of interest y. The novelty of the method lies in the combination of an enumerative approach to explore the space of regions and efficient heuristics that guide the search. Such a strategy provides more flexibility when selecting a small set of jointly accurate and human-readable hybrid rules that explain the entire dataset. As our experiments shows, HiPaR mines fewer rules than existing pattern-based regression methods while still attaining state-of-the-art prediction performance.
Original language | English |
---|---|
Title of host publication | Advances in Knowledge Discovery and Data Mining - 25th Pacific-Asia Conference, PAKDD 2021, Proceedings |
Editors | Kamal Karlapalem, Hong Cheng, Naren Ramakrishnan, R. K. Agrawal, P. Krishna Reddy, Jaideep Srivastava, Tanmoy Chakraborty |
Number of pages | 13 |
Publisher | Springer |
Publication date | 2021 |
Pages | 320-332 |
ISBN (Print) | 9783030757618 |
DOIs | |
Publication status | Published - 2021 |
Event | 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2021 - Virtual, Online Duration: 11 May 2021 → 14 May 2021 |
Conference
Conference | 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2021 |
---|---|
City | Virtual, Online |
Period | 11/05/2021 → 14/05/2021 |
Series | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12712 LNAI |
ISSN | 0302-9743 |
Bibliographical note
Publisher Copyright:© 2021, Springer Nature Switzerland AG.
Keywords
- Linear regression
- Rule mining