Abstract
This paper studies filter and hybrid filter-wrapper feature subset selection for unsupervised learning (data clustering). We constrain the search for the best feature subset by scoring the dependence of every feature on the rest of the features, conjecturing that these scores discriminate some irrelevant features. We report experimental results on artificial and real data for unsupervised learning of naive Bayes models. Both the filter and hybrid approaches perform satisfactorily.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings on the Workshop on Probabilistic Graphical Models for Classification : (within ECML/PKDD 2003) |
Antal sider | 11 |
Publikationsdato | 2003 |
Sider | 71-82 |
Status | Udgivet - 2003 |
Begivenhed | ECML/PKDD - Cavtat-Dubrovnik, Kroatien Varighed: 22 sep. 2003 → 26 sep. 2003 Konferencens nummer: 14th / 7th |
Konference
Konference | ECML/PKDD |
---|---|
Nummer | 14th / 7th |
Land/Område | Kroatien |
By | Cavtat-Dubrovnik |
Periode | 22/09/2003 → 26/09/2003 |