Less is More: Non-Redundant Subspace Clustering

Ira Assent; Emmanuel Müller; Stephan Günnemann; Ralph Krieger; Thomas Seidl

Publication: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer review

Abstract

Clustering is an important data mining task for grouping
similar objects. In high dimensional data, however, effects
attributed to the "curse of dimensionality" render clustering
in high dimensional data meaningless. Due to this, recent
years have seen research on subspace clustering which
searches for clusters in relevant subspace projections of high
dimensional data. As the number of possible subspace projections
is exponential in the number of dimensions, the
number of possible subspace clusters can be overwhelming.
In this position paper, we present our work on identifying
non-redundant, relevant subspace clusters which reduce the
result set to a manageable size. We discuss techniques for
evaluating, visualizing and exploring subspace clusterings,
and propose some directions for future work.
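
To make the exponential growth mentioned in the abstract concrete, here is a minimal Python sketch. It assumes axis-parallel subspace projections, i.e. non-empty subsets of the d attributes, which is a common setting in subspace clustering but is not spelled out in the abstract. Under that assumption there are 2^d - 1 candidate subspaces. The function names `count_subspaces` and `enumerate_subspaces` are illustrative only and are not taken from the paper.

```python
from itertools import combinations


def count_subspaces(d: int) -> int:
    """Number of non-empty axis-parallel subspaces of a d-dimensional space."""
    return 2 ** d - 1


def enumerate_subspaces(dims):
    """Yield every non-empty subset of the given dimensions as a tuple."""
    for k in range(1, len(dims) + 1):
        yield from combinations(dims, k)


if __name__ == "__main__":
    # The count roughly doubles with every added dimension.
    for d in (3, 10, 20, 50):
        print(f"d={d}: {count_subspaces(d)} candidate subspaces")
```

Each subspace may in turn contain several clusters, so the number of candidate subspace clusters is at least as large as the number of subspaces, which is why reducing the result set matters.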

Original language	English
Title	1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010)
Publisher	Association for Computing Machinery
Publication date	2010
ISBN (Print)	978-1-4503-0227-2
Status	Published - 2010

Citation formats

Assent, I., Müller, E., Günnemann, S., Krieger, R., & Seidl, T. (2010). Less is More: Non-Redundant Subspace Clustering. In 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010). Association for Computing Machinery.

Assent, Ira ; Müller, Emmanuel ; Günnemann, Stephan et al. / Less is More : Non-Redundant Subspace Clustering. 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010). Association for Computing Machinery, 2010.

@inproceedings{4fd80cfe8e90494ea954d220db70a20b,

title = "Less is More: Non-Redundant Subspace Clustering",

abstract = "Clustering is an important data mining task for grouping similar objects. In high dimensional data, however, effects attributed to the ``curse of dimensionality'' render clustering in high dimensional data meaningless. Due to this, recent years have seen research on subspace clustering which searches for clusters in relevant subspace projections of high dimensional data. As the number of possible subspace projections is exponential in the number of dimensions, the number of possible subspace clusters can be overwhelming. In this position paper, we present our work on identifying non-redundant, relevant subspace clusters which reduce the result set to a manageable size. We discuss techniques for evaluating, visualizing and exploring subspace clusterings, and propose some directions for future work.",

author = "Ira Assent and Emmanuel M{\"u}ller and Stephan G{\"u}nnemann and Ralph Krieger and Thomas Seidl",

year = "2010",

language = "English",

isbn = "978-1-4503-0227-2",

booktitle = "1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010)",

publisher = "Association for Computing Machinery",

address = "United States",

}

Assent, I, Müller, E, Günnemann, S, Krieger, R & Seidl, T 2010, Less is More: Non-Redundant Subspace Clustering. in 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010). Association for Computing Machinery.

Less is More: Non-Redundant Subspace Clustering. / Assent, Ira; Müller, Emmanuel; Günnemann, Stephan et al.
1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010). Association for Computing Machinery, 2010.

TY - GEN

T1 - Less is More

T2 - Non-Redundant Subspace Clustering

AU - Assent, Ira

AU - Müller, Emmanuel

AU - Günnemann, Stephan

AU - Krieger, Ralph

AU - Seidl, Thomas

PY - 2010

Y1 - 2010

N2 - Clustering is an important data mining task for grouping similar objects. In high dimensional data, however, effects attributed to the "curse of dimensionality" render clustering in high dimensional data meaningless. Due to this, recent years have seen research on subspace clustering which searches for clusters in relevant subspace projections of high dimensional data. As the number of possible subspace projections is exponential in the number of dimensions, the number of possible subspace clusters can be overwhelming. In this position paper, we present our work on identifying non-redundant, relevant subspace clusters which reduce the result set to a manageable size. We discuss techniques for evaluating, visualizing and exploring subspace clusterings, and propose some directions for future work.

AB - Clustering is an important data mining task for grouping similar objects. In high dimensional data, however, effects attributed to the "curse of dimensionality" render clustering in high dimensional data meaningless. Due to this, recent years have seen research on subspace clustering which searches for clusters in relevant subspace projections of high dimensional data. As the number of possible subspace projections is exponential in the number of dimensions, the number of possible subspace clusters can be overwhelming. In this position paper, we present our work on identifying non-redundant, relevant subspace clusters which reduce the result set to a manageable size. We discuss techniques for evaluating, visualizing and exploring subspace clusterings, and propose some directions for future work.

M3 - Article in proceeding

SN - 978-1-4503-0227-2

BT - 1st International Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust 2010) in conjunction with 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2010)

PB - Association for Computing Machinery

ER -