Co-clustering for Weblogs in Semantic Space

Yu Zong, Guandong Xu, Peter Dolog, Yanchun Zhang, Renjin Liu

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

2 Citationer (Scopus)
628 Downloads (Pure)

Abstract

Web clustering is an approach for aggregating web objects into various groups according to underlying relationships among them. Finding co-clusters of web objects in semantic space is an interesting topic in the context of web usage mining, which is able to capture the underlying user navigational interest and content preference simultaneously. In this paper we will present a novel web co-clustering algorithm named Co-Clustering in Semantic space (COCS) to simultaneously partition web users and pages via a latent semantic analysis approach. In COCS, we first, train the latent semantic space of weblog data by using Probabilistic Latent Semantic Analysis (PLSA) model, and then, project all weblog data objects into this semantic space with probability distribution to capture the relationship among web pages and web users, at last, propose a clustering algorithm to generate the co-cluster corresponding to each semantic factor in the latent semantic space via probability inference. The proposed approach is evaluated by experiments performed on real datasets in terms of precision and recall metrics. Experimental results have demonstrated the proposed method can effectively reveal the co-aggregates of web users and pages which are closely related.
OriginalsprogEngelsk
BogserieLecture Notes in Computer Science
Vol/bind6488
Sider (fra-til)120-127
ISSN0302-9743
DOI
StatusUdgivet - 12 dec. 2010
BegivenhedWeb Information Systems Engineering – WISE 2010 - Hong Kong, Kina
Varighed: 12 dec. 201014 dec. 2010

Konference

KonferenceWeb Information Systems Engineering – WISE 2010
Land/OmrådeKina
ByHong Kong
Periode12/12/201014/12/2010

Fingeraftryk

Dyk ned i forskningsemnerne om 'Co-clustering for Weblogs in Semantic Space'. Sammen danner de et unikt fingeraftryk.

Citationsformater