Towards distributed node similarity search on graphs

Tianming Zhang, Yunjun Gao*, Baihua Zheng, Lu Chen, Shiting Wen, Wei Guo

*Corresponding author

Publication: Contribution to journal > Journal article > Research > peer-review

5 Citations (Scopus)
148 Downloads (Pure)

Abstract

Node similarity search on graphs has wide applications, including recommendation and link prediction, to name just a few. However, existing studies are insufficient for two reasons: (i) the scale of real-world graphs is growing rapidly, and (ii) vertices are always associated with complex attributes. In this paper, we propose an efficient distributed framework to support node similarity search on massive graphs, which considers both graph structure correlation and node attribute similarity in metric spaces. The framework consists of a preprocessing stage and a query stage. In the preprocessing stage, a parallel KD-tree construction (KDC) algorithm is developed to form a newly defined graph, the so-called hybrid graph, which integrates node attribute similarity into the original graph. To divide the graph vertices evenly into subsets, KDC adopts KD-tree partitioning after pivot mapping. In addition, two metric pruning rules and an optimized allocation strategy are presented to reduce communication and computation costs. In the query stage, based on the hybrid graph, we develop similarity search methods that use random walk with restart (RWR) to measure node similarity. To boost efficiency, we derive tight bounds to rapidly shrink the search region. Extensive experiments on three massive real-world graphs are conducted to verify the effectiveness, efficiency, and scalability of the proposed techniques.
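For readers unfamiliar with random walk with restart, the following is a minimal single-machine sketch of the generic RWR measure computed by power iteration. It is not the paper's distributed method and does not implement its bounds or pruning rules. The function name `rwr_scores`, the dictionary-based graph layout, the restart probability of 0.15, and the handling of nodes without outgoing edges are all assumptions made for illustration; the edge weights stand in for whatever weights a hybrid graph would carry.

```python
def rwr_scores(adj, seed, restart=0.15, tol=1e-10, max_iter=1000):
    """Random walk with restart by power iteration (illustrative sketch).

    adj     : dict mapping node -> dict of neighbor -> positive edge weight
              (hypothetical layout, not taken from the paper)
    seed    : query node the walker restarts from
    restart : probability of jumping back to the seed at each step
    Returns a dict mapping node -> stationary visiting probability.
    """
    # Collect every node, including ones that only appear as neighbors.
    nodes = set(adj)
    for nbrs in adj.values():
        nodes.update(nbrs)
    nodes.add(seed)

    # Start with all probability mass on the seed.
    scores = {v: 0.0 for v in nodes}
    scores[seed] = 1.0

    for _ in range(max_iter):
        nxt = {v: 0.0 for v in nodes}
        for u in nodes:
            nbrs = adj.get(u, {})
            total = sum(nbrs.values())
            if total == 0:
                # Assumption: a node with no outgoing edges sends its
                # walking mass back to the seed.
                nxt[seed] += (1 - restart) * scores[u]
                continue
            for v, w in nbrs.items():
                # Walk along an edge with probability proportional to weight.
                nxt[v] += (1 - restart) * scores[u] * w / total
        # Restart step: total mass is 1, so the seed receives `restart`.
        nxt[seed] += restart

        delta = sum(abs(nxt[v] - scores[v]) for v in nodes)
        scores = nxt
        if delta < tol:
            break
    return scores


if __name__ == "__main__":
    # Tiny undirected path c - a - b - d with unit weights.
    graph = {
        "a": {"b": 1.0, "c": 1.0},
        "b": {"a": 1.0, "d": 1.0},
        "c": {"a": 1.0},
        "d": {"b": 1.0},
    }
    for node, score in sorted(rwr_scores(graph, "a").items()):
        print(node, round(score, 4))
```

Nodes with higher scores are considered more similar to the seed. A similarity search would rank candidates by these scores, which is the step the abstract's bounds are meant to speed up by shrinking the search region.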

Original language: English
Journal: World Wide Web
Volume: 23
Issue number: 6
Pages (from-to): 3025-3053
Number of pages: 29
ISSN: 1386-145X
DOI
Status: Published - 1 Nov 2020

Bibliographical note

Publisher Copyright:
© 2020, Springer Science+Business Media, LLC, part of Springer Nature.
