Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects

Gao Cong, Christian Søndergaard Jensen, Dingming Wu

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

382 Citationer (Scopus)

Resumé

The conventional Internet is acquiring a geo-spatial dimension. Web documents are being geo-tagged, and geo-referenced objects such as points of interest are being associated with descriptive text documents. The resulting fusion of geo-location and documents enables a new kind of top-k query that takes into account both location proximity and text relevancy. To our knowledge, only naive techniques exist that are capable of computing a general web information retrieval query while also taking location into account. This paper proposes a new indexing framework for location-aware top-k text retrieval. The framework leverages the inverted file for text retrieval and the R-tree for spatial proximity querying. Several indexing approaches are explored within the framework. The framework encompasses algorithms that utilize the proposed indexes for computing the top-k query, thus taking into account both text relevancy and location proximity to prune the search space. Results of empirical studies with an implementation of the framework demonstrate that the paper’s proposal offers scalability and is capable of excellent performance.
OriginalsprogEngelsk
TidsskriftInternational Conference on Very Large Data Bases. Proceedings
Vol/bind2
Udgave nummer1
Sider (fra-til)337-348
ISSN1047-7349
StatusUdgivet - 2009
BegivenhedInternational Conference on Very Large Databases VLDB '09 - Lyon, Frankrig
Varighed: 24 aug. 200928 aug. 2009
Konferencens nummer: 35

Konference

KonferenceInternational Conference on Very Large Databases VLDB '09
Nummer35
LandFrankrig
ByLyon
Periode24/08/200928/08/2009

Fingerprint

Information retrieval
Scalability
Fusion reactions
Internet

Citer dette

@inproceedings{e9395ad0ebbd11deb63d000ea68e967b,
title = "Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects",
abstract = "The conventional Internet is acquiring a geo-spatial dimension. Web documents are being geo-tagged, and geo-referenced objects such as points of interest are being associated with descriptive text documents. The resulting fusion of geo-location and documents enables a new kind of top-k query that takes into account both location proximity and text relevancy. To our knowledge, only naive techniques exist that are capable of computing a general web information retrieval query while also taking location into account. This paper proposes a new indexing framework for location-aware top-k text retrieval. The framework leverages the inverted file for text retrieval and the R-tree for spatial proximity querying. Several indexing approaches are explored within the framework. The framework encompasses algorithms that utilize the proposed indexes for computing the top-k query, thus taking into account both text relevancy and location proximity to prune the search space. Results of empirical studies with an implementation of the framework demonstrate that the paper’s proposal offers scalability and is capable of excellent performance.",
author = "Gao Cong and Jensen, {Christian S{\o}ndergaard} and Dingming Wu",
year = "2009",
language = "English",
volume = "2",
pages = "337--348",
journal = "International Conference on Very Large Data Bases. Proceedings",
issn = "1047-7349",
publisher = "A C M Special Interest Group",
number = "1",

}

Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects. / Cong, Gao; Jensen, Christian Søndergaard; Wu, Dingming.

I: International Conference on Very Large Data Bases. Proceedings, Bind 2, Nr. 1, 2009, s. 337-348.

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

TY - GEN

T1 - Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects

AU - Cong, Gao

AU - Jensen, Christian Søndergaard

AU - Wu, Dingming

PY - 2009

Y1 - 2009

N2 - The conventional Internet is acquiring a geo-spatial dimension. Web documents are being geo-tagged, and geo-referenced objects such as points of interest are being associated with descriptive text documents. The resulting fusion of geo-location and documents enables a new kind of top-k query that takes into account both location proximity and text relevancy. To our knowledge, only naive techniques exist that are capable of computing a general web information retrieval query while also taking location into account. This paper proposes a new indexing framework for location-aware top-k text retrieval. The framework leverages the inverted file for text retrieval and the R-tree for spatial proximity querying. Several indexing approaches are explored within the framework. The framework encompasses algorithms that utilize the proposed indexes for computing the top-k query, thus taking into account both text relevancy and location proximity to prune the search space. Results of empirical studies with an implementation of the framework demonstrate that the paper’s proposal offers scalability and is capable of excellent performance.

AB - The conventional Internet is acquiring a geo-spatial dimension. Web documents are being geo-tagged, and geo-referenced objects such as points of interest are being associated with descriptive text documents. The resulting fusion of geo-location and documents enables a new kind of top-k query that takes into account both location proximity and text relevancy. To our knowledge, only naive techniques exist that are capable of computing a general web information retrieval query while also taking location into account. This paper proposes a new indexing framework for location-aware top-k text retrieval. The framework leverages the inverted file for text retrieval and the R-tree for spatial proximity querying. Several indexing approaches are explored within the framework. The framework encompasses algorithms that utilize the proposed indexes for computing the top-k query, thus taking into account both text relevancy and location proximity to prune the search space. Results of empirical studies with an implementation of the framework demonstrate that the paper’s proposal offers scalability and is capable of excellent performance.

M3 - Conference article in Journal

VL - 2

SP - 337

EP - 348

JO - International Conference on Very Large Data Bases. Proceedings

JF - International Conference on Very Large Data Bases. Proceedings

SN - 1047-7349

IS - 1

ER -