Multi-Source Spatial Entity Linkage

Suela Isaj, Esteban Zimányi, Torben Bach Pedersen

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

6 Citations (Scopus)
198 Downloads (Pure)

Abstract

Besides the traditional cartographic data sources, spatial information can also be derived from location-based sources. Location-based sources offer rich spatial information describing the semantics of locations. However, even though different location-based sources refer to the same physical world, each one has only partial coverage of the spatial entities of interest, describe them with different attributes and sometimes provide contradicting information. Hence, the problem of finding which pairs of spatial entities belong to the same physical spatial entity demands specific attention. We propose a solution (QuadSky) to the problem of spatial entity linkage across diverse location-based sources. QuadSky starts with a spatial blocking technique (QuadFlex) that inherits the concept and the complexity from the quadtree algorithm but improves the splitting technique not to separate nearby points. After comparing the spatial entities of the same block, we propose a novel algorithm, referred to as SkyEx that separates the pairs considered as a match (positive class) from the rest (negative class) by using Pareto optimality. SkyEx does not require weights on the attributes, scoring function or a training set. QuadSky achieves 0.85 precision and 0.85 recall for a manually labeled dataset of 1,500 pairs and 0.87 precision and 0.6 recall for a semi-manually labeled dataset of 777,452 pairs. Moreover, QuadSky provides the best trade-off between precision and recall and consequently, the best F-measure compared to the existing baselines.
Original languageEnglish
Title of host publicationProceedings of the 16th International Symposium on Spatial and Temporal Databases, SSTD 2019 : SSTD
Number of pages10
PublisherAssociation for Computing Machinery
Publication date19 Aug 2019
Pages1-10
ISBN (Print)978-1-4503-6280-1
ISBN (Electronic)978-1-4503-6280-1
DOIs
Publication statusPublished - 19 Aug 2019
EventInternational Symposium on Spatial and Temporal Databases - Wien, Austria
Duration: 19 Aug 201921 Aug 2019
Conference number: 16th
http://sstd2019.org/

Conference

ConferenceInternational Symposium on Spatial and Temporal Databases
Number16th
Country/TerritoryAustria
CityWien
Period19/08/201921/08/2019
Internet address

Fingerprint

Dive into the research topics of 'Multi-Source Spatial Entity Linkage'. Together they form a unique fingerprint.

Cite this