Abstract
Besides traditional cartographic data sources, spatial information can also be derived from location-based sources, which offer rich spatial information describing the semantics of locations. However, even though different location-based sources refer to the same physical world, each one covers the spatial entities of interest only partially, describes them with different attributes, and sometimes provides contradicting information. Hence, the problem of finding which pairs of spatial entities refer to the same physical spatial entity demands specific attention. We propose a solution (QuadSky) to the problem of spatial entity linkage across diverse location-based sources. QuadSky starts with a spatial blocking technique (QuadFlex) that inherits the concept and the complexity of the quadtree algorithm but improves the splitting technique so that nearby points are not separated. After comparing the spatial entities within the same block, we propose a novel algorithm, referred to as SkyEx, that separates the pairs considered a match (positive class) from the rest (negative class) by using Pareto optimality. SkyEx does not require weights on the attributes, a scoring function, or a training set. QuadSky achieves 0.85 precision and 0.85 recall on a manually labeled dataset of 1,500 pairs, and 0.87 precision and 0.6 recall on a semi-manually labeled dataset of 777,452 pairs. Moreover, QuadSky provides the best trade-off between precision and recall and, consequently, the best F-measure compared to the existing baselines.
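To make the Pareto-optimality idea concrete, the following is a minimal sketch of skyline (non-dominated set) extraction over similarity vectors. It is not the authors' SkyEx implementation: the abstract does not say how SkyEx chooses where the positive class ends, so the layer count `k`, the function names, and the two illustrative similarity dimensions (for example name and address similarity) are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple

Vector = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is at least as good as b in every dimension and strictly better in one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def skyline(points: List[Vector]) -> List[Vector]:
    """Return the Pareto-optimal (non-dominated) points; higher values are better."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def skyline_layers(points: List[Vector], k: int) -> List[List[Vector]]:
    """Peel off up to k successive skylines (k is a hypothetical cut-off)."""
    remaining = list(points)
    layers: List[List[Vector]] = []
    for _ in range(k):
        if not remaining:
            break
        layer = skyline(remaining)
        layers.append(layer)
        remaining = [p for p in remaining if p not in layer]
    return layers


# Hypothetical similarity vectors for candidate pairs: (name_sim, address_sim).
pairs = [
    (0.9, 0.8),
    (0.7, 0.95),
    (0.6, 0.6),
    (0.2, 0.3),
    (0.1, 0.1),
]

layers = skyline_layers(pairs, k=2)
# Pairs in the first k layers would be treated as the positive class in this sketch.
positive = [p for layer in layers for p in layer]
```

Because dominance compares vectors dimension by dimension, no attribute weights or scoring function are needed, which matches the property the abstract claims for SkyEx. The quadratic scan used here is only for readability; it would not scale to hundreds of thousands of pairs.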
Original language | English |
---|---|
Title | Proceedings of the 16th International Symposium on Spatial and Temporal Databases, SSTD 2019 : SSTD |
Number of pages | 10 |
Publisher | Association for Computing Machinery |
Publication date | 19 Aug 2019 |
Pages | 1-10 |
ISBN (Print) | 978-1-4503-6280-1 |
ISBN (Electronic) | 978-1-4503-6280-1 |
DOI | |
Status | Published - 19 Aug 2019 |
Event | International Symposium on Spatial and Temporal Databases - Vienna, Austria. Duration: 19 Aug 2019 → 21 Aug 2019. Conference number: 16th. http://sstd2019.org/ |
Conference
Conference | International Symposium on Spatial and Temporal Databases |
---|---|
Number | 16th |
Country | Austria |
City | Vienna |
Period | 19/08/2019 → 21/08/2019 |
Internet address |
Prizes
Best Paper Award Runner-Up
Isaj, Suela (Recipient), Zimányi, E. (Recipient) & Pedersen, Torben Bach (Recipient), 20 Aug 2019
Prize: Conference prizes