Abstract
Accurate Semantic Table Interpretation (STI) annotation has a significant impact on interpreting unlabeled data for data analysis. In this paper, we present SemTex, a system for solving the three tasks Cell Entity Annotation (CEA), Cell Type Annotation (CTA) and Cell Property Annotation (CPA) in the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab). We utilise a hybrid approach combining analysis of relationships in knowledge graphs and gradient boosting for annotation. We document and benchmark our performance using datasets from the SemTab challenge 2022 and 2023. Our approach yields competitive results compared to the current state-of-the-art tools in all three tasks.
Original language | English |
---|---|
Journal | CEUR Workshop Proceedings |
Volume | 3557 |
Pages (from-to) | 38-49 |
Number of pages | 12 |
ISSN | 1613-0073 |
Publication status | Published - 2023 |
Event | 2023 Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, SemTab 2023 - Athens, Greece Duration: 6 Nov 2023 → 10 Nov 2023 |
Conference
Conference | 2023 Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, SemTab 2023 |
---|---|
Country/Territory | Greece |
City | Athens |
Period | 06/11/2023 → 10/11/2023 |
Bibliographical note
Publisher Copyright:© 2023 CEUR-WS. All rights reserved.
Keywords
- Gradient Boosting
- Knowledge Graph
- Semantic Table Interpretation
- SemTab Challenge
- Tabular Data