Abstract
The Web of Data has grown explosively over the past few years, and as with any dataset, it is bound to contain invalid statements as well as gaps. Natural Language Processing (NLP) is attracting growing interest as a way to fill such gaps by transforming unstructured text into structured data. However, there is currently a fundamental mismatch between the approaches of Linked Data and NLP: the latter is often based on statistical methods, the former on explicitly modelling knowledge. Nevertheless, the two fields can strengthen each other by joining forces. In this position paper, we argue that using Linked Data to validate the output of an NLP system, and using textual data to validate Linked Open Data (LOD) cloud statements, is a promising research avenue. We illustrate our proposal with a proof of concept on a corpus of historical travel stories.
Original language | English |
---|---|
Title of host publication | 2nd Conference on Language, Data and Knowledge (LDK 2019) |
Editors | Maria Eskevich, Gerard de Melo, Christian Fath, John P. McCrae, Paul Buitelaar, Christian Chiarcos, Bettina Klimek, Milan Dojchinovski |
Number of pages | 8 |
Publisher | Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing |
Publication date | 2019 |
Pages | 13:1-13:8 |
Article number | 13 |
ISBN (Print) | 978-3-95977-105-4 |
ISBN (Electronic) | 9783959771054 |
Publication status | Published - 2019 |
Event | Conference on Language, Data and Knowledge - Leipzig, Germany; Duration: 20 May 2019 → 23 May 2019; Conference number: 2nd; http://2019.ldk-conf.org/ |
Conference
Conference | Conference on Language, Data and Knowledge |
---|---|
Number | 2nd |
Country/Territory | Germany |
City | Leipzig |
Period | 20/05/2019 → 23/05/2019 |
Internet address | http://2019.ldk-conf.org/ |
Series | Open Access Series in Informatics |
---|---|
Volume | 70 |
ISSN | 2190-6807 |
Keywords
- Data validity
- Linked data
- Natural language processing