Abstract
Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP)-based applications, including automated text generation, question answering, chatbots, and others. However, they face a significant challenge: hallucinations, where models produce plausible-sounding but factually incorrect responses. This undermines trust and limits the applicability of LLMs in different domains. Knowledge Graphs (KGs), on the other hand, provide a structured collection of interconnected facts represented as entities (nodes) and their relationships (edges). Recent research has leveraged KGs to provide context that can fill gaps in an LLM’s understanding of certain topics. This offers a promising approach to mitigating hallucinations, enhancing the reliability and accuracy of LLMs while retaining their wide applicability. Nonetheless, this remains a very active area of research with various open problems. In this paper, we discuss these open challenges, covering state-of-the-art datasets and benchmarks as well as methods for knowledge integration and for evaluating hallucinations. In our discussion, we consider the current use of KGs in LLM systems and identify future directions within each of these challenges.
Original language | English |
---|---|
Article number | 100844 |
Journal | Journal of Web Semantics |
Volume | 85 |
ISSN | 1570-8268 |
DOI | |
Status | Published - May 2025 |
Fingerprint
Dive into the research topics of 'Knowledge Graphs, Large Language Models, and Hallucinations: An NLP Perspective'. Together they form a unique fingerprint.
Projects
- 1 Active
- Poul Due Jensen Professorate in Big Data and Artificial Intelligence
Hose, K. (PI (principal investigator)), Jendal, T. E. (Project Participant) & Hansen, E. R. (Project Participant)
01/11/2019 → 31/12/2025
Projects: Project › Research
- 1st Annual AAU NLP Symposium
Lavrinovics, E. (Organizer) & Bjerva, J. (Organizer)
3 Dec 2024
Activity: Participation in an academic event › Organization of or participation in a conference
- MultiHal: Multi-lingual, Multi-Prompt, Multi-Task Dataset for Hallucination Evaluation
Lavrinovics, E. (Speaker)
3 Dec 2024
Activity: Talks and oral contributions › Conference presentation