Abstract
Traditional information retrieval techniques based on keyword search help to identify a ranked set of relevant documents, which often contains many documents in the top ranks that do not meet the user's intention. By considering the semantics of the keywords and their relationships, both precision and recall can be improved. Using an ontology and mapping keywords to entities/concepts and identifying the relationship between them that the user is interested in, allows for retrieving documents that actually meet the user's intention. In this paper, we present a framework that enables semantic-aware document retrieval. User queries are mapped to semantic statements based on entities and their relationships. The framework searches for documents expressing these statements in different variations, e.g., synonymous names for entities or different textual expressions for relations between them. The size of potential result sets makes ranking documents according to their relevance to the user an essential component of such a system. The ranking model proposed in this paper is based on statistical language-models and considers aspects such as the authority of a document and the confidence in the textual pattern representing the queried information.
Originalsprog | Engelsk |
---|---|
Titel | CIKM'11 - Proceedings of the 2011 ACM International Conference on Information and Knowledge Management |
Antal sider | 10 |
Publikationsdato | 13 dec. 2011 |
Sider | 37-46 |
ISBN (Trykt) | 9781450307178 |
DOI | |
Status | Udgivet - 13 dec. 2011 |
Udgivet eksternt | Ja |
Begivenhed | 20th ACM Conference on Information and Knowledge Management, CIKM'11 - Glasgow, Storbritannien Varighed: 24 okt. 2011 → 28 okt. 2011 |
Konference
Konference | 20th ACM Conference on Information and Knowledge Management, CIKM'11 |
---|---|
Land/Område | Storbritannien |
By | Glasgow |
Periode | 24/10/2011 → 28/10/2011 |
Sponsor | Special Interest Group on Information Retrieval (ACM SIGIR), ACM SIGWEB |