Rhetorical relations for information retrieval

Christina Lioma*, Birger Larsen, Wei Lu

*Corresponding author

Publication: Contribution to book/anthology/report/conference proceeding; Conference article in proceedings; Research; peer-reviewed

17 Citations (Scopus)

Abstract

Typically, every part in most coherent text has some plausible reason for its presence, some function that it performs to the overall semantics of the text. Rhetorical relations, e.g. contrast, cause, explanation, describe how the parts of a text are linked to each other. Knowledge about this so-called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document's rhetorical relations be useful to IR? We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness notably (>10% in mean average precision over a state-of-the-art baseline).

Original language: English
Title: SIGIR'12 - Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval
Number of pages: 10
Publication date: 28 Sep 2012
Pages: 931-940
ISBN (Print): 9781450316583
DOI
Status: Published - 28 Sep 2012
Event: 35th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2012 - Portland, OR, USA
Duration: 12 Aug 2012 to 16 Aug 2012

Conference

Conference: 35th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2012
Country/Territory: USA
City: Portland, OR
Period: 12/08/2012 to 16/08/2012
Sponsor: Assoc. Comput. Mach., Spec. Interest Group Inf. Retr. (ACM SIGIR)
