Optimal citation context window sizes for biomedical retrieval

Boris Lykke Nielsen, Stefan Lavlund Skau, Florian Meier, Birger Larsen*

*Kontaktforfatter

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

2 Citationer (Scopus)
87 Downloads (Pure)

Abstract

We investigate the TREC-CDS 2016 test collections as a new resource for citation context and citation-based IR experiments. The collection contains more than 1.25 million biomedical full-text articles in XML. We find that a citation index can easily be extracted, and citation contexts easily be identified. We conduct initial experiments to determine the optimal citation context window size in this domain and collection. Surprisingly We find that quite long citation contexts of more than 250 word yield the best performance when combined linearly with the fulltext and with moderate weight on the citation contexts.

OriginalsprogEngelsk
TidsskriftCEUR Workshop Proceedings
Vol/bind2345
Sider (fra-til)51-63
Antal sider13
ISSN1613-0073
StatusUdgivet - 1 jan. 2019
Begivenhed8th International Workshop on Bibliometric-Enhanced Information Retrieval, BIR 2019 - Cologne, Tyskland
Varighed: 14 apr. 2019 → …

Konference

Konference8th International Workshop on Bibliometric-Enhanced Information Retrieval, BIR 2019
Land/OmrådeTyskland
ByCologne
Periode14/04/2019 → …

Fingeraftryk

Dyk ned i forskningsemnerne om 'Optimal citation context window sizes for biomedical retrieval'. Sammen danner de et unikt fingeraftryk.

Citationsformater