The use of categorization information in language models for question retrieval

Xin Cao, Gao Cong, Bin Cui, Christian Søndergaard Jensen, Ce Zhang

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

91 Citationer (Scopus)

Abstract

Community Question Answering (CQA) has emerged as a popular type of service meeting a wide range of information needs. Such services enable users to ask and answer questions and to access existing question-answer pairs. CQA archives contain very large volumes of valuable user-generated content and have become important information resources on the Web. To make the body of knowledge accumulated in CQA archives accessible, effective and efficient question search is required. Question search in a CQA archive aims to retrieve historical questions that are relevant to new questions posed by users. This paper proposes a category-based framework for search in CQA archives. The framework embodies several new techniques that use language models to exploit categories of questions for improving question-answer search. Experiments conducted on real data from Yahoo! Answers demonstrate that the proposed techniques are effective and efficient and are capable of outperforming baseline methods significantly.
OriginalsprogEngelsk
TitelProceeding of the 18th ACM conference on Information and knowledge management
RedaktørerDavid Wai-Lok Cheung, Il-Yeol Song, Wesley W. Chu, Xiaohua Hu, Jimmy J Lin
Antal sider10
ForlagAssociation for Computing Machinery
Publikationsdato2009
Sider265-274
ISBN (Elektronisk)978-1-60558-512-3
StatusUdgivet - 2009
BegivenhedACM Conference on Information and Knowledge Management - Hong Kong, Kina
Varighed: 2 nov. 20096 nov. 2009
Konferencens nummer: 18

Konference

KonferenceACM Conference on Information and Knowledge Management
Nummer18
Land/OmrådeKina
ByHong Kong
Periode02/11/200906/11/2009
NavnConference on Information and Knowledge Management

Fingeraftryk

Dyk ned i forskningsemnerne om 'The use of categorization information in language models for question retrieval'. Sammen danner de et unikt fingeraftryk.

Citationsformater