TY - GEN
T1 - Improving retrieval accuracy of difficult queries through generalizing negative document language models
AU - Karimzadehgan, Maryam
AU - Zhai, Chengxiang
PY - 2011
Y1 - 2011
N2 - When a query topic is difficult and the search results are very poor, negative feedback is a very useful method to improve the retrieval accuracy and user experience. One challenge in negative feedback is that negative documents tend to be distracting in different ways, thus as training examples, negative examples are sparse. In this paper, we solve the problem of data sparseness in the language modeling framework. We propose an optimization framework, in which we learn from a few top-ranked non-relevant examples, and search in a large space of all language models to build a more general negative language model. This general negative language model has more power in pruning the non-relevant documents, thus potentially improving the performance for difficult queries. Experiment results on representative TREC collections show that the proposed optimization framework can improve negative feedback performance over the state-of-the-art negative feedback method through generalizing negative language models.
AB - When a query topic is difficult and the search results are very poor, negative feedback is a very useful method to improve the retrieval accuracy and user experience. One challenge in negative feedback is that negative documents tend to be distracting in different ways, thus as training examples, negative examples are sparse. In this paper, we solve the problem of data sparseness in the language modeling framework. We propose an optimization framework, in which we learn from a few top-ranked non-relevant examples, and search in a large space of all language models to build a more general negative language model. This general negative language model has more power in pruning the non-relevant documents, thus potentially improving the performance for difficult queries. Experiment results on representative TREC collections show that the proposed optimization framework can improve negative feedback performance over the state-of-the-art negative feedback method through generalizing negative language models.
KW - difficult topics
KW - generalizing language model
KW - language models
KW - negative feedback
KW - optimization
UR - http://www.scopus.com/inward/record.url?scp=83055187806&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=83055187806&partnerID=8YFLogxK
U2 - 10.1145/2063576.2063586
DO - 10.1145/2063576.2063586
M3 - Conference contribution
AN - SCOPUS:83055187806
SN - 9781450307178
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 27
EP - 36
BT - CIKM'11 - Proceedings of the 2011 ACM International Conference on Information and Knowledge Management
T2 - 20th ACM Conference on Information and Knowledge Management, CIKM'11
Y2 - 24 October 2011 through 28 October 2011
ER -