Regularized estimation of mixture models for robust pseudo-relevance feedback

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Pseudo-relevance feedback has proven to be an effective strategy for improving retrieval accuracy in all retrieval models. However the performance of existing pseudo feedback methods is often affected significantly by some parameters, such as the number of feedback documents to use and the relative weight of original query terms; these parameters generally have to be set by trial-and-error without any guidance. In this paper, we present a more robust method for pseudo feedback based on statistical language models. Our main idea is to integrate the original query with feedback documents in a single probabilistic mixture model and regularize the estimation of the language model parameters in the model so that the information in the feedback documents can be gradually added to the original query. Unlike most existing feedback methods, our new method has no parameter to tune. Experiment results on two representative data sets show that the new method is significantly more robust than a state-of-the-art baseline language modeling approach for feedback with comparable or better retrieval accuracy.

Original languageEnglish (US)
Title of host publicationProceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
PublisherAssociation for Computing Machinery
Pages162-169
Number of pages8
ISBN (Print)1595933697, 9781595933690
DOIs
StatePublished - 2006
Event29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - Seatttle, WA, United States
Duration: Aug 6 2006Aug 11 2006

Publication series

NameProceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Volume2006

Other

Other29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Country/TerritoryUnited States
CitySeatttle, WA
Period8/6/068/11/06

Keywords

  • EM
  • Mixture model
  • Pseudo feedback
  • Regulation

ASJC Scopus subject areas

  • General Engineering
  • Information Systems
  • Software
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Regularized estimation of mixture models for robust pseudo-relevance feedback'. Together they form a unique fingerprint.

Cite this