TY - GEN
T1 - Axiomatic analysis of translation language model for information retrieval
AU - Karimzadehgan, Maryam
AU - Zhai, Chengxiang
PY - 2012/4/27
Y1 - 2012/4/27
N2 - Statistical translation models have been shown to outperform simple document language models which rely on exact matching of words in the query and documents. A main challenge in applying translation models to ad hoc information retrieval is to estimate a translation model without training data. In this paper, we perform axiomatic analysis of translation language model for retrieval in order to gain insights about how to optimize the estimation of translation probabilities. We propose a set of constraints that a reasonable translation language model should satisfy. We check these constraints on the state-of-the-art translation estimation method based on Mutual Information and find that it does not satisfy most of the constraints. We then propose a new estimation method that better satisfies the defined constraints. Experimental results on representative TREC data sets show that the proposed new estimation method outperforms the existing Mutual Information-based estimation, suggesting that the proposed constraints are indeed helpful for designing better estimation methods for translation language model.
AB - Statistical translation models have been shown to outperform simple document language models which rely on exact matching of words in the query and documents. A main challenge in applying translation models to ad hoc information retrieval is to estimate a translation model without training data. In this paper, we perform axiomatic analysis of translation language model for retrieval in order to gain insights about how to optimize the estimation of translation probabilities. We propose a set of constraints that a reasonable translation language model should satisfy. We check these constraints on the state-of-the-art translation estimation method based on Mutual Information and find that it does not satisfy most of the constraints. We then propose a new estimation method that better satisfies the defined constraints. Experimental results on representative TREC data sets show that the proposed new estimation method outperforms the existing Mutual Information-based estimation, suggesting that the proposed constraints are indeed helpful for designing better estimation methods for translation language model.
UR - http://www.scopus.com/inward/record.url?scp=84860120846&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84860120846&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-28997-2_23
DO - 10.1007/978-3-642-28997-2_23
M3 - Conference contribution
AN - SCOPUS:84860120846
SN - 9783642289965
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 268
EP - 280
BT - Advances in Information Retrieval - 34th European Conference on IR Research, ECIR 2012, Proceedings
T2 - 34th European Conference on Information Retrieval, ECIR 2012
Y2 - 1 April 2012 through 5 April 2012
ER -