A probabilistic relevance propagation model for hypertext retrieval

Azadeh Shakery, Cheng Xiang Zhai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A major challenge in developing models for hypertext retrieval is to effectively combine content information with the link structure available in hypertext collections. Although several link-based ranking methods have been developed to improve retrieval results, none of them can fully exploit the discrimination power of contents as well as fully exploit all useful link structures. In this paper, we propose a general relevance propagation framework for combining content and link information. The framework gives a probabilistic score to each document defined based on a probabilistic surfing model. Two main characteristics of our framework are our probabilistic view on the relevance propagation model and propagation through multiple sets of neighbors. We compare eight different models derived from the probabilistic relevance propagation framework on two standard TREC Web test collections. Our results show that all the eight relevance propagation models can outperform the baseline content only ranking method for a wide range of parameter values, indicating that the relevance propagation framework provides a general, effective and robust way of exploiting link information. Our experiments also show that using multiple neighbor sets outperforms using just one type of neighbors significantly and taking a probabilistic view of propagation provides guidance on setting propagation parameters.

Original languageEnglish (US)
Title of host publicationProceedings of the 15th ACM Conference on Information and Knowledge Management, CIKM 2006
Pages550-558
Number of pages9
DOIs
StatePublished - 2006
Event15th ACM Conference on Information and Knowledge Management, CIKM 2006 - Arlington, VA, United States
Duration: Nov 6 2006Nov 11 2006

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Other

Other15th ACM Conference on Information and Knowledge Management, CIKM 2006
Country/TerritoryUnited States
CityArlington, VA
Period11/6/0611/11/06

Keywords

  • Content and link ranking
  • Hypertext retrieval model
  • Probabilistic relevance propagation
  • Web information retrieval

ASJC Scopus subject areas

  • General Business, Management and Accounting

Fingerprint

Dive into the research topics of 'A probabilistic relevance propagation model for hypertext retrieval'. Together they form a unique fingerprint.

Cite this