Modeling and exploiting heterogeneous bibliographic networks for expertise ranking

Hongbo Deng, Jiawei Han, Michael R. Lyu, Irwin King

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recently expertise retrieval has received increasing interests in both academia and industry. Finding experts with demonstrated expertise for a given query is a nontrivial task especially from a large-scale Web 2.0 systems, such as question answering and bibliography data, where users are actively publishing useful content online, interacting with each other, and forming social networks in various ways, leading to heterogeneous networks in addition to the large amounts of textual content information. Many approaches have been proposed and shown to be useful for expertise ranking. However, most of these methods only consider the textual documents while ignoring heterogeneous network structures or can merely integrate with one additional kind of information. None of them can fully exploit the characteristics of heterogeneous networks. In this paper, we propose a joint regularization framework to enhance expertise retrieval by modeling heterogeneous networks as regularization constraints on top of document-centric model. We argue that multi-typed linking edges reveal valuable information which should be treated differently. Motivated by this intuition, we formulate three hypotheses to capture unique characteristics for different graphs, and mathematically model those hypotheses jointly with the document and other information. To illustrate our methodology, we apply the framework to expert finding applications using a bibliography dataset with 1.1 million papers and 0.7 million authors. The experimental results show that our proposed approach can achieve significantly better results than the baseline and other enhanced models.

Original languageEnglish (US)
Title of host publicationJCDL '12 - Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries
Pages71-80
Number of pages10
DOIs
StatePublished - Jul 11 2012
Event12th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '12 - Washington, DC, United States
Duration: Jun 10 2012Jun 14 2012

Publication series

NameProceedings of the ACM/IEEE Joint Conference on Digital Libraries
ISSN (Print)1552-5996

Other

Other12th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '12
Country/TerritoryUnited States
CityWashington, DC
Period6/10/126/14/12

Keywords

  • expertise ranking
  • graph regularization
  • heterogeneous bibliographic network
  • probabilistic model

ASJC Scopus subject areas

  • Engineering(all)

Fingerprint

Dive into the research topics of 'Modeling and exploiting heterogeneous bibliographic networks for expertise ranking'. Together they form a unique fingerprint.

Cite this