Joint inference for cross-document information extraction

Qi Li, Sam Anzaroot, Wen Pin Lin, Xiang Li, Heng Ji

Research output: Chapter in Book/Report/Conference proceedingConference contribution


Previous information extraction (IE) systems are typically organized as a pipeline architecture of separated stages which make independent local decisions. When the data grows beyond some certain size, the extracted facts become inter-dependent and thus we can take advantage of information redundancy to conduct reasoning across documents and improve the performance of IE. We describe a joint inference approach based on information network structure to conduct cross-fact reasoning with an integer linear programming framework. Without using any additional labeled data this new method obtained 13.7%-24.4% user browsing cost reduction over a state-of-the-art IE system which extracts various types of facts independently.

Original languageEnglish (US)
Title of host publicationCIKM'11 - Proceedings of the 2011 ACM International Conference on Information and Knowledge Management
Number of pages4
StatePublished - Dec 13 2011
Externally publishedYes
Event20th ACM Conference on Information and Knowledge Management, CIKM'11 - Glasgow, United Kingdom
Duration: Oct 24 2011Oct 28 2011

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings


Other20th ACM Conference on Information and Knowledge Management, CIKM'11
Country/TerritoryUnited Kingdom


  • global reasoning
  • information extraction
  • integer linear programming

ASJC Scopus subject areas

  • Decision Sciences(all)
  • Business, Management and Accounting(all)


Dive into the research topics of 'Joint inference for cross-document information extraction'. Together they form a unique fingerprint.

Cite this