Analysis and enhancement of wikification for microblogs with context expansion

Taylor Cassidy, Heng Ji, Lev Ratinov, Arkaitz Zubiaga, Hongzhao Huang

Research output: Contribution to conference, Paper, peer-review

Abstract

Disambiguation to Wikipedia (D2W) is the task of linking mentions of concepts in text to their corresponding Wikipedia entries. Most previous work has focused on linking terms in formal texts (e.g., newswire) to Wikipedia. Linking terms in short informal texts (e.g., tweets) is difficult for systems and humans alike because such texts lack a rich disambiguation context. We first evaluate an existing Twitter dataset as well as the D2W task in general. We then test the effects of two tweet context expansion methods, based on tweet authorship and topic-based clustering, on a state-of-the-art D2W system and evaluate the results.

Original language: English (US)
Pages: 441-456
Number of pages: 16
State: Published - Dec 1 2012
Externally published: Yes
Event: 24th International Conference on Computational Linguistics, COLING 2012 - Mumbai, India
Duration: Dec 8 2012 - Dec 15 2012

Other

Other: 24th International Conference on Computational Linguistics, COLING 2012
Country: India
City: Mumbai
Period: 12/8/12 - 12/15/12

Keywords

  • Disambiguation context
  • Disambiguation to Wikipedia (D2W)
  • Twitter

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Language and Linguistics
  • Linguistics and Language
