Coreference by Appearance: Visually Grounded Event Coreference Resolution

Liming Wang, Shengyu Feng, Xudong Lin, Manling Li, Heng Ji, Shih Fu Chang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Event coreference resolution is critical to understand events in growing number of online news with multiple modalities including text, video, speech, etc. However, the events and entities depicting in different modalities may not be perfectly aligned and can be difficult to annotate, which makes the task especially challenging with little supervision available. To address the above issues, we propose a supervised model based on attention mechanism and an unsupervised model based on statistical machine translation, capable of learning the relative importance of modalities for event coreference resolution. Experiments on a video multimedia event dataset show that our multimodal models outperform text-only systems in the event coreference resolution task. A careful analysis reveals that the performance gain of the multimodal model especially under the unsupervised setting comes from better learning of visually salient events.

Original languageEnglish (US)
Title of host publication4th Workshop on Computational Models of Reference, Anaphora and Coreference, CRAC 2021 - Proceedings of the Workshop
EditorsMaciej Ogrodniczuk, Sameer Pradhan, Massimo Poesio, Yulia Grishina, Vincent Ng
PublisherAssociation for Computational Linguistics (ACL)
Pages132-140
Number of pages9
ISBN (Electronic)9781955917025
StatePublished - 2021
Event4th Workshop on Computational Models of Reference, Anaphora and Coreference, CRAC 2021 - Punta Cana, Dominican Republic
Duration: Nov 10 2021Nov 11 2021

Publication series

Name4th Workshop on Computational Models of Reference, Anaphora and Coreference, CRAC 2021 - Proceedings of the Workshop

Conference

Conference4th Workshop on Computational Models of Reference, Anaphora and Coreference, CRAC 2021
Country/TerritoryDominican Republic
CityPunta Cana
Period11/10/2111/11/21

ASJC Scopus subject areas

  • Artificial Intelligence
  • Modeling and Simulation

Fingerprint

Dive into the research topics of 'Coreference by Appearance: Visually Grounded Event Coreference Resolution'. Together they form a unique fingerprint.

Cite this