TEXTEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction

  • Kuan Hao Huang
  • , I. Hung Hsu
  • , Tanmay Parekh
  • , Zhiyu Xie
  • , Zixuan Zhang
  • , Premkumar Natarajan
  • , Kai Wei Chang
  • , Nanyun Peng
  • , Heng Ji

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Event extraction has gained considerable interest due to its wide-ranging applications. However, recent studies draw attention to evaluation issues, suggesting that reported scores may not accurately reflect the true performance. In this work, we identify and address evaluation challenges, including inconsistency due to varying data assumptions or preprocessing steps, the insufficiency of current evaluation frameworks that may introduce dataset or data split bias, and the low reproducibility of some previous approaches. To address these challenges, we present TEXTEE, a standardized, fair, and reproducible benchmark for event extraction. TEXTEE comprises standardized data preprocessing scripts and splits for 16 datasets spanning eight diverse domains and includes 14 recent methodologies, conducting a comprehensive benchmark reevaluation. We also evaluate five varied large language models on our TEXTEE benchmark and demonstrate how they struggle to achieve satisfactory performance. Inspired by our reevaluation results and findings, we discuss the role of event extraction in the current NLP era, as well as future challenges and insights derived from TEXTEE. We believe TEXTEE, the first standardized comprehensive benchmarking tool, will significantly facilitate future event extraction research.

Original languageEnglish (US)
Title of host publicationThe 62nd Annual Meeting of the Association for Computational Linguistics
Subtitle of host publicationFindings of the Association for Computational Linguistics, ACL 2024
EditorsLun-Wei Ku, Andre Martins, Vivek Srikumar
PublisherAssociation for Computational Linguistics (ACL)
Pages12804-12825
Number of pages22
ISBN (Electronic)9798891760998
DOIs
StatePublished - 2024
EventFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Hybrid, Bangkok, Thailand
Duration: Aug 11 2024Aug 16 2024

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

ConferenceFindings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
Country/TerritoryThailand
CityHybrid, Bangkok
Period8/11/248/16/24

ASJC Scopus subject areas

  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'TEXTEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction'. Together they form a unique fingerprint.

Cite this