UIUC BioNLP at SemEval-2021 Task 11: A Cascade of Neural Models for Structuring Scholarly NLP Contributions

Haoyang Liu, Janina Sarol, Halil Kilicoglu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We propose a cascade of neural models that performs sentence classification, phrase recognition, and triple extraction to automatically structure the scholarly contributions of NLP publications in English. To identify the most important contribution sentences in a paper, we used a BERT-based classifier with positional features (Subtask 1). A BERT-CRF model was used to recognize and characterize relevant phrases in contribution sentences (Subtask 2). We categorized the triples into several types based on whether and how their elements were expressed in text, and addressed each type using separate BERT-based classifiers as well as rules (Subtask 3). Our system was officially ranked second in the Phase 1 evaluation and first in both parts of the Phase 2 evaluation. After fixing a submission error in Phase 1, our approach yielded the best results overall. In this paper, in addition to a system description, we also provide further analysis of our results, highlighting the strengths and limitations of our system. We make our code publicly available at https://github.com/Liu-Hy/nlp-contrib-graph.

Original language: English (US)
Title of host publication: SemEval 2021 - 15th International Workshop on Semantic Evaluation, Proceedings of the Workshop
Editors: Alexis Palmer, Nathan Schneider, Natalie Schluter, Guy Emerson, Aurelie Herbelot, Xiaodan Zhu
Publisher: Association for Computational Linguistics (ACL)
Pages: 377-386
Number of pages: 10
ISBN (Electronic): 9781954085701
State: Published - 2021
Event: 15th International Workshop on Semantic Evaluation, SemEval 2021 - Virtual, Bangkok, Thailand
Duration: Aug 5 2021 – Aug 6 2021

Publication series

Name: SemEval 2021 - 15th International Workshop on Semantic Evaluation, Proceedings of the Workshop

Conference

Conference: 15th International Workshop on Semantic Evaluation, SemEval 2021
Country/Territory: Thailand
City: Virtual, Bangkok
Period: 8/5/21 – 8/6/21

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Theoretical Computer Science
