Skip to main navigation Skip to search Skip to main content

Abstract

Funding for nutrition research in the United States is less than 5% of the National Institutes of Health budget, so nutrition researchers often turn to published work. This provides an ideal environment for text mining, where entity detection is the task of finding food mentions in text and entity linking connects each food expression to a specific food. For example, the system should harmonize the expressions soybean, soy bean, soya bean and the scientific name Glycine max (L) Merrill along with their plural forms to a single concept soybean. However, the system must not harmonize soybean with soy sprouts because these different forms of soya foods have very different nutritional profiles. Despite the numerous food ontologies available, our work on developing a gold standard for food entities revealed a unique set of challenges that would limit the utility of automated extraction for nutrition researchers.

Original languageEnglish (US)
Pages (from-to)1364-1366
Number of pages3
JournalProceedings of the Association for Information Science and Technology
Volume62
Issue number1
DOIs
StatePublished - Oct 2025

Keywords

  • Entity Detection
  • Entity Linking
  • Large Language Models
  • Natural Language Processing
  • Nutrition

ASJC Scopus subject areas

  • General Computer Science
  • Library and Information Sciences

Fingerprint

Dive into the research topics of 'In the Weeds: Entity Detection for Plant Based Foods'. Together they form a unique fingerprint.

Cite this