Abstract
Funding for nutrition research in the United States is less than 5% of the National Institutes of Health budget, so nutrition researchers often turn to published work. This provides an ideal environment for text mining, where entity detection is the task of finding food mentions in text and entity linking connects each food expression to a specific food. For example, the system should harmonize the expressions soybean, soy bean, soya bean and the scientific name Glycine max (L) Merrill along with their plural forms to a single concept soybean. However, the system must not harmonize soybean with soy sprouts because these different forms of soya foods have very different nutritional profiles. Despite the numerous food ontologies available, our work on developing a gold standard for food entities revealed a unique set of challenges that would limit the utility of automated extraction for nutrition researchers.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 1364-1366 |
| Number of pages | 3 |
| Journal | Proceedings of the Association for Information Science and Technology |
| Volume | 62 |
| Issue number | 1 |
| DOIs | |
| State | Published - Oct 2025 |
Keywords
- Entity Detection
- Entity Linking
- Large Language Models
- Natural Language Processing
- Nutrition
ASJC Scopus subject areas
- General Computer Science
- Library and Information Sciences
Fingerprint
Dive into the research topics of 'In the Weeds: Entity Detection for Plant Based Foods'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS