TY - JOUR
T1 - Fecal Metagenomics to Identify Biomarkers of Food Intake in Healthy Adults
T2 - Findings from Randomized, Controlled, Nutrition Trials
AU - Shinn, Leila M.
AU - Mansharamani, Aditya
AU - Baer, David J.
AU - Novotny, Janet A.
AU - Charron, Craig S.
AU - Khan, Naiman A.
AU - Zhu, Ruoqing
AU - Holscher, Hannah D.
N1 - This work was supported by the USDA (DJB, JAN, CSC), National Cancer Institute (DJB, JAN, CSC), Almond Board of California (DJB, JAN), California Walnut Commission (DJB, JAN), Kellogg (DJB, JAN), Hass Avocado Board (HDH, NAK), Foundation for Food and Agriculture Research New Innovator Award (HDH) , USDA National Institute of Food and Agriculture Hatch Project 1009249 (HDH) , ACES Jonathan Baldwin Turner Fellowship (LMS), the USDA Agriculture and Food Research Initiative Competitive Grant Accession Number 1026383 (LMS), and the National Center for Supercomputing Applications (NCSA) at the University of Illinois at Urbana-Champaign through the NCSA Faculty Fellows program (HDH, RZ).
PY - 2024/1
Y1 - 2024/1
N2 - Background: Undigested components of the human diet affect the composition and function of the microorganisms present in the gastrointestinal tract. Techniques like metagenomic analyses allow researchers to study functional capacity, thus revealing the potential of using metagenomic data for developing objective biomarkers of food intake. Objectives: As a continuation of our previous work using 16S and metabolomic datasets, we aimed to utilize a computationally intensive, multivariate, machine-learning approach to identify fecal KEGG (Kyoto encyclopedia of genes and genomes) Orthology (KO) categories as biomarkers that accurately classify food intake. Methods: Data were aggregated from 5 controlled feeding studies that studied the individual impact of almonds, avocados, broccoli, walnuts, barley, and oats on the adult gastrointestinal microbiota. Deoxyribonucleic acid from preintervention and postintervention fecal samples underwent shotgun genomic sequencing. After preprocessing, sequences were aligned and functionally annotated with Double Index AlignMent Of Next-generation sequencing Data v2.0.11.149 and MEtaGenome ANalyzer v6.12.2, respectively. After the count normalization, the log of the fold change ratio for resulting KOs between pre- and postintervention of the treatment group against its corresponding control was utilized to conduct differential abundance analysis. Differentially abundant KOs were used to train machine-learning models examining potential biomarkers in both single-food and multi-food models. Results: We identified differentially abundant KOs in the almond (n = 54), broccoli (n = 2474), and walnut (n = 732) groups (q < 0.20), which demonstrated classification accuracies of 80%, 87%, and 86% for the almond, broccoli, and walnut groups using a random forest model to classify food intake into each food group's respective treatment and control arms, respectively. The mixed-food random forest achieved 81% accuracy. Conclusions: Our findings reveal promise in utilizing fecal metagenomics to objectively complement self-reported measures of food intake. Future research on various foods and dietary patterns will expand these exploratory analyses for eventual use in feeding study compliance and clinical settings.
AB - Background: Undigested components of the human diet affect the composition and function of the microorganisms present in the gastrointestinal tract. Techniques like metagenomic analyses allow researchers to study functional capacity, thus revealing the potential of using metagenomic data for developing objective biomarkers of food intake. Objectives: As a continuation of our previous work using 16S and metabolomic datasets, we aimed to utilize a computationally intensive, multivariate, machine-learning approach to identify fecal KEGG (Kyoto encyclopedia of genes and genomes) Orthology (KO) categories as biomarkers that accurately classify food intake. Methods: Data were aggregated from 5 controlled feeding studies that studied the individual impact of almonds, avocados, broccoli, walnuts, barley, and oats on the adult gastrointestinal microbiota. Deoxyribonucleic acid from preintervention and postintervention fecal samples underwent shotgun genomic sequencing. After preprocessing, sequences were aligned and functionally annotated with Double Index AlignMent Of Next-generation sequencing Data v2.0.11.149 and MEtaGenome ANalyzer v6.12.2, respectively. After the count normalization, the log of the fold change ratio for resulting KOs between pre- and postintervention of the treatment group against its corresponding control was utilized to conduct differential abundance analysis. Differentially abundant KOs were used to train machine-learning models examining potential biomarkers in both single-food and multi-food models. Results: We identified differentially abundant KOs in the almond (n = 54), broccoli (n = 2474), and walnut (n = 732) groups (q < 0.20), which demonstrated classification accuracies of 80%, 87%, and 86% for the almond, broccoli, and walnut groups using a random forest model to classify food intake into each food group's respective treatment and control arms, respectively. The mixed-food random forest achieved 81% accuracy. Conclusions: Our findings reveal promise in utilizing fecal metagenomics to objectively complement self-reported measures of food intake. Future research on various foods and dietary patterns will expand these exploratory analyses for eventual use in feeding study compliance and clinical settings.
KW - KEGG
KW - dietary intake biomarkers
KW - gastrointestinal microbiome
KW - genomic sequencing
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=85179617364&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85179617364&partnerID=8YFLogxK
U2 - 10.1016/j.tjnut.2023.11.001
DO - 10.1016/j.tjnut.2023.11.001
M3 - Article
C2 - 37949114
AN - SCOPUS:85179617364
SN - 0022-3166
VL - 154
SP - 271
EP - 283
JO - Journal of Nutrition
JF - Journal of Nutrition
IS - 1
ER -