Coupling growth kinetics modeling with machine learning reveals microbial immigration impacts and identifies key environmental parameters in a biological wastewater treatment process

Ran Mei, Jinha Kim, Fernanda P. Wilson, Benjamin T.W. Bocher, Wen-Tso Liu

Research output: Contribution to journalArticle

Abstract

Background: Ubiquitous in natural and engineered ecosystems, microbial immigration is one of the mechanisms shaping community assemblage. However, quantifying immigration impact remains challenging especially at individual population level. The activities of immigrants in the receiving community are often inadequately considered, leading to potential bias in identifying the relationship between community composition and environmental parameters. Results: This study quantified microbial immigration from an upstream full-scale anaerobic reactor to downstream activated sludge reactors. A mass balance was applied to 16S rRNA gene amplicon sequencing data to calculate the net growth rates of individual populations in the activated sludge reactors. Among the 1178 observed operational taxonomic units (OTUs), 582 had a positive growth rate, including all the populations with abundance > 0.1%. These active populations collectively accounted for 99% of the total sequences in activated sludge. The remaining 596 OTUs with a growth rate ≤ 0 were classified as inactive populations. All the abundant populations in the upstream anaerobic reactor were inactive in the activated sludge process, indicating a negligible immigration impact. We used a supervised learning regressor to predict environmental parameters based on community composition and compared the prediction accuracy based on either the entire community or the active populations. Temperature was the most predictable parameter, and the prediction accuracy was improved when only active populations were used to train the regressor. Conclusions: Calculating growth rate of individual microbial populations in the downstream system provides an effective approach to determine microbial activity and quantify immigration impact. For the studied biological process, a marginal immigration impact was observed, likely due to the significant differences in the growth environments between the upstream and downstream processes. Excluding inactive populations as a result of immigration further enhanced the prediction of key environmental parameters affecting process performance.

Original languageEnglish (US)
Article number65
JournalMicrobiome
Volume7
Issue number1
DOIs
StatePublished - Apr 17 2019

Fingerprint

Emigration and Immigration
Waste Water
Growth
Population
Sewage
Therapeutics
Machine Learning
Biological Phenomena
rRNA Genes
Ecosystem
Learning
Temperature

Keywords

  • Active population
  • Immigration impact
  • Machine learning

ASJC Scopus subject areas

  • Microbiology
  • Microbiology (medical)

Cite this

Coupling growth kinetics modeling with machine learning reveals microbial immigration impacts and identifies key environmental parameters in a biological wastewater treatment process. / Mei, Ran; Kim, Jinha; Wilson, Fernanda P.; Bocher, Benjamin T.W.; Liu, Wen-Tso.

In: Microbiome, Vol. 7, No. 1, 65, 17.04.2019.

Research output: Contribution to journalArticle

@article{4e428aa8f4b24734a491dddd8fc6aa01,
title = "Coupling growth kinetics modeling with machine learning reveals microbial immigration impacts and identifies key environmental parameters in a biological wastewater treatment process",
abstract = "Background: Ubiquitous in natural and engineered ecosystems, microbial immigration is one of the mechanisms shaping community assemblage. However, quantifying immigration impact remains challenging especially at individual population level. The activities of immigrants in the receiving community are often inadequately considered, leading to potential bias in identifying the relationship between community composition and environmental parameters. Results: This study quantified microbial immigration from an upstream full-scale anaerobic reactor to downstream activated sludge reactors. A mass balance was applied to 16S rRNA gene amplicon sequencing data to calculate the net growth rates of individual populations in the activated sludge reactors. Among the 1178 observed operational taxonomic units (OTUs), 582 had a positive growth rate, including all the populations with abundance > 0.1{\%}. These active populations collectively accounted for 99{\%} of the total sequences in activated sludge. The remaining 596 OTUs with a growth rate ≤ 0 were classified as inactive populations. All the abundant populations in the upstream anaerobic reactor were inactive in the activated sludge process, indicating a negligible immigration impact. We used a supervised learning regressor to predict environmental parameters based on community composition and compared the prediction accuracy based on either the entire community or the active populations. Temperature was the most predictable parameter, and the prediction accuracy was improved when only active populations were used to train the regressor. Conclusions: Calculating growth rate of individual microbial populations in the downstream system provides an effective approach to determine microbial activity and quantify immigration impact. For the studied biological process, a marginal immigration impact was observed, likely due to the significant differences in the growth environments between the upstream and downstream processes. Excluding inactive populations as a result of immigration further enhanced the prediction of key environmental parameters affecting process performance.",
keywords = "Active population, Immigration impact, Machine learning",
author = "Ran Mei and Jinha Kim and Wilson, {Fernanda P.} and Bocher, {Benjamin T.W.} and Wen-Tso Liu",
year = "2019",
month = "4",
day = "17",
doi = "10.1186/s40168-019-0682-x",
language = "English (US)",
volume = "7",
journal = "Microbiome",
issn = "2049-2618",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - Coupling growth kinetics modeling with machine learning reveals microbial immigration impacts and identifies key environmental parameters in a biological wastewater treatment process

AU - Mei, Ran

AU - Kim, Jinha

AU - Wilson, Fernanda P.

AU - Bocher, Benjamin T.W.

AU - Liu, Wen-Tso

PY - 2019/4/17

Y1 - 2019/4/17

N2 - Background: Ubiquitous in natural and engineered ecosystems, microbial immigration is one of the mechanisms shaping community assemblage. However, quantifying immigration impact remains challenging especially at individual population level. The activities of immigrants in the receiving community are often inadequately considered, leading to potential bias in identifying the relationship between community composition and environmental parameters. Results: This study quantified microbial immigration from an upstream full-scale anaerobic reactor to downstream activated sludge reactors. A mass balance was applied to 16S rRNA gene amplicon sequencing data to calculate the net growth rates of individual populations in the activated sludge reactors. Among the 1178 observed operational taxonomic units (OTUs), 582 had a positive growth rate, including all the populations with abundance > 0.1%. These active populations collectively accounted for 99% of the total sequences in activated sludge. The remaining 596 OTUs with a growth rate ≤ 0 were classified as inactive populations. All the abundant populations in the upstream anaerobic reactor were inactive in the activated sludge process, indicating a negligible immigration impact. We used a supervised learning regressor to predict environmental parameters based on community composition and compared the prediction accuracy based on either the entire community or the active populations. Temperature was the most predictable parameter, and the prediction accuracy was improved when only active populations were used to train the regressor. Conclusions: Calculating growth rate of individual microbial populations in the downstream system provides an effective approach to determine microbial activity and quantify immigration impact. For the studied biological process, a marginal immigration impact was observed, likely due to the significant differences in the growth environments between the upstream and downstream processes. Excluding inactive populations as a result of immigration further enhanced the prediction of key environmental parameters affecting process performance.

AB - Background: Ubiquitous in natural and engineered ecosystems, microbial immigration is one of the mechanisms shaping community assemblage. However, quantifying immigration impact remains challenging especially at individual population level. The activities of immigrants in the receiving community are often inadequately considered, leading to potential bias in identifying the relationship between community composition and environmental parameters. Results: This study quantified microbial immigration from an upstream full-scale anaerobic reactor to downstream activated sludge reactors. A mass balance was applied to 16S rRNA gene amplicon sequencing data to calculate the net growth rates of individual populations in the activated sludge reactors. Among the 1178 observed operational taxonomic units (OTUs), 582 had a positive growth rate, including all the populations with abundance > 0.1%. These active populations collectively accounted for 99% of the total sequences in activated sludge. The remaining 596 OTUs with a growth rate ≤ 0 were classified as inactive populations. All the abundant populations in the upstream anaerobic reactor were inactive in the activated sludge process, indicating a negligible immigration impact. We used a supervised learning regressor to predict environmental parameters based on community composition and compared the prediction accuracy based on either the entire community or the active populations. Temperature was the most predictable parameter, and the prediction accuracy was improved when only active populations were used to train the regressor. Conclusions: Calculating growth rate of individual microbial populations in the downstream system provides an effective approach to determine microbial activity and quantify immigration impact. For the studied biological process, a marginal immigration impact was observed, likely due to the significant differences in the growth environments between the upstream and downstream processes. Excluding inactive populations as a result of immigration further enhanced the prediction of key environmental parameters affecting process performance.

KW - Active population

KW - Immigration impact

KW - Machine learning

UR - http://www.scopus.com/inward/record.url?scp=85064461123&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85064461123&partnerID=8YFLogxK

U2 - 10.1186/s40168-019-0682-x

DO - 10.1186/s40168-019-0682-x

M3 - Article

C2 - 30995941

AN - SCOPUS:85064461123

VL - 7

JO - Microbiome

JF - Microbiome

SN - 2049-2618

IS - 1

M1 - 65

ER -