TY - JOUR
T1 - Biosystems Design by Machine Learning
AU - Volk, Michael Jeffrey
AU - Lourentzou, Ismini
AU - Mishra, Shekhar
AU - Vo, Lam Tung
AU - Zhai, Chengxiang
AU - Zhao, Huimin
N1 - Funding Information:
This work was mainly funded by the DOE Center for Advanced Bioenergy and Bioproducts Innovation (U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research under Award Number DE-SC0018420). In addition, we thank financial supports from U.S. Department of Energy (DE-SC0018260) (H.Z.) and U.S. National Institutes of Health (1UM1HG009402, 1U54DK107965, and AI144967) (H.Z.).
Publisher Copyright:
© 2020 American Chemical Society.
PY - 2020/7/17
Y1 - 2020/7/17
N2 - Biosystems such as enzymes, pathways, and whole cells have been increasingly explored for biotechnological applications. However, the intricate connectivity and resulting complexity of biosystems poses a major hurdle in designing biosystems with desirable features. As -omics and other high throughput technologies have been rapidly developed, the promise of applying machine learning (ML) techniques in biosystems design has started to become a reality. ML models enable the identification of patterns within complicated biological data across multiple scales of analysis and can augment biosystems design applications by predicting new candidates for optimized performance. ML is being used at every stage of biosystems design to help find nonobvious engineering solutions with fewer design iterations. In this review, we first describe commonly used models and modeling paradigms within ML. We then discuss some applications of these models that have already shown success in biotechnological applications. Moreover, we discuss successful applications at all scales of biosystems design, including nucleic acids, genetic circuits, proteins, pathways, genomes, and bioprocesses. Finally, we discuss some limitations of these methods and potential solutions as well as prospects of the combination of ML and biosystems design.
AB - Biosystems such as enzymes, pathways, and whole cells have been increasingly explored for biotechnological applications. However, the intricate connectivity and resulting complexity of biosystems poses a major hurdle in designing biosystems with desirable features. As -omics and other high throughput technologies have been rapidly developed, the promise of applying machine learning (ML) techniques in biosystems design has started to become a reality. ML models enable the identification of patterns within complicated biological data across multiple scales of analysis and can augment biosystems design applications by predicting new candidates for optimized performance. ML is being used at every stage of biosystems design to help find nonobvious engineering solutions with fewer design iterations. In this review, we first describe commonly used models and modeling paradigms within ML. We then discuss some applications of these models that have already shown success in biotechnological applications. Moreover, we discuss successful applications at all scales of biosystems design, including nucleic acids, genetic circuits, proteins, pathways, genomes, and bioprocesses. Finally, we discuss some limitations of these methods and potential solutions as well as prospects of the combination of ML and biosystems design.
KW - biosystems design
KW - machine learning
KW - metabolic engineering
KW - synthetic biology
UR - http://www.scopus.com/inward/record.url?scp=85088238532&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85088238532&partnerID=8YFLogxK
U2 - 10.1021/acssynbio.0c00129
DO - 10.1021/acssynbio.0c00129
M3 - Article
C2 - 32485108
AN - SCOPUS:85088238532
SN - 2161-5063
VL - 9
SP - 1514
EP - 1533
JO - ACS synthetic biology
JF - ACS synthetic biology
IS - 7
ER -