TY - GEN
T1 - Balancing Minimum Free Energy and Codon Adaptation Index for Pareto Optimal RNA Design
AU - Gu, Xinyu
AU - Qi, Yuanyuan
AU - El-Kebir, Mohammed
N1 - National Science Foundation award number CCF 2046488
PY - 2023/8
Y1 - 2023/8
N2 - The problem of designing an RNA sequence v that encodes for a given target protein w plays an important role in messenger RNA (mRNA) vaccine design. Due to codon degeneracy, there exist exponentially many RNA sequences for a single target protein. These candidate RNA sequences may adopt different secondary structure conformations with varying minimum free energy (MFE), affecting their thermodynamic stability and consequently mRNA half-life. In addition, species-specific codon usage bias, as measured by the codon adaptation index (CAI), also plays an essential role in translation efficiency. While previous works have focused on optimizing either MFE or CAI, more recent works have shown the merits of optimizing both objectives. Importantly, there is a trade-off between MFE and CAI, i.e. optimizing one objective is at the expense of the other. Here, we formulate the Pareto Optimal RNA Design problem, seeking the set of Pareto optimal solutions for which no other solution exists that is better in terms of both MFE and CAI. We introduce DERNA (DEsign RNA), which uses the weighted sum method to enumerate the Pareto front by optimizing convex combinations of both objectives. DERNA uses dynamic programming to solve each convex combination in O(|w|3) time and O(|w|2) space. Compared to a previous approach that only optimizes MFE, we show on a benchmark dataset that DERNA obtains solutions with identical MFE but superior CAI. Additionally, we show that DERNA matches the performance in terms of solution quality of LinearDesign, a recent approach that similarly seeks to balance MFE and CAI. Finally, we demonstrate our method's potential for mRNA vaccine design using SARS-CoV-2 spike as the target protein.
AB - The problem of designing an RNA sequence v that encodes for a given target protein w plays an important role in messenger RNA (mRNA) vaccine design. Due to codon degeneracy, there exist exponentially many RNA sequences for a single target protein. These candidate RNA sequences may adopt different secondary structure conformations with varying minimum free energy (MFE), affecting their thermodynamic stability and consequently mRNA half-life. In addition, species-specific codon usage bias, as measured by the codon adaptation index (CAI), also plays an essential role in translation efficiency. While previous works have focused on optimizing either MFE or CAI, more recent works have shown the merits of optimizing both objectives. Importantly, there is a trade-off between MFE and CAI, i.e. optimizing one objective is at the expense of the other. Here, we formulate the Pareto Optimal RNA Design problem, seeking the set of Pareto optimal solutions for which no other solution exists that is better in terms of both MFE and CAI. We introduce DERNA (DEsign RNA), which uses the weighted sum method to enumerate the Pareto front by optimizing convex combinations of both objectives. DERNA uses dynamic programming to solve each convex combination in O(|w|3) time and O(|w|2) space. Compared to a previous approach that only optimizes MFE, we show on a benchmark dataset that DERNA obtains solutions with identical MFE but superior CAI. Additionally, we show that DERNA matches the performance in terms of solution quality of LinearDesign, a recent approach that similarly seeks to balance MFE and CAI. Finally, we demonstrate our method's potential for mRNA vaccine design using SARS-CoV-2 spike as the target protein.
KW - Multi-objective optimization
KW - RNA sequence design
KW - dynamic programming
KW - mRNA vaccine design
KW - reverse translation
UR - http://www.scopus.com/inward/record.url?scp=85172097836&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85172097836&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.WABI.2023.21
DO - 10.4230/LIPIcs.WABI.2023.21
M3 - Conference contribution
AN - SCOPUS:85172097836
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 23rd International Workshop on Algorithms in Bioinformatics, WABI 2023
A2 - Belazzougui, Djamal
A2 - Ouangraoua, A�da
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 23rd International Workshop on Algorithms in Bioinformatics, WABI 2023
Y2 - 4 September 2023 through 6 September 2023
ER -