APPLICATION OF CELL-FREE SYSTEMS FOR THE DESIGN OF A GLUCOSE-RESPONSIVE BIOSENSOR AND FOR STUDYING THE EFFECTS OF KEY SARS-COV-2 PROTEINS A Dissertation Presented to the Faculty of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for the Degree of Master of Science by Anirudh Murali Narayanan December 2021 © 2021 Anirudh Murali Narayanan ALL RIGHTS RESERVED ABSTRACT Cell-free platforms have been key in the rapid advancement of synthetic biol- ogy. They enable the design of metabolic pathways for the production of any desired proteins. Cell-free systems also present many advantages over tradi- tional in vivo systems such as higher synthesis rates, direct manipulation of the chemical environment, and the ability to even produce proteins toxic to cells. In this study, we made use of the PURE cell-free system to construct a genetic cir- cuit to sense glucose by use of a transcription factor, GntR. Towards this aim, we performed tests of its repression and de-repression characteristics on a reporter with a promoter construct containing the GntR operator site. The Pareto Opti- mal Ensemble Technique was used to determine the parameters of the model for the simulation of this circuit. This work helps to provide a cell-free alternative for rapid point-of-care detection of glucose for diabetics. Additionally, in light of the recent Covid-19 pandemic, we made use of the myTXTL cell-free system to study some key SARS-CoV-2 proteins. We synthesized the viral host transla- tion inhibitor protein, nsp1 in cell-free and tested its effectiveness to inhibit the translation of a reporter in the same cell-free system. Further, we developed a model to simulate the effect of the RNA-dependent RNA Polymerase (RdRp) on viral protein replication in the myTXTL cell-free system. This work paves a way for simulating the effect of RdRp on viral infection in host cells. BIOGRAPHICAL SKETCH Anirudh is from Chennai, India. He graduated with a Bachelor of Technology in Chemical Engineering from Anna University in 2019. In the fall of that year, he joined the Master of Science Program at the Robert Frederick Smith School of Chemical and Biomolecular Engineering at Cornell University and started his research with Dr. Jeffrey D. Varner. It is difficult to find an Anirudh in the wild, but he has occasionally been spotted haunting the halls of Olin or Kimball after midnight. iii This work is dedicated to my awesome brother, Vignesh. iv ACKNOWLEDGEMENTS I would like to thank Dr. Jeffrey D. Varner for giving me the opportunity to work in his research group as well as the guidance and support he has given me over the past two years. My gratitude also goes out to Dr. Matthew DeLisa for his insightful comments and his amazing courses which have been instru- mental in shaping my interests. I am grateful for the friendship and support I received from all VarnerLab members. In particular, I would like to thank Ab- hinav Adhikari for mentoring and training me in pretty much everything I can do in the wet lab. Thank you Alagappan, Sachin, Vashist, and Vivek for those weekly game nights that often ran into the mornings. They were the highlights of each week during the Covid-19 pandemic. I would also like to thank the amazing friends I have had in Ithaca, particularly Avinash, Arvind, Abhaiguru, and Harish for being such amazing housemates. Most importantly, I would like to express my gratitude to my parents for encouraging and supporting me all these years. Finally, I would like to thank my brothers, sisters, grandparents, great-grandparents, and every uncle and aunt of my amazing family for always being there for me. v TABLE OF CONTENTS Biographical Sketch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii Dedication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v Table of Contents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii 1 Introduction 1 1.1 Cell-free protein synthesis . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Types of Cell-free systems . . . . . . . . . . . . . . . . . . . . . . . 2 1.3 Applications of Cell-free platforms . . . . . . . . . . . . . . . . . . 5 1.3.1 Production of Biologics and Specialized Proteins . . . . . . 7 1.3.2 Metabolic Engineering . . . . . . . . . . . . . . . . . . . . . 8 1.4 Mathematical Modeling of Cell-free systems . . . . . . . . . . . . 8 1.4.1 Metabolic Modeling . . . . . . . . . . . . . . . . . . . . . . 10 1.4.2 Integrated cell-free models . . . . . . . . . . . . . . . . . . 13 2 Development of a gluconate-responsive transcription factor-based ge- netic circuit in a reconstituted cell-free system 15 2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 2.2 Materials and Methods . . . . . . . . . . . . . . . . . . . . . . . . . 19 2.2.1 mP70 promoter construction . . . . . . . . . . . . . . . . . 19 2.2.2 Synthetic Circuits Architecture . . . . . . . . . . . . . . . . 19 2.2.3 Cell-Free Protein Synthesis Reactions . . . . . . . . . . . . 21 2.2.4 mRNA Quantification . . . . . . . . . . . . . . . . . . . . . 22 2.2.5 Protein Quantification . . . . . . . . . . . . . . . . . . . . . 22 2.2.6 Estimation of model parameters . . . . . . . . . . . . . . . 23 2.3 Results and Discussion . . . . . . . . . . . . . . . . . . . . . . . . . 26 2.3.1 Parameter Estimation . . . . . . . . . . . . . . . . . . . . . 27 2.3.2 Future Directions . . . . . . . . . . . . . . . . . . . . . . . . 29 3 I(nsp)ection of the SARS-CoV-2 nsp1’s host-translation effect in a cell- free system 32 3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 3.2 Materials and Methods . . . . . . . . . . . . . . . . . . . . . . . . . 35 3.2.1 Cell-Free Protein Synthesis Reactions . . . . . . . . . . . . 35 3.2.2 mRNA Quantification . . . . . . . . . . . . . . . . . . . . . 35 3.2.3 Protein Quantification . . . . . . . . . . . . . . . . . . . . . 36 3.2.4 Coexpression of deGFP with nsp1 . . . . . . . . . . . . . . 37 3.2.5 Qualitative testing of nsp1 binding with E.coli Ribosomes 37 3.3 Results and Discussion . . . . . . . . . . . . . . . . . . . . . . . . . 38 3.3.1 Future Directions . . . . . . . . . . . . . . . . . . . . . . . . 41 vi 4 Modeling the effect of the RNA-dependent RNA polymerase in vitro 43 4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 4.2 Materials and Methods . . . . . . . . . . . . . . . . . . . . . . . . . 46 4.2.1 Model Derivation . . . . . . . . . . . . . . . . . . . . . . . . 46 4.2.2 Control functions, Transcription and Translation Kinetic Limits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 4.2.3 Building the model . . . . . . . . . . . . . . . . . . . . . . . 52 4.2.4 Cell-Free Protein Synthesis Reactions . . . . . . . . . . . . 53 4.2.5 mRNA Quantification . . . . . . . . . . . . . . . . . . . . . 54 4.2.6 Protein Quantification . . . . . . . . . . . . . . . . . . . . . 54 4.3 Results and Discussion . . . . . . . . . . . . . . . . . . . . . . . . . 55 4.3.1 Future Direction . . . . . . . . . . . . . . . . . . . . . . . . 57 5 Conclusion 59 Bibliography 60 vii LIST OF FIGURES 1.1 Cell-free system based on cell lysate. The Cell-free lysate is pre- pared by the lysis of cells followed by removal of cellular debris and chromosome DNA. The energy sources, amino acids, nu- cleotides, and cofactors are added to this to prepare the cell-free extract. The template DNA can be added to it immediately to ex- press the target protein. Alternatively, the cell-free extract can be freeze-dried as pellets for storage or transportation along with lyophilized DNA constructs. These can be rehydrated through the simple addition of water for the synthesis of target proteins. [179] . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.2 Cell-free systems and their applications. The left-hand side of the image indicates the components needed for a cell-free system and the right-hand side indicates the various fields in which cell- free systems can be applied. [170] . . . . . . . . . . . . . . . . . . 5 1.3 The integration of metabolism with transcription and translation (TXTL) processes: The TXTL processes utilize the macromolecu- lar precursors from metabolism (such as NTPs, amino acids, and cofactors) for gene expression. The integrated framework can be represented as a stoichiometric matrix of metabolites participat- ing in certain reactions, along with a description of the metabolic demands for protein expression. By applying a pseudo-steady- state assumption, making use of various constraints and an ob- jective function, the metabolic fluxes in the system can be esti- mated. In the case of cell-free systems, the objective function is the maximization of target protein yield. [177] . . . . . . . . . . . 13 2.1 In this image, the first box indicates the components involved in an aTF regulated circuit: the RNA polymerases, aTFs, and DNA transcription templates. The second box indicates the circuit in the absence of the ligand. In the presence of the ligand, the aTF will no longer bind to the operator sequence and allows expres- sion of the downstream reporter as shown in the third box. [72] . 16 2.2 (a) C1 Circuit: σ70 induces Venus expression (b) C2 Circuit: σ70 induces the expression of GntR and Venus. The GntR protein once expressed represses the mP70 promoter of Venus. . . . . . 20 2.3 C3 circuit: σ70 induces the expression of GntR and Venus. The GntR protein once expressed represses the mP70 pro- moter of Venus. In the presence of a high concentration of D- Gluconate, the binding of GntR to mP70 gets repressed thereby de-repressing Venus transcription . . . . . . . . . . . . . . . . . . 21 viii 2.4 Venus protein expression without GntR (C1 - Red), Venus repres- sion by GntR (C2 - Blue), and de-repression by D-gluconate (C3 - Green). The shaded regions represent one standard deviation of the experimental measurements . . . . . . . . . . . . . . . . . . 26 2.5 Model simulations versus experimental measurements for σ70 induced Venus expression for the C1 circuit. (a) Simulated and measured Venus mRNA concentration versus time. (b) Simu- lated and measured Venus protein concentration versus time. The black points indicate the experimental values while the black curve is the predicted curve. The shaded region denotes the 95% confidence interval of the predicted curve. The error bars denote one standard deviation of the experimental data . . . 27 2.6 Model simulations versus experimental measurements for the C2 circuit. (a) Simulated and measured Venus mRNA concentra- tion versus time. (b) Simulated and measured Venus protein con- centration versus time. (c) Simulated and measured Gntr mRNA concentration versus time. The black points indicate the experi- mental values while the black curve is the predicted curve. The shaded region denotes the 95% confidence interval of the pre- dicted curve. The error bars denote one standard deviation of the experimental data . . . . . . . . . . . . . . . . . . . . . . . . . 28 2.7 Schematic of a glucose biosensor: σ70 induces the expression of GntR, σ28, insulin and α-σ28 which is downstream to insulin. σ28 induces expression of glucagon while α-σ28 inhibits the P28 pro- moter thereby inhibiting the expression of glucagon. When the gluconate levels are high, insulin will be expressed along with σ28. At low gluconate levels, only glucagon will be expressed . . 31 3.1 Circuit representing coexpression of nsp1 along with a reporter protein. The nsp1 binds to the ribosomes and selectively pre- vents translation of the reporter protein. This circuit doesn’t in- clude the potential mRNA degradation activity of nsp1 [61, 73, 165]. (Created with BioRender.com) . . . . . . . . . . . . . . . . . 34 3.2 Gene expression of nsp1. The points denote the mean, while the error bars denote one standard deviation calculated from three replicates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 3.3 Coexpression of deGFP with nsp1: (a) RFU measurements of deGFP when coexpressed with nsp1 (Red). The controls: expres- sion of deGFP alone (Blue), expression of deGFP along with nsp8 (Green). The shaded regions indicate one standard deviation (b) SDS-PAGE Analysis of these samples to confirm the production of all the proteins. The gels have been Coomassie-stained . . . . 39 ix 3.4 Rudimentary nsp1-70S Ribosome binding assay: The third chan- nel contains pure nsp1. The first and second channels contain nsp1 mixed with two times its volume of purified 70S ribosomes solution. In other words, the concentration of nsp1 in the third channel is thrice the concentration of nsp1 in the first two. Mea- surement of the fluorescence of the nsp1 bands show that the normalized concentration of nsp1 in all wells are approximately the same indicating that there is negligible binding of nsp1 with these ribosomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 4.1 Replication strategy of a (+)ssRNA virus: The viral genome is translated using cellular machinery. This genome also acts as a template for the production of complementary -mRNA. This complementary strand then acts as a template for the production of a new viral genome (+mRNA). (Image adapted from [89]) . . . 45 4.2 Introduction of RdRp into a genetic circuit: RdRp uses the +mRNA as a template for the production of a complementary -mRNA and uses the -mRNA as a template for the production of +mRNA. Meanwhile, the +mRNA is also translated to synthe- size proteins (Created with BioRender.com) . . . . . . . . . . . . 48 4.3 Temporal variation of N mRNA and protein when expressed alongside RdRp. The blue curves represent the expression when the model doesn’t account for the effect of RdRp, while the or- ange curves account for the RdRp feedback to amplify mRNA and thereby amplify protein expression . . . . . . . . . . . . . . . 55 4.4 Gene expression of nsp7. The points denote the mean, while the error bars denote one standard deviation calculated from three replicates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56 4.5 Gene expression of nsp8. The points denote the mean, while the error bars denote one standard deviation calculated from three replicates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 x CHAPTER 1 INTRODUCTION 1.1 Cell-free protein synthesis Cell-free systems (CFS) have recently evolved into key platforms for synthetic biology applications. Cell-free systems present many advantages over tradi- tional in vivo systems. Since cell-free systems don’t require cellular growth and maintenance, they can allow for higher synthesis rates by allocating more re- sources towards production of the desired product. The lack of cellular growth also means that they can be used for the production of proteins that are oth- erwise difficult to produce due to their toxicity to cells. It is also possible to directly observe and manipulate the chemical environment of cell-free systems allowing for rapid tuning of the reaction conditions [46, 92, 179]. This chapter is adapted with permission from the study of my colleagues Vilkhovoy et al. at the VarnerLab [179]. Cell-free systems have historically been used as investigative tools for Molecular Biology research. Some of the first examples of their use were in the 1950s by Borsook [15] and Winnick [190] to study the incorporation of amino acids into proteins. The work of Gale et al. to study the effect of nucleic acids in protein synthesis made use of Staphylococcus aureus extract [40]. Even the famous Nirenberg and Matthaei experiments in the early 1960s to discover the genetic code was performed using E. coli cell-free extracts [100, 117]. One of the first precursors to modern cell-free transcription and translation (TXTL) platforms was developed by Lederman and Zubay [84] who developed 1 a coupled transcription-translation bacterial extract that allowed DNA to be used as a template to directly synthesize proteins. In 1988, Spirin et al. de- veloped a continuous cell-free system with a continuous flow of the feeding buffer and a continuous removal of the protein product. They were able to run their system for tens of hours [155]. The Swartz lab improved the energy effi- ciency of E.coli based CFPS by generating ATP with substrate-level phosphory- lation [77]. And this was further improved by oxidative phosphorylation in the Cytomim system that they developed [67–69]. Then the Noireaux lab developed the myTXTL platform that couples ATP regeneration with inorganic phosphate recycling thereby further boosting the protein production [42]. Cell-free platforms can also be used to design synthetic genetic circuits to control gene expression. Bacteriophage-encoded RNA polymerases (RNAP) such as T7, T3, and SP6 RNAP are commonly used in CFPS for their higher tran- scription efficiency [118,143]. The use of bacterial regulatory elements based on the sigma factor family has allowed for the implementation of multi-layer ge- netic cascades in which the protein produced in each stage is the input that is required to promote or inhibit the following stage [2, 42, 118, 143, 144]. 1.2 Types of Cell-free systems Cell-free systems can be broadly classified into two major classes: crude cell lysate-based systems and reconstituted systems. Cell lysate-based systems have been in use for the major part of the history of cell-free systems. These are prepared by the lysis of cells where the cell’s transcription and translation ma- chinery is retained while cellular debris and chromosomal DNA are discarded. 2 This is followed by the addition of necessary components such as amino acids, energy sources, salts, and buffers. They can be prepared from a variety of cells such as cells from bacteriae, protozoa, plants, insects, and mammals. E. coli , S. cerevisiae, HeLa, rabbit reticulocytes, wheat germ, and High Five in- sect cells are commonly used cells for the preparation of such cell-free extracts [9, 35, 42, 127, 132, 149, 193]. Figure 1.1: Cell-free system based on cell lysate. The Cell-free lysate is prepared by the lysis of cells followed by removal of cellular debris and chromosome DNA. The energy sources, amino acids, nucleotides, and cofactors are added to this to prepare the cell-free extract. The template DNA can be added to it immediately to express the target protein. Alternatively, the cell-free extract can be freeze-dried as pellets for storage or transportation along with lyophilized DNA constructs. These can be rehydrated through the simple addition of water for the synthesis of target proteins. [179] On the other hand, reconstituted systems are well defined, and are prepared using a ”bottom-up” approach where the factors essential for protein synthe- sis such as the purified enzymes, tRNAs, ribosomes (either prokaryotic or eu- karyotic), amino acids, NTPs, and other energy molecules are reconstituted into a system. The PURE (Protein synthesis Using Recombinant Elements) system developed by Shimizu et al. was one of the first reconstituted cell-free sys- tems [142]. 3 Both classes of cell-free systems tend to have their own set of benefits and drawbacks. Crude cell extracts tend to be less expensive and have higher yields than reconstituted systems [46]. They also offer more complex metabolic capa- bilities that can be exploited for energy regeneration, that allow for larger-scale reactions and extend the duration of protein synthesis [56, 160]. Reconstituted systems also have their own advantages. For one, since the reconstituted systems are prepared from individual components, their exact composition is known and they can be used for studying biological processes including protein expression and folding in the context of a completely defined system. For instance, Li et al. showed that the efficiency of the PURE system could be improved up to 5 fold by adding or adjusting a variety of factors that affect transcription and translation, such as elongation factors, ribosome recy- cling factors, release factors, chaperones, BSA and tRNAs. showed that the ef- ficiency of protein synthesis was limited by translation elongation capacity, ri- bosome release, and ribosome recycling [90]. Reconstituted systems such as the PURE system can also be reconstituted in such a way that they do not contain proteases and nucleases, to further improve protein production [142, 160]. 4 1.3 Applications of Cell-free platforms Figure 1.2: Cell-free systems and their applications. The left-hand side of the image indicates the components needed for a cell-free system and the right- hand side indicates the various fields in which cell-free systems can be applied. [170] Cell-free systems have found many roles in synthetic biology ranging from portable diagnostics to fundamental discovery and prototyping. Cell-free sys- tems have been very useful for the development of biosensors because of their unique advantages over cell-based sensors. They can be used to detect cell wall- impermeable or cytotoxic analytes, and they are immune to issues like muta- tions and plasmid loss [149]. Cell-free biosensors can be lyophilized (freeze- dried) which allows them to maintain their activity for months for use in 5 portable diagnostics. They can then be transported to the desired location, rehy- drated with water for field use [121,160]. Low-cost cell-free paper-based biosen- sors have been utilized for the detection of viruses such as norovirus, Ebola virus, and Zika virus [97, 121, 122]. Hamada et al. made use of cell-free plat- forms for the bottom-up construction of dynamic biomaterials with emergent locomotion behavior powered by artificial metabolism. These could have appli- cations in pathogen detection, and as scaffolds for hybrid nanomaterials [51]. Cell-free systems have a huge scope for both small-scale prototyping of bio- logical processes and larger-scale bioengineering efforts. Because of their abil- ity to allow for direct manipulation of the chemical environment, prototyping is more efficient in cell-free systems. They can also be used to prototype in- dividual genetic parts such as promoters or complex genetic designs in vitro before implementing them in vivo. Moreover, the genetic constructs need not be assembled into plasmids as cell-free systems even allow for the use of lin- ear DNA [149]. Cell-free systems can also be used to analyze how individual genetic parts function together in synthetic genetic circuits by considering each individual part as a logic gate. A number of cell-free genetic circuits have been assembled and prototyped using this strategy including sigma factor-based cas- cades, RNA transcriptional cascades, RNA single input modules, and feed- forward/feedback loops [2, 42, 118, 162, 163]. Developments in experimental setup and analysis techniques such as the use of real-time fluorescent probes for mRNA measurement, automated liquid- handling robotic systems for rapid screening of cell-free reactions, microflu- idic devices, and cell-sized droplet-based expression have all improved the effi- ciency of prototyping in cell-free systems [25, 106, 107, 115, 133, 149]. 6 1.3.1 Production of Biologics and Specialized Proteins CFPS can be utilized for the production of biologics and other specialized pro- teins such as membrane proteins [44,93,113,145,160,185]. The ease of rapid par- allel assay and screening with cell-free systems makes them important for the production of libraries for protein engineering and post-genomic research [164]. Recently, cell-free systems have also been used to achieve proteins with complex post-translational modifications such as N-linked glycoproteins [47]. A single- pot glycoprotein synthesis system has also been developed by Jaroentomeechai et al. that allows for on-demand biomanufacturing of glycoproteins [65]. Even proteins incorporating noncanonical amino acids have been synthesized using CFPS systems [6, 99]. By constructing cell-free systems in microfluidic devices, we can achieve point-of-care (POC) production of proteins. Continuous flow microfluidic re- actors offer more precise control over mixing when compared to batch reactors [53,102]. Georgi et al. have developed an automated, microfluidic CFPS reactor called the TRITT (Transcription - RNA Immobilization and Transfer - Transla- tion) platform that runs both transcription and translation reactions in separate compartments [43]. The quasi-continuous transfer of mRNA to the translation chamber allows for longer CFPS reactions and better protein yield. Microfluidic platforms can also be constructed with integrated purification methods such as dialysis and affinity chromatography [107, 114]. 7 1.3.2 Metabolic Engineering Cell-free systems possess many advantages over in vivo systems for metabolic engineering applications. The cells in vivo systems have their own objec- tives such as growth or maintenance, which uses up more resources and drives metabolic flux away from the desired pathways making it challenging to achieve high flux in those pathways. On the other hand, in cell-free systems, the allocation of resources to the production of desired products can be maxi- mized [48]. The complexity of living cells also makes computational modeling and optimization of metabolic flux difficult [32]. Cell-free systems, on the other hand, can be accurately modeled and the reaction environment can be directly manipulated and tuned according to the bio-synthetic needs. Metabolic engineering in cell-free systems has been used to increase flux through enzymatic pathways and improve product yield for the production of high-value small molecule products such as cannabinoids, polyhydroxybu- tyrate bioplastic, isobutanol, ethanol, limonene, etc. [8,33,49,82,120,174]. It has also been used to handle bottlenecks in CFPS such as the need for energy and cofactor regeneration in the system [14, 18, 68, 77]. 1.4 Mathematical Modeling of Cell-free systems There have been several mathematical models of CFPS. Most of these models focus on the TXTL processes and are mostly systems of ordinary differential equations (ODEs) based on saturation or Hill-like equations. Karzbrun et al. derived a coarse-grained model comprising the entire en- 8 dogenous transcription, translation, as well as mRNA and protein degradation machinery for an E.coli cell-free extract [74]. To simplify calculations, this model was based on four enzymes and ten relevant rate constants. Transcription and translation processes were assumed to follow Michaelis–Menten kinetics. They assumed that the protein expression was governed by zeroth-order degradation and noted that the expression followed a sharp transition from undetectable lev- els to constant-rate accumulation, without reaching steady state. However, their study only focused on the first hour of the reaction as the protein synthesis rate of their system began to exponentially decay after that. This sudden decay was assumed to be due to resource depletion and waste accumulation. Stögbauer et al. developed a model that accounted for resource consumption and degradation [157]. The model predicts that the protein yield depends on the experiment timing in addition to template DNA concentration. They attempted to use Hill functions to better predict the observed saturation effects of mRNA and protein, but the optimized Hill coefficients were close to one, effectively reducing the Hill equation to Michaelis–Menten equation. They identified ribo- somal degradation as the cause for the cessation of cell-free protein synthesis in the specific reconstituted system that they used. More recently, Neiß et al. developed a comprehensive experimentally val- idated model that was able to identify the major limiting factors of cell-free protein synthesis - the supply of ternary complexes consisting of EFTu and tRNA [116]. This model used an unusual hybrid black-box approach: Although the translation processes were well detailed, the transcription processes were simplified. The entire model was a large system of differential algebraic equa- tions - a system of eight algebraic equations and over 400 ODEs. Gyorgy and 9 Murry’s model explicitly included competition for shared resources in genetic networks [50]. This model also predicted possible product concentrations in multiple-protein expression systems. The authors illustrated that resource com- petition was a key consideration in genetic circuit design. Genetic circuits based on regulation by RNA have also been explored and mathematical models for such systems have been developed for cell-free plat- forms. Transcriptional regulatory RNAs are of interest because they bypass the need for regulatory proteins and they have been used to create various logic gates and cascades [17, 21, 60, 94]. Hu et al. developed an experimentally val- idated model of a synthetic RNA circuit that contained 8 ODEs and 13 previ- ously unknown parameters which were estimated from sensitivity analysis [60]. Although these models of transcription, translation, resource competition, and gene regulatory circuits have provided useful information for optimizing cell- free biomanufacturing, more sophisticated models integrating metabolic path- ways are necessary. 1.4.1 Metabolic Modeling Traditional approaches to metabolic modeling, which were first developed to describe living cells, could also be applied to cell-free systems. Mechanistic in vivo metabolic models arose from the desire to predict microbial phenotypes resulting from changes in intracellular or extracellular states [38]. The Shuler lab constructed large-scale, dynamic metabolic models for single-cells that in- corporated multiple regulated catabolic and anabolic pathways constrained by experimentally determined kinetic parameters [31, 156, 192], minimal cell archi- 10 tectures [19], and DNA sequence-based whole-cell models of E. coli [10]. Next- generation metabolic models described cellular processes such as RNA synthe- sis, chromosome synthesis, and regulated catabolic and macromolecular syn- thesis pathways in detail using ODEs [171]. Karr et al. even developed a whole- cell computational model of the life cycle of the human pathogen Mycoplasma genitalium that includes all of its molecular components and their interactions inside the cell. However, traditional metabolic modeling approaches are often complex and nonlinear and require the estimation of a large number of unknown param- eters. This is a difficult process because of the inherent noisiness of biolog- ical data and the computational complexity involved in the repeated solving of the model equations. To overcome such obstacles, constraint-based meth- ods were developed to describe metabolic networks with only a limited need for kinetic parameters [175]. Constraint-based approaches such as Flux Bal- ance Analysis (FBA) have been used for the stoichiometric reconstructions of metabolism [88]. FBA, Metabolic Flux Analysis (MFA), and convex network decomposition approaches such as elementary modes and extreme pathways have been developed to model intracellular metabolism using the biochemi- cal stoichiometry and other constraints such as thermodynamic feasibility un- der pseudo-steady-state conditions [52, 55, 137, 139, 189]. Such constraint-based approaches use linear programming to predict productivity, mutant behavior, and growth phenotypes for biochemical networks of even genome-scale net- works [34, 119, 135, 175]. The objective function used in such constraint-based approaches in a cell-free system is the maximization of the desired protein yield. These stoichiometric reconstructions have been expanded to even include 11 the metabolic demands for protein synthesis. One of the earliest such mod- els were the sequence-specific constraint-based models developed by Allen and Palsson, which depended on the DNA and protein sequences of interest and the TXTL processes were integrated with metabolism [7]. Since then, sequence- specific constraint-based models have been expanded to the genome-scale with detailed descriptions of gene expression (ME-model) and protein structures (GEM-PRO) [7, 20, 88, 119, 201]. These developments have greatly expanded the scope of stoichiometric reconstructions. Constraint-based methods can be potentially used to predict non-intuitive strategies to optimize the interaction between metabolism and gene expression in cell-free platforms. Thus, the use of integrated constraint-based models for cell-free optimization studies is a promising future research direction. 12 1.4.2 Integrated cell-free models Figure 1.3: The integration of metabolism with transcription and translation (TXTL) processes: The TXTL processes utilize the macromolecular precursors from metabolism (such as NTPs, amino acids, and cofactors) for gene expres- sion. The integrated framework can be represented as a stoichiometric matrix of metabolites participating in certain reactions, along with a description of the metabolic demands for protein expression. By applying a pseudo-steady-state assumption, making use of various constraints and an objective function, the metabolic fluxes in the system can be estimated. In the case of cell-free systems, the objective function is the maximization of target protein yield. [177] Although CFPS also depends on central carbon metabolism and other metabolic pathways and doesn’t just rely on TXTL processes, very few mathematical mod- els have managed to integrate cell-free TXTL processes with metabolic path- ways [24, 59, 181]. Wayman et al. developed a hybrid cell-free modeling ap- proach of Wayman that integrated kinetic modeling with a rule-based descrip- tion of allosteric control [186]. Horvath et al. developed an ensemble of dynamic E. coli CFPS models on top of this, using parameters estimated from measure- ments of metabolite, amino acid, and protein concentrations from CFPS reac- tions conducted using the PANOx-SP cell-free system [59]. By simulating re- action group knockouts, they suggested that cell-free metabolism and protein 13 synthesis were strongly coupled with oxidative phosphorylation and glycolytic flux. Vilkhovoy et al. developed an experimentally validated constraint-based model of CFPS [181] to circumvent computationally expensive parameter esti- mation. This model integrated the expression of a model protein product with the supply of metabolic precursors and energy (Figure 1.3). This model coupled TXTL processes with the available resources using only six adjustable parame- ters. Model analysis suggested that protein expression in the PANOx-SP system was translationally limited. Further, the same modeling approach was also able to describe protein expression in the myTXTL system using only a limited num- ber of experimentally derived parameters. Taken together, the incorporation of complex metabolism with genetic regulatory networks using constraint-based modeling is a promising approach to simulate cell-free systems. 14 CHAPTER 2 DEVELOPMENT OF A GLUCONATE-RESPONSIVE TRANSCRIPTION FACTOR-BASED GENETIC CIRCUIT IN A RECONSTITUTED CELL-FREE SYSTEM 2.1 Introduction Diabetes mellitus is a group of metabolic diseases that can be characterized by chronic hyperglycemia occurring due to imperfect insulin synthesis, insulin ac- tion, or both [76]. Diabetes is increasing at an alarming rate globally. According to International Diabetes Federation, about 463 million adults were living with diabetes in 2019 and this number is expected to grow to 700 million by 2045 [1]. There are two main classes of the disease: (i) Type 1 diabetes occurs due to the destruction of β cells of the pancreas causing the body to produce little to no insulin [26, 30]. (ii) Type 2 diabetes is the more prevalent one where the body doesn’t utilize the insulin well and may also not produce insulin [76]. Insulin is a hormone necessary to prevent blood sugar levels from rising too high (hyperglycemia) by allowing adipose tissues to take up glucose. Glucagon is a counter-regulatory hormone for insulin that stops blood sugar levels from dropping too low (hypoglycemia) by breaking down glycogen to produce glu- cose [70]. Cell-free systems can be engineered to create an input-responsive, on-demand, protein synthesis system. In this work, we propose a genetic glu- cose sensor circuit to direct the downstream synthesis of insulin or glucagon. This circuit can be used for POC treatment of diabetes. A number of effective metabolic engineering strategies have made use of 15 the advances in promoter engineering to dynamically regulate the expression of pathway enzymes to optimize product yields [28, 58, 80, 91]. Allosteric tran- scription factors (aTFs) are a core component of this. An aTF is a protein that binds to the operator sequence of a DNA and prevents transcription. Upon binding to a specific ligand, the aTF undergoes a conformational change that al- ters its affinity for an operator DNA sequence [167]. Ligand binding leads to the release of the aTF from the operator, allowing the RNA polymerase holoenzyme to effectively bind to the promoter and increase the transcription rate. aTFs have been co-opted for use as gene expression switches [96]. Figure 2.1: In this image, the first box indicates the components involved in an aTF regulated circuit: the RNA polymerases, aTFs, and DNA transcription templates. The second box indicates the circuit in the absence of the ligand. In the presence of the ligand, the aTF will no longer bind to the operator sequence and allows expression of the downstream reporter as shown in the third box. [72] More recently, aTFs have been increasingly leveraged for use in POC biosen- sors. CFPS is a favorable platform for implementing biosensors because of its portability, ability to manipulate the reaction environment in real-time, and tol- erance to cytotoxic environments. aTFs can be used to ’sense’ a specific molecule (the ligand that suppresses it) by altering the gene expression of a downstream reporter protein. By monitoring the signal from the reporter, it would be pos- sible to determine the amount of ligand present in the system. Such biosensors have recently been used for detecting ions, biomarkers, and antibiotics (See Fig- ure 2.1) [72, 153, 182]. 16 The ability to sense glucose concentration in the bloodstream and direct the synthesis of either insulin or glucagon (or a combination of both) via the use of genetic circuits is very attractive, and given the advanced state of cell-free sys- tems, it is an achievable feat. Toward this aim, it is important to first construct a transcriptional unit that can effectively respond to varying levels of glucose or its derivatives that can act as its proxy. We aim to design a biosensing system that has the ability to sense glucose concentration in the bloodstream and direct the synthesis of either insulin or glucagon hormone (or a combination of both). In order to achieve this, we need to construct a transcriptional unit that can effectively respond to variation in the concentration of glucose or its derivatives (such as D-gluconate) that can act as its proxy. We plan to study the effectiveness of aTFs towards this purpose. An aTF has a DNA binding region at the N-terminal and a ligand-binding region at the C-terminal. Studies on the Gluconate operon transcriptional repressor (GntR) family of E. coli and B. subtilis have shown that two derivatives of glucose, D- glucono-1,5-lactone and D-gluconate, can bind to GntR protein at the C-terminal and as an anti-repressor [39, 124, 130, 148, 173]. Since our biosensor should be able to sense the presence of glucose in the system we decided to make use of the GntR family in the biosensor. In this work, we aimed to first construct a promoter (incorporating the op- erator sequence of GntR) that can be efficiently used in an E. coli-based CFPS. Toward this aim, we first cloned the GntR operator sequence to a bacteriophage lambda promoter (P70) developed by Noireaux and coworkers [143]. Next, we added a reporter protein gene downstream of this modified promoter to easily 17 characterize the promoter dynamics. Earlier work done by my colleague, Abhinav Adhikari in the VarnerLab made use of a deGFP reporter constructed with an mP70 promoter. This work studied the repression and de-repression performance of GntR in the myTXTL system. However, the D-gluconate was consumed by the metabolism in the myTXTL system resulting in poor de-repression. So in this work, we plan to make use of the PURE cell-free system which lacks the metabolic capability to unintentionally consume D-gluconate. The PURExpress system also makes it easier to purify the protein products. Since all the components of the system (other than the ribosomes) are his-tagged, protein products in the native form (with no His tag) can be purified easily and rapidly through ultrafiltration (to re- move the ribosomes) followed by the use of Ni-NTA affinity column (to remove the His-tagged components). In addition to this, we also changed the reporter protein. We use the Yellow Fluorescent Protein, Venus, instead of deGFP because Venus is a much brighter reporter protein in this system, the time it took to reach maximum growth was similar to different mutated GFP variants [87]. The CFPS reactions to express Venus were carried out either in the absence or presence of the GntR repressor gene. We characterized the repression and de- repression characteristics (using D-gluconate) in the PURExpress reconstituted system based on PURE. Taken together, our results show that the GntR protein effectively represses the expression of the Venus reporter protein, and that the addition of D-gluconate is able to inhibit this repression. 18 2.2 Materials and Methods 2.2.1 mP70 promoter construction The first step in constructing the promoter responsive to D-gluconate levels was to include the operator sequence for the E.coli GntR in the promoter region. The sequences of the operator and the repressor were obtained from the litera- ture [124]. The 16 bp E. coli GntR operator sequence (ATGTTACCCGTATCAT) was added between the -35 and -10 regions of mutated bacteriophage lambda promoter (P70) developed by Noireaux and coworkers [143]. This modified P70 promoter with added GntR operator site was named mP70. The -10 binding site of the RpoD holoenzyme was altered from GATAAT in the original promoter to TATCAT in the mP70. The gene sequence of the reporter protein, Venus, was added downstream of this modified promoter. On the other hand, the GntR repressor gene was cloned downstream of the original P70 promoter. The ter- minator and 5’ UTR for the DNA with the mP70 promoter were identical to that of the DNA with the P70 promoter. The full constructs were ordered as linear DNA fragments, with 150-200 bp flanker sequences on both ends, from Twist Bioscience and Integrated DNA Technologies. 2.2.2 Synthetic Circuits Architecture The three genetic circuits (C1, C2, and C3) used in this study were based upon the bacterial sigma factor regulatory system. Sigma factor 70 (σ70) was the pri- mary driver of each circuit. The E.coli RNA Polymerase Holoenzyme is the core enzyme saturated withσ70 and allows for RNA synthesis fromσ70 specific pro- 19 moters. In C1, σ70 induced Venus (YFP) expression was explored in the absence of additional regulators (Figure 2.2(a)). In C2, σ70 induced the expression of GntR and Venus (Figure 2.2(b)). The GntR protein repressed the mP70 promoter of Venus, thereby down-regulating Venus transcription. In C3, D-gluconate was introduced into the circuit (Figure 2.3). D-gluconate suppresses the binding of GntR to the mP70 promoter. This de-represses the Venus transcription. Study- ing C1 allows us to estimate parameters governing the interaction of σ70 with the mP70 promoter. Whereas, the C2 allows us to characterize the strength of the transcriptional repression by GntR. Finally, C3 allows us to characterize the de-repression effect D-gluconate has on GntR. (a) (b) Figure 2.2: (a) C1 Circuit: σ70 induces Venus expression (b) C2 Circuit: σ70 induces the expression of GntR and Venus. The GntR protein once expressed represses the mP70 promoter of Venus. 20 Figure 2.3: C3 circuit: σ70 induces the expression of GntR and Venus. The GntR protein once expressed represses the mP70 promoter of Venus. In the presence of a high concentration of D-Gluconate, the binding of GntR to mP70 gets repressed thereby de-repressing Venus transcription 2.2.3 Cell-Free Protein Synthesis Reactions The cell-free protein synthesis (CFPS) reactions were carried out using the PUR- Express In Vitro Protein Synthesis Kit (New England Biolabs Inc) in 120 µL 384- well plates (Thermofisher NUNC, flat-bottom) in Varioskan Lux plate reader at 37°C. The working volume of all the reactions was 25 µL, composed of the PURE solutions A and B (17.5 µL), RNase inhibitor, Murine (0.5 µL), NEB E.coli RNA Polymerase holoenzyme (2 µL), the linear DNA: Venus (7 nM), GntR (10 nM). The E. coli RNA Polymerase, Holoenzyme is the core enzyme saturated with sigma factor 70 and initiates RNA synthesis from sigma 70 specific pro- moters. The full constructs were ordered as linear DNA fragments, with 150- 200 bp flanker sequences on both ends, from Twist Bioscience and Integrated DNA Technologies. For the control reactions without GntR, an equal volume of nuclease-free water was used in its stead. For the de-repression reactions, 21 D-gluconate (10 mM) was added while maintaining the same total volume. 2.2.4 mRNA Quantification Following each CFPS run, the total RNA was extracted from 5 µL reaction mix- ture using PureLink RNA Mini Kit (Thermo Fisher Scientific) and stored at −80°C. The quantitative RT-PCR reactions were done using Applied Biosystems TaqMan RNA-to-CT 1-Step Kit and Custom TaqMan Gene Expression Assays (Thermo Fisher Scientific). An mRNA standard curve was used to determine absolute mRNA concentrations for each of the samples. The mRNA standards were prepared as follows: separate CFPS reactions for 5 nM of DNA (Venus and GntR) were carried out for 2 hr. Total RNA was extracted using the full reac- tion volume. This was followed by the removal of 16S and 23S rRNA using the MICROBExpress Bacterial mRNA Enrichment Kit (Life Technologies Corpora- tion). Lastly, the MEGAclear Kit (Life Technologies Corporation) was used to further purify the mRNA. The mRNA concentrations were determined using the Qubit™ RNA assay kit (ThermoFisher Scientific). At least three technical replicates were performed for each standard. 2.2.5 Protein Quantification Venus (Yellow Fluorescent Protein) fluorescence was measured using the Var- ioskan Lux plate reader at 513 nm (excitation) and 531 nm (emission). The flu- orescence was measured with 25 µL for each of the mixtures. For all measure- ments, at least three biological replicates were performed. A protein standard 22 curve was used to determine the absolute protein concentrations for each of the samples. The standard curve was prepared by using Venus standards of known Relative Fluorescence Units (RFU). An SDS-PAGE of these standards along with a Protein ladder of samples with known concentrations was performed at 300V until the dye fronts reached the bottom of the gel. The SDS-PAGE was per- formed in a Mini-PROTEAN Tetra Cell (Bio-Rad Laboratories) using Any kD™ Mini-PROTEAN® TGX Stain-Free™ Protein Gels, 15 well, 15 µl gels (Bio-Rad Laboratories). This was followed by staining the gel with Bio-Safe™ Coomassie Stain (Bio-Rad Laboratories). The stained gel was visualized using the Chemi- Doc MP and then the concentrations were measured using Image Lab software. 2.2.6 Estimation of model parameters To model the circuit, the mRNA and protein balances for mRNA mi and protein pi are described using the following balance equations ṁi = rX,iui − θm,imi (2.1) ṗi = rL,iwi − θp,i pi (2.2) where rX and rL are the transcription and translation rates and θm and θp are the respective degradation rates. Here, wi and ui are the control functions modeled as: ( ) − ln(2)ẇi =∑ wi (2.3)τL,1/2∑ j∈χ W j f j(...)ui = (2.4) i∈C Wi fi(...)i Wi denotes the weight of configuration i and fi is the binding function modeled as a hill function. The rate of transcription rX, ju j was modeled as the product of a 23 kinetic limit, rX, j, and a control term u j ∈ [0, 1]. The kinetic limit of transcription was derived from elementary reactions leading to the formation of m j, as previ- ously done by McClure [101]. The kinetic limit of transcription was formulated as:  G jr = Vmax  ( ) X, j X j   , (2.5) τX, jKX, j + τX, j + 1 G j where the initiation time constant τX, j, and saturation constant KX, j were exper- imentally measured by McClure [101]. The maximum transcription rate, VmaxX, j , was given by: [ ( )] Vmax v̇X X, j = RX,T (2.6)lG, j where RX,T is the RNA polymerase concentration (nM), v̇X is the RNA poly- merase elongation rate (nt/h) and lG, j is the length of gene j in nucleotides (nt). By analogy, the kinetic limit of translation was formulated as: max  r V  x( mRNAL, j = L j ) , (2.7) τL, jKL, j + τL, j + 1 xmRNA where x is the mRNA concentration and VmaxmRNA L, j is the maximum translation rate which was formulated as: [ ( )] Vmax v̇L L, j = KPRL,T (2.8)lP, j The model previously used by Adhikari [2] was extremely effective in capturing the expression dynamics of mRNA and protein in synthetic cell-free circuits, but it suffered from one major drawback. The parameters related to the Gibbs free energy of binding between RNAP and other genes in the systems were over- estimated. This was primarily due to the lack of literature that provides the constraints for ∆G. Since the Gibbs free energy of binding drives the weight of a particular configuration, any overestimation will lead to a high background expression. This will affect the transcription rate which will thereby affect the 24 translation rate and the protein expression. Therefore, to address this issue, we revised the model by changing the way the control function for transcrip- tion is formulated. A similar modeling approach as previously mentioned was adopted but the control functions w∑ere formulated as -∑ j∈χ W j f j(...)ui = (2.9) i∈C W f (...)i i i where W was formulated as- ( ) −∆E W ii = exp (2.10)kbT This had a two-fold advantage. Firstly, there were literature estimates available for difference in binding energy (∆E) from Bintu and coworkers [12], in refer- ence units, as multiples of Boltzmann constant (kb) and temperature (T). More- over, this formulation was much closer to the statistical mechanics approach we have undertaken to represent the configurations. 25 2.3 Results and Discussion Figure 2.4: Venus protein expression without GntR (C1 - Red), Venus repression by GntR (C2 - Blue), and de-repression by D-gluconate (C3 - Green). The shaded regions represent one standard deviation of the experimental measurements We first tested the performance of linear DNA with the mP70-Venus construct (7 nM) in the PURExpress system (C1 circuit) in order to obtain a standard for comparison for the repression and de-repression characteristics. In order to characterize the repression by Gntr, we made use of the C2 circuit. We ex- pressed two linear DNA fragments: E. coli GntR (10 nM) and mP70-Venus (7 nM). For the control case, the GntR DNA was excluded from the reaction mix- ture. While the Venus concentration in the control case steadily increased, it reached a steady plateau in approximately 4 hours in the +GntR reaction (well before the -GntR control plateaus) and had a significant reduction in Venus ex- pression. This showed that GntR was indeed repressing Venus expression. 26 In order to test the de-repression characteristic of GntR, we added 10mM of D-gluconate (approximately 9 times the value of its KD established in litera- ture [23]) along with GntR and Venus (C3 circuit). Although the Venus concen- tration was not as high as in the case of the C1 circuit, it was appreciably higher compared to when D-gluconate was not added (C2 circuit). 2.3.1 Parameter Estimation (a) (b) Figure 2.5: Model simulations versus experimental measurements for σ70 in- duced Venus expression for the C1 circuit. (a) Simulated and measured Venus mRNA concentration versus time. (b) Simulated and measured Venus protein concentration versus time. The black points indicate the experimental values while the black curve is the predicted curve. The shaded region denotes the 95% confidence interval of the predicted curve. The error bars denote one stan- dard deviation of the experimental data Parameter estimation for the C1 circuit was set up as a multi-objective opti- mization problem with two objectives. One objective was the minimization of the error between experimental and simulated values of Venus mRNA, and the other for the corresponding Venus protein measurements. The algorithm was run for 20 generations and produced an ensemble of 187 parameter sets, of the 27 11 unknown model parameters. The criterion for a parameter ensemble selec- tion was a Pareto rank of 2 or below. The parameters generated for the C1 circuit were able to create a good fit with the experimental data (Figure 2.5). (a) (b) (c) Figure 2.6: Model simulations versus experimental measurements for the C2 circuit. (a) Simulated and measured Venus mRNA concentration versus time. (b) Simulated and measured Venus protein concentration versus time. (c) Sim- ulated and measured Gntr mRNA concentration versus time. The black points indicate the experimental values while the black curve is the predicted curve. The shaded region denotes the 95% confidence interval of the predicted curve. The error bars denote one standard deviation of the experimental data In the case of the C2 circuit, the multi-objective optimization problem had three objectives: Minimization of error between experimental and simulated values of (i)Venus protein (ii) Venus mRNA, and (iii) GntR mRNA. The algo- rithm was run for 10 generations to estimate 22 unknown model parameters and estimated an ensemble of 565 parameter sets that had a Pareto rank of 2 or below. Although less effectively than for the C1 circuit, the model was able to capture the mRNA concentration profile for Venus fairly well. The model accurately captured the mRNA concentration profile of GntR and protein ex- pression of Venus. Taken together, the model is able to simulate the repression of mP70-Venus in the presence of GntR in a PURE system. 28 2.3.2 Future Directions The D-gluconate levels at the different time points can be measured by Liq- uid Chromatography/Mass Spectrometry (LCMS) using our lab’s published derivatization protocol [180]. D-gluconate acts as a dynamic second messen- ger for D-glucose concentration in myTXTL cell-free extracts. We used the gluconate-responsive transcription factor GntR as the key transcription element for the glucose sensor circuit. Thus, information about the background D- gluconate concentration, and its temporal evolution in response to changes in glucose concentration, is critical to the glucose-sensing capability of the pro- posed circuit. We also need to construct a dose-response curve to study the de-repression of GntR in the circuit at different concentrations of D-gluconate. This would be necessary to better optimize the switch behavior of the circuit. Secondly, in order to fully characterize repression and de-repression, we plan to use different concentrations of the DNA construct with the mP70 pro- moter and the GntR protein to perform Electrophoretic Mobility Shift Assays (EMSA) in order to determine the binding parameters between Gntr protein and mP70. We also plan to do end point measurements of the reporter fluorescence for these concentrations. We also plan to implement real-time measurement of mRNA to get more mRNA measurements making it easier to develop accurate models [115]. Thirdly, we also plan to use different GntR constructs derived from P. aerug- inosa and B. subtilis and test their effectiveness in repressing reporter fluores- cence. The P. aeruginosa construct was shown to be partly responsive to D- glucose, eliminating the need for a proxy, D-gluconate [23]. We are also in the process of testing the effectiveness of a circuit which uses promoters recognized 29 by T7 polymerase instead of the E. coli RNAP. The T7 polymerase provides a two-fold advantage over the E. coli RNAP: It synthesizes RNA at a much higher rate [161]; It is inherently present in the PURE system and doesn’t need to be added separately [142]. Fourthly, we plan to develop a mathematical model for the C3 circuit to take into effect the presence of D-gluconate in the system to effectively characterize transcription and translation behavior. We would have to estimate the model parameters using multi-objective optimization similar to how they were deter- mined for C1 and C2 circuits. Such a model would also help guide our future experiments based on this system. Lastly, we also plan to further engineer the GntR protein (and the promoter region) to enable more efficient repression and derepression properties. Muta- tions in the C-terminal region of GntR have been shown to affect its ability to bind to D-gluconate while mutations in the N-terminal region affect operator binding properties [196, 197]. On the basis of these details, we plan to gen- erate a library of GntR protein with random mutations in the C-terminal and N-terminal regions via error-prone PCR in order to discover GntR mutants that have good operator binding strength, while still being effectively antagonized by D-gluconate. Once we have accomplished these, we can test the circuit with fluorescent labeled insulin and glucagon. The ultimate aim is to make use of this transcrip- tional unit to construct a viable glucose biosensor (Figure 2.7) that is capable of producing insulin at high glucose (gluconate) levels and glucagon at high glucose (gluconate) levels. 30 Figure 2.7: Schematic of a glucose biosensor: σ70 induces the expression of GntR, σ28, insulin and α-σ28 which is downstream to insulin. σ28 induces ex- pression of glucagon while α-σ28 inhibits the P28 promoter thereby inhibiting the expression of glucagon. When the gluconate levels are high, insulin will be expressed along with σ28. At low gluconate levels, only glucagon will be expressed 31 CHAPTER 3 I(NSP)ECTION OF THE SARS-COV-2 NSP1’S HOST-TRANSLATION EFFECT IN A CELL-FREE SYSTEM 3.1 Introduction The SARS-CoV-2 virus which is the cause of the unprecedented Covid-19 pan- demic belongs to the class of coronaviruses. Coronaviruses are enveloped viruses with a positive-sense single-stranded RNA genome. This mRNA con- tains a 5’ cap structure along with a 3’ poly (A) tail allowing it to serve as the genetic material as well as the messenger RNA which translates to the pro- teins [36, 183]. The SARS-CoV-2 has a genome of ∼30 kb. It possesses 14 Open Reading Frames (ORFs) encoding for a total of 27 known proteins [75,78,131]. The over- lapping ORFs ORF1a and ORF1ab located at the 5’ terminal of the genome en- code the pp1a and pp1ab polyproteins respectively. The translation of ORF1ab is mediated by a -1 ribosomal frameshift at the overlap of these two ORFs [37, 183]. The other ORFs are located at the 3’ terminal and encode for the 4 structural proteins (Spike protein (S), Small Envelope protein (E), Matrix pro- tein (M), Nucleocapsid (N)) and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, ORF14) [183, 184, 191]. The pp1a and pp1ab polyproteins together contain 15 functional non- structural proteins (nsp) (nsp11 is a 13 amino acid cleavage product that isn’t known to have any function [109, 159]). These nsps are released from pp1a and pp1ab upon proteolytic cleavage by nsp3 and nsp5 [37, 183]. 32 The non-structural protein 1 (nsp1), also known as the host shutoff factor, is encoded by the gene closest to the 5’ end of the viral genome and is one of the first proteins to be expressed after the virus infects a host cell [138,198]. It is one of the most important nsps that is necessary for the propagation of the virus. It inhibits host gene expression and helps the virus suppress the host immune response [110, 159, 169, 183]. In other coronaviruses like the SARS-CoV and MERS-CoV, the nsp1 inhibits host translation by forming a complex with the 40S ribosomal unit which ren- ders this inactive for host translation [66,108,110,111,138,165,169,198]. Further, this also causes a selective endonucleolytic cleavage in the 5’ untranslated re- gion (UTR) of the host mRNA [61, 108, 169]. This promotes degradation of the host mRNA while the viral mRNA remains intact. The SARS-CoV-2 nsp1 has ∼84% amino acid sequence similarity with its SARS-CoV homolog [134, 150, 159, 169]. Such high sequence similarity sug- gests that they share common properties and functions. The SARS-CoV-2 nsp1 binds to the human 40S ribosomal subunit to shut off host translation. The protein inserts its C-terminal domain into the mRNA entry channel of the ri- bosome (at the 18S ribosomal RNA), where it interferes with the host mRNA binding [11, 83, 138, 150, 169]. A stem-loop structure of SARS-CoV-2 mRNA 5’UTR has been observed that is recognized by nsp1. This recognition en- ables selective translation of the viral mRNA while still inhibiting host trans- lation [134,138,150,169]. Thus, nsp1 also helps to boost viral protein production by hijacking more of the cellular machinery for translation of viral proteins [11]. This recognition also protects the viral mRNA from the possible mRNA degra- dation activity of the nsp1 [61, 73, 165]. 33 Type I interferon (IFN) induction and signaling represents one of the major innate antiviral defense mechanisms and many viruses have developed strate- gies to evade this immune response [54,110]. The expression of these IFNs trig- gers innate antiviral immune responses which help to suppress the viral repli- cation. Coronavirus infections are sensed by RIG-I (Retinoic acid–Inducible Gene I)-like receptors (RLRs), which activate this IFN-based defense mecha- nism [129, 150, 154, 169]. nsp1’s translation inhibition functions interfere with the production of these proteins by shutting down the host translation machin- ery. Thus by affecting the translation of antiviral defense factors such as IFN-β or RIG-I, nsp1 effectively weakens the innate immune response [166, 169]. Its central role in the suppression of host translation and the immune re- sponse makes nsp1 a viable target for the development of anti-viral drugs and further studies. In this work, we aimed to produce the SARS-CoV-2 nsp1 using E.coli-based CFPS. Secondly, we wanted to test the potential of nsp1 to inhibit host translation in the same cell-free system (See Figure 3.1). And then math- ematically model the effect of nsp1 on the host translation if it turned out to demonstrate statistically significant inhibition. Figure 3.1: Circuit representing coexpression of nsp1 along with a reporter pro- tein. The nsp1 binds to the ribosomes and selectively prevents translation of the reporter protein. This circuit doesn’t include the potential mRNA degradation activity of nsp1 [61, 73, 165]. (Created with BioRender.com) 34 To this purpose, we designed a DNA construct that was ”reverse-translated” from the sequence of the protein [200]. We used a P70 promoter sequence, along with its 5’ UTR, as it is optimized to maximize expression in the E. coli-based myTXTL Cell-free system [42]. We also carried out an assay to test the ability of nsp1 to inhibit translation of dual emission Green Fluorescent Protein (deGFP) (which would take the place of a host protein) in myTXTL extract. 3.2 Materials and Methods 3.2.1 Cell-Free Protein Synthesis Reactions The cell-free protein synthesis (CFPS) reactions were carried out using the myTXTL Sigma 70 Master Mix (Arbor Biosciences) in 1.5 mL Eppendorf tubes. The working volume of all the reactions was 12.8 µL, composed of the Sigma 70 Master Mix (9 µL), GamS nuclease inhibitor (0.8 µL), and the nsp1 linear DNA fragments (3 µL - 9.38nM). The full constructs were ordered as linear DNA frag- ments, with 150-200 bp flanker sequences on both ends, from Twist Bioscience and Integrated DNA Technologies. The CFPS reactions were incubated at 29°C. The CFPS runs were carried out in triplicate for 2hr, 4hr, 8hr, and 16hr time points. Following each CFPS run, the samples were stored at -80°C. 3.2.2 mRNA Quantification Following each CFPS run, the total RNA was extracted from 5 µL reaction mix- ture using PureLink RNA Mini Kit (Thermo Fisher Scientific) and stored at 35 −80°C. The quantitative RT-PCR reactions were done using Applied Biosystems TaqMan RNA-to-CT 1-Step Kit and Custom TaqMan Gene Expression Assays (Thermo Fisher Scientific). An mRNA standard curve was used to determine absolute mRNA concentrations for each of the samples. The mRNA standards were prepared as follows: CFPS reaction for 5 nM of nsp1 DNA was carried out for 2 hr. Total RNA was extracted using the full reaction volume. This was fol- lowed by the removal of 16S and 23S rRNA using the MICROBExpress Bacterial mRNA Enrichment Kit (Life Technologies Corporation). Lastly, the MEGAclear Kit (Life Technologies Corporation) was used to further purify the mRNA. The mRNA concentrations were determined using the Qubit™ RNA assay kit (Ther- moFisher Scientific). At least three technical replicates were performed for each standard. 3.2.3 Protein Quantification Following each CFPS run, 2 µL of each sample was extracted and added to 4 µL of water. And an equal amount of 2x Laemmli sample buffer (with 2- mercaptoethanol as a reducing agent) was added to these. An SDS-PAGE of 10µL of each of these diluted samples along with a Protein ladder of sam- ples with known concentrations was performed at 300V until the dye fronts reached the bottom of the gel. This was followed by staining the gel with Bio- Safe™ Coomassie Stain (Bio-Rad Laboratories). The SDS-PAGE was performed in a Mini-PROTEAN Tetra Cell (Bio-Rad Laboratories) using Any kD™ Mini- PROTEAN® TGX Stain-Free™ Protein Gels, 15 well, 15 µl gels (Bio-Rad Labo- ratories). The stained gel was visualized using the ChemiDoc MP and then the concentrations were measured using Image Lab software. 36 3.2.4 Coexpression of deGFP with nsp1 The cell-free protein synthesis (CFPS) reactions were carried out using the myTXTL Sigma 70 Master Mix (Arbor Biosciences) in 120 µL 384-well plates (Thermofisher NUNC, flat-bottom) in Varioskan Lux plate reader at 29°C. The working volume of all the reactions was 14.1 µL, composed of the myTXTl ex- tract (9 µL), GamS nuclease inhibitor (0.6 µL), the linear DNA: deGFP (3 µL - 4.26 nM), nsp1 (1.5 µL - 4.26 nM). Two sets of control reactions were also run. In one, instead of nsp1, an equal volume of water was used. And in the other, equal volume and concentration of nsp8 linear DNA construct was used instead of nsp1. nsp8 was chosen as a control as it has a size similar to that of nsp1 but doesn’t inhibit translation unlike nsp1 (though it could affect protein trafficking to the cell membrane) [11]. The Cell-free reactions were carried out in dupli- cates for 16hr. deGFP fluorescence was measured using the Varioskan Lux plate reader at 488 nm (excitation) and 535 nm (emission) at 5 min time intervals. In order to confirm the expression of these proteins, following the CFPS run, 2 µL of each sample was extracted and diluted with 3 µL of water. Following this 5 µL of 2x Laemmli sample buffer was added to these and an SDS-Page of these samples was performed at 200V until the dye fronts reached the bottom of the gel. This was followed by Coomassie staining of the gel and visualization using the Image Lab software. 3.2.5 Qualitative testing of nsp1 binding with E.coli Ribosomes We ran a CFPS reaction of His6-tagged nsp1 in myTXTL. The working volume of the reaction was 12.5 µL, composed of the Sigma 70 Master Mix (9 µL), GamS 37 nuclease inhibitor (0.5 µL), and the nsp1 linear DNA fragments (3 µL - 9.6nM). The CFPS reactions were incubated at 29°C for 12hr. Then we purified the His6- tagged nsp1 using Ni-NTA affinity column. Meanwhile, we separated the E.coli Ribosomes from the PURExpress In Vitro Protein Synthesis Kit (New England Biolabs Inc). Then these ribosomes were added to half the volume of the puri- fied nsp1 proteins and allowed to mix well at 29°C for 15 minutes. Then this sample was loaded into a gel along with pure nsp1 protein samples and a Pro- tein ladder for an SDS-PAGE run. The ribosomes and its subunits are expected to be retained at the top while the nsp1 should be able to pass through. After the SDS-PAGE run, the gel was stained with Bio-Safe™ Coomassie Stain (Bio- Rad Laboratories) and the stained gel was visualized using the ChemiDoc MP. The concentration of nsp1 in each well was measured using Image Lab software (and adjusted for the dilution of nsp1 in the channel with the ribosomes). 3.3 Results and Discussion (a) (b) Figure 3.2: Gene expression of nsp1. The points denote the mean, while the error bars denote one standard deviation calculated from three replicates 38 We were able to successfully synthesize SARS-CoV-2 nsp1 in myTXTL extract. With a starting concentration of 9.38 nM, we were able to obtain up to an aver- age of 2.18 µM of nsp1 protein after a CFPS run of 16hr. The mRNA concentra- tion peaked at about 1010 nM at the 4hr measurement point and continued to decline after that (Figure 3.2). (b) (a) Figure 3.3: Coexpression of deGFP with nsp1: (a) RFU measurements of deGFP when coexpressed with nsp1 (Red). The controls: expression of deGFP alone (Blue), expression of deGFP along with nsp8 (Green). The shaded regions indi- cate one standard deviation (b) SDS-PAGE Analysis of these samples to confirm the production of all the proteins. The gels have been Coomassie-stained From Figure 3.3, we can observe that although coexpressing nsp1 along with deGFP does reduce the synthesis of deGFP, there isn’t a very significant differ- ence from just expressing deGFP by itself. In addition to that, coexpressing deGFP with a control gene coding for the nsp8 protein with a molecular weight (∼ 22.8kDa) just slightly higher than nsp1 (∼ 19.77kDa) [200] also reduces the production of deGFP. This is despite the fact that the nsp8 doesn’t inhibit host translation [11]. This decrease in the production of deGFP in presence of nsp1 or nsp8 can be better explained by a competition for the metabolic resources caused by the introduction of their genes. 39 This also goes to prove that nsp1 doesn’t affect the prokaryotic ribosomes. Unlike the larger mammalian 80S ribosome made up of 60S and 40S subunits, the prokaryotic 70S ribosome is made up of 50S and 30S subunits. The nsp1 which binds to the mRNA entry channel of the 40S subunit doesn’t bind to the corresponding region in the prokaryotic 30S subunit. Therefore, it doesn’t actually inhibit translation in the E.coli based myTXTL system. Figure 3.4: Rudimentary nsp1-70S Ribosome binding assay: The third channel contains pure nsp1. The first and second channels contain nsp1 mixed with two times its volume of purified 70S ribosomes solution. In other words, the concentration of nsp1 in the third channel is thrice the concentration of nsp1 in the first two. Measurement of the fluorescence of the nsp1 bands show that the normalized concentration of nsp1 in all wells are approximately the same indicating that there is negligible binding of nsp1 with these ribosomes We used a rudimentary binding assay to test the binding of nsp1 with E.coli ribosomes. Ribosomes treated with nsp1 proteins were loaded on two channels of a gel alongside a different channel containing pure nsp1. The channels are expected to retain the ribosomes, their subunits, and any nsp1 bound to it at the top but allow the free unbound nsp1 to pass through [4]. The results of the SDS page show a negligible difference between the channel with just nsp1 and the channels with nsp1 and the ribosome. The channel with only nsp1 had 116.4 ng/µL of nsp1 while the ones where it was treated with ribosome had 40 an average of 106 ng/µL of nsp1 (See Figure 3.4). If nsp1 had bound to the ribosome, we would have seen a significantly reduced concentration. These results combined confirm that nsp1 doesn’t bind/shows negligible binding to E.coli ribosomes. Conversely, this does prove that we can still synthe- size nsp1 using myTXTL cell-free extracts and that this synthesis can be mod- eled. 3.3.1 Future Directions There are several directions for this work that can be pursued. First, in order to better characterize this system, we can use the obtained data to fit the pa- rameters for nsp1 gene expression (transcription and translation behavior) in myTXTL using the Pareto Optimal Ensemble Technique. Second, in order to better determine the binding between E.coli ribosomes and nsp1, we can make use of better assays such as Sucrose gradient centrifugation can be used. Lastly, we can repeat these experiments with a mammalian cell-free extract. The nsp1 should be able to bind to the 40S ribosomal subunits of the 80S mammalian ribosomes present in these. We could also analyze nsp1’s binding to ribosomes in cell-free systems based on different mammalian cells such as Rabbit reticulocytes, CHO cells, and HeLa cells [46]. It would also be interesting to study nsp1’s inhibition effects in cell- free extracts prepared from human cell lines of different systems/organs. Cells from human lungs are primarily affected by the SARS-CoV-2 virus infection, so human lung cell lines should theoretically provide the best environment for the growth of the virus [136, 141, 172]. We could compare lung-based cell-free 41 extracts from Calu-3 cells, A549 cells, or HF10B4 cells [103, 188], with other hu- man cell-free extracts like HeLa cells, HEK 293 cells, and K562 cells [16,187,199], to observe if there are any noticeable differences between nsp1’s effect in these systems. With these cell-free systems, we can also study the host translation inhibition of nsp1 and create a mathematical model for the system. Since the 5’UTR of the viral genome helps it bypass nsp1’s translation inhibition and is necessary for viral replication, the nsp1 gene can be designed with this 5’UTR. The nsp1 can be used as a global translation inhibitor to terminate the production of proteins in a suitable cell-free system. 42 CHAPTER 4 MODELING THE EFFECT OF THE RNA-DEPENDENT RNA POLYMERASE IN VITRO 4.1 Introduction The SARS-CoV-2 is a coronavirus and like all coronaviruses, it is a positive-sense single-stranded RNA virus ((+)ssRNA). In other words, its genome consists of a single strand of positive-sense RNA (which goes in the 5’ to 3’ direction) [36, 169,183,184]. This RNA genome serves as the template for both translation and RNA replication. Coronaviruses use an RNA-dependent RNA polymerase (RdRp) for the replication of their genome. The SARS-CoV-2 RdRp forms a complex made of multiple non-structural proteins (nsps). The core of the RdRp is the nsp12 cat- alytic subunit [5]. The nsp12 contains an N-terminal nidovirus RdRp-associated nucleotidyltransferase (NiRAN) domain, an interface domain, and a C-terminal RdRp domain [57, 86, 125]. The RdRp domain consists of fingers, palm, and thumb subdomains [41, 178]. However, nsp12 has little processivity by itself and in order to function properly, it requires the accessory subunits, nsp7 and nsp8 [41,45,79,158,195]. The nsp7 and nsp8 subunits bind to the thumb, and an additional copy of nsp8 binds to the fingers domain [41,79]. The nsp7-nsp8 het- erodimer is believed to be capable of de novo initiation and has been proposed to function as an RNA primase to synthesize small primers that are further ex- tended by the nsp12 [13, 62, 79, 168]. The RdRp also coordinates with many accessory factors such as the nsp13 43 and the nsp14 in order to carry out its functions [22, 98, 152, 194]. The nsp13 is a helicase that is essential for unwinding the RNA complex after replica- tion in a 5’ −→ 3’ direction [63, 64, 85, 140]. Besides this, nsp13 also displays 5’-triphosphatase activity, which suggests a possibility that it is involved in mRNA capping [22,63,64]. The nsp14 which possesses an N-terminal exonucle- ase (ExoN) domain, together with its co-factor nsp10, forms plays an important role as a proofreading machinery during viral replication [29, 105, 151, 152]. Once the virus enters the host cells, its positive-sense RNA (+mRNA) is able to act as a template immediately without having to undergo any modification. This allows the RdRp to use it to synthesize complementary negative-sense RNA (-mRNA), immediately. The initiation of synthesis occurs at the 3’ end of the template and proceeds in the 5’−→3’ direction [178]. This complementary strand that is synthesized is then, itself, able to act as a template for the pro- duction of new positive-stranded viral genomes. Meanwhile, the positive-sense RNA is also translated using the cellular machinery. These newly synthesized proteins and viral genome are further packaged and released from the cell ready to infect more host cells. The -mRNA does not translate for proteins (or trans- lates to garbage proteins) [89, 123, 128] (See Figure 4.1). The key role that RdRp plays in viral replication makes it a promising an- tiviral drug target [202]. Drugs such as Favipiravir, Remdesivir, and Galidesivir are being developed to treat viral infection by inhibiting the effects of RdRp [3, 81, 112]. For instance, Remdesivir is a nucleoside analog that gets incor- porated into RNA when the viral genome is being replicated. This creates a translocation barrier that the RdRp encounters after the addition of three more nucleotides following remdesivir causing the RdRp to stall [45, 81]. 44 Figure 4.1: Replication strategy of a (+)ssRNA virus: The viral genome is trans- lated using cellular machinery. This genome also acts as a template for the pro- duction of complementary -mRNA. This complementary strand then acts as a template for the production of a new viral genome (+mRNA). (Image adapted from [89]) The RdRp of some viruses have been studied to require the presence of cer- tain sequences in the RNA analogous to the DNA promoter-specific transcrip- tion by DNA-dependent RNA polymerases [147]. However, in the case of the SARS-CoV-2, although nsp12 can distinguish RNA from DNA, no specific in- teractions have been observed between base pairs of RNA and residues from nsp12, which suggests that RdRp binding may be RNA sequence-independent [71, 195]. In the event that the activity of the SARS-CoV-2 RdRp is sequence- 45 dependent, then the 5’UTR and 3’UTR of the genome must be highly con- served in order for the RdRp to produce the complementary RNA strand from the viral genome and also produce the viral genome from this complementary strand [95]. This also indicates that the RdRp can be harnessed to boost the production of any desired proteins. In this work, we aimed to simulate the effect of RdRp in a cell-free system. To this purpose, we introduce the RdRp into a simple genetic circuit in a myTXTL based cell-free system and simulate its effect to amplify mRNA concentration and thereby improving protein production. We also attempted to produce the subunits of the RdRp complex using myTXTL in order to study its effects in primer extension and expression of a reporter protein. 4.2 Materials and Methods 4.2.1 Model Derivation In a typical gene expression system, the overall flow of information starts from the coding region of DNA, which is read by the RNA polymerase, to produce a messenger RNA molecule (Transcription - TX) which is then read by a ribosome to produce a protein (Translation - TL). Along the way, both the mRNA and the protein can degrade (at different rates) [177] If we consider a system with N genes general, the gene expression of each gene Gi is described by two differential equations: one for the rate of change of mRNA ’i’ concentration (ṁi), and one for the rate of change of protein ’i’ 46 concentration (ṗi): ṁi = rX,iūi − (µ + θm,i)mi + λi (4.1) ṗi = rL,iw̄i − (µ + θp,i)pi (4.2) The term rX,iūi (nM/hr) denotes the regulated production rate of mRNA ’i’ by transcription. It is the product of the kinetic limit of transcription rX,i (nM/hr) and a transcriptional control term ūi Similarly, the term rL,iw̄i (nM/hr) denotes the regulated production rate of protein ’i’ translation. It is the product of the kinetic limit of translation rL,i (nM/hr) and a translational control term w̄i. The term λi denotes the unregulated rate of transcription (the leak for gene Gi). The ū and w̄ terms describe the control logic of the cell for transcription and transla- tion respectively. These terms are dimensionless, and bounded between 0 and 1. The term µ denotes the dilution term (hr−1), while the terms θm,i and θp,i the first-order mRNA degradation rate constant (hr−1) and protein degradation rate constant (hr−1) respectively. [2, 177] Since this reaction is in a cell-free system, the dilution (µ) term can be re- moved and we assume no leak (λi). So these can be removed to modify the equations to : ṁi = rX,iūi − θm,imi (4.3) ṗi = rL,iw̄i − θp,i pi (4.4) 47 Figure 4.2: Introduction of RdRp into a genetic circuit: RdRp uses the +mRNA as a template for the production of a complementary -mRNA and uses the - mRNA as a template for the production of +mRNA. Meanwhile, the +mRNA is also translated to synthesize proteins (Created with BioRender.com) In order to study the effect of the RdRp, we modified the mRNA rate equa- tion to include the transcription by RdRp. In this case, we assumed the RdRp to be a complex of nsp7, nsp8, nsp12, and nsp13. The changes brought by RdRp can be applied for both +mRNA and -mRNA : ṁ+i = rX,iūi − θm,im+i + f (m−i ,R − − +d) (4.5) ṁi = −θm,imi + f (mi ,Rd) (4.6) For the +mRNA, this additional function f (m−i ,Rd) is a function of -mRNA con- centration (m−i ) and RdRp concentration (Rd), while for the -mRNA, this function f (m+i ,Rd) depends on +mRNA concentration (m + i ) and RdRp concentration. This is because -mRNA acts as a template for the production of +mRNA using RdRp and vice versa (See Figure 4.2). In the case of the -mRNA production there is no term representing normal transcription from a gene. This is because there is no gene that codes the -mRNA here. The only source for -mRNA is from RdRp transcription with +mRNA as a 48 template. We considered the transcription of the mRNA using RdRp to be analogous to that of DNA-Dependent RNA Polymerase. The RdRp forms a complex with the mRNA. However, unlike in the transcription involving RNAPII, there is no open or closed complex here as the template here is an mRNA which is single- stranded. This process also involves abortive initiations where the RdRp starts transcription but aborts after a few nucleotides producing subgenomic RNA (sgmRNA). [146] These can be represented by the following set of equations: m+ + − −i + Rd ⇐⇒ (mi : Rd) mi + Rd ⇐⇒ (mi : Rd) (m+ +i : Rd) −→ mi + Rd + sgmRNA− (m−i : Rd) −→ m− +i + Rd + sgmRNA (m+i : Rd) −→ m+i + R − −d + mi (mi : Rd) −→ m−i + Rd + m+i Consider the second set of reactions where +mRNA (m+i ) is produced from -mRNA (m−i ) and RdRp (Rd). The rate constants for these reactions can be de- noted by : k∗E,i : Rate constant for transcription elongation kA : Rate constant for abortive initiation k+ & k− : ON/OFF rate constants for RdRp-mRNA complex. We assume that these are going to be the same for the RdRp interactions with both +mRNA and -mRNA. In this case, the rate of change of the complex concentration (m−i : Rd) is given by the following equation : d(m−i : Rd) = k (m−)(R ) − k (m− : R ) − k (m− : R ) − k∗ (m− : R ) (4.7) dt + i d − i d A i d E,i i d k At Steady State, (m− −1 − −1 +i : Rd) = K (mi )(Rd) where K = k− + kA + k∗E,i 49 In order to make this model more computationally simple, we are going to assume some amount of compartmentalization in the model. We assume that at pseudo-steady state conditions, RdRp is going to be equally divided between binding to -mRNA and +mRNA. This is will indeed be valid if the rate constants are the same for the interactions of RdRp with both RNAs. Assume that the total RdRp concentration available for binding with either one of these at a given time point = RTd , i.e., this is half the total amount of RdRp in the system. T RT − −1 − R d = Rd + (mi : Rd) = Rd + K (mi )(Rd) ⇒ R d d = −1 − (4.8)1 + K mi ( T ) f (m−,R ) r+ ∗ − ∗ − Rd i d = X,i = kE,i(mi : Rd) = kE,imi K + m−i So, this new term can be replaced in equation (4.5() to give) : RT ṁ+i = r ū − θ m+ dX,i i m,i i + k∗ −E,imi − (4.9)K + mi Similarly for -mRNA, ( T ) ṁ− R i = −θ − ∗ + dm,imi + kE,imi (4.10)K + m+i By solving the ODEs (4.9) and (4.10), it is possible to obtain the concentration of +mRNA ’i’ at any point of time. At Steady State, [ ( )] [ ( )] 1 RT 1 RT m+∗i = rX,iū + k ∗ m−∗ d −∗ ∗ +∗ di θ E,i i K + m−∗ mi = kθ E,i mi +∗ m,i i m,i K + mi Here m+∗i and m −∗ i represent the steady state concentrations of +mRNA and - mRNA respectively. 50 Combining these two equations, we get the following equation to denote the steady state concentration of the +mRNA ’i’,  ( [ ( )])∗ 1  T ∗ 1 ∗ ∗ Rd      ( [ RTm+ + di = rX,iūi + kθ E,i kθ E,imi ( )])K + m+∗ 1 ∗ ∗ RT    m,i m,i i 1 + k + d θm,i E,i mi K+m+∗i We can use equations (4.9) and (4.10) to modify the concentration of +mRNA at any point of time. Then using this modified +mRNA concentration, we can use the normal set of translation equations with the same translation parame- ters to get the temporal variations in the concentration of the proteins. Since only the +mRNA concentration has been affected by the presence RdRp (the - mRNA doesn’t code for any protein), that is the only factor that will affect the translation. 4.2.2 Control functions, Transcription and Translation Kinetic Limits The control functions, wi and ui are the(contr)ol functions modeled as: − ln(2)ẇi = wi (4.11) ∑ τL,1/2 u ∑ j∈χ W j f j(...)i = (4.12) i∈C W f (...)i i i Wi denotes the weig∑ht of configuration i and fi is the binding function modeled as a hill function. j∈χ W j f j(...) is the∑sum of all possible promoter configura- tions leading to transcription, while i∈C Wi fi(...) is the sum of all possible con-i figurations for the Gene ’i’. The weights Wi are related to the Gibbs energy of 51 configuration ’i’: ( ) −∆G W ii = exp (4.13)RT The rate of transcription rX, ju j was modeled as the product of a kinetic limit, rX, j, and a control term u j ∈ [0, 1]. The kinetic limit of transcription was derived from elementary reactions leading to the formation of m j, as previously done by McClure [101]. The kinetic limit of transcription was formulated as: rX, j = VmaxX j   G(j )   ,  (4.14) τX, jKX, j + τX, j + 1 G j where the initiation time constant τX, j, and saturation constant KX, j were exper- imentally measured by McClure [101]. The maximum transcription rate, VmaxX, j , was given by: [ ( )] max v̇XVX, j = RX,T (4.15)lG, j where RX,T is the RNA polymerase concentration (nM), v̇X is the RNA poly- merase elongation rate (nt/h) and lG, j is the length of gene j in nucleotides (nt). By analogy, the kinetic limit of translation was formulated as: max   x  r V mRNA L, j = L , j ( )  (4.16) τL, jKL, j + τL, j + 1 xmRNA where xmRNA is the mRNA concentration and VmaxL, j is the maximum translation rate which was formulated as: [ ( )] max v̇LVL, j = KPRL,T (4.17)lP, j 4.2.3 Building the model A preliminary sequence-specific model was generated on Julia using the Gene Regulatory Network Generator - JuGRN [176]. This model takes into account 52 the transcription and translation parameters and solves for the protein expres- sion as an FBA problem [181]. This model was modified to account for the ef- fect of RdRp. The parameters for the model were obtained from BioNumbers or Vilkhovoy, et al [104,181]. Arbor Biosciences’s myTXTL extract was chosen to be the cell-free extract used in this model. These model equations were solved nu- merically using the Rosenbrock23 routine of the DifferentialEquations.jl pack- age [126]. The model code and parameters used in this chapter are available in the following Github repository: https://github.coecis.cornell.edu/ am3246/RdRp 4.2.4 Cell-Free Protein Synthesis Reactions The cell-free protein synthesis (CFPS) reactions were carried out using the myTXTL Sigma 70 Master Mix (Arbor Biosciences) in 1.5 mL Eppendorf tubes. The working volume of all the reactions to synthesize nsp12 or nsp13 was 17 µL, composed of the Sigma 70 Master Mix (9 µL), T7 RNA polymerase DNA (1.5 µL - 0.225 nM), and the plasmids (6.5 µL - 7.65 nM) which were constructed with a T7 promoter. The working volume of all the reactions to synthesize nsp7 or nsp8 was 12.8 µL, composed of the Sigma 70 Master Mix (9 µL), GamS nuclease inhibitor (0.8 µL) and the linear DNA fragments (3 µL). These full constructs were ordered as linear DNA fragments, with 150-200 bp flanker sequences on both ends, from Twist Bioscience and Integrated DNA Technologies. The CFPS reactions were incubated at 29°C. The CFPS runs were carried out in triplicate for 2hr, 4hr, 8hr, and 16hr time points. Following each CFPS run, the samples were stored at -80°C. 53 4.2.5 mRNA Quantification Following each CFPS run, the total RNA was extracted from 5 µL reaction mix- ture using PureLink RNA Mini Kit (Thermo Fisher Scientific) and stored at −80°C. The quantitative RT-PCR reactions were done using Applied Biosystems TaqMan RNA-to-CT 1-Step Kit and Custom TaqMan Gene Expression Assays (Thermo Fisher Scientific). An mRNA standard curve was used to determine absolute mRNA concentrations for each of the samples. The mRNA standards were prepared as follows: separate CFPS reactions for 5 nM of DNA (nsp7, nsp8) were carried out for 2 hr. Total RNA was extracted using the full reac- tion volume. This was followed by the removal of 16S and 23S rRNA using the MICROBExpress Bacterial mRNA Enrichment Kit (Life Technologies Corpora- tion). Lastly, the MEGAclear Kit (Life Technologies Corporation) was used to further purify the mRNA. The mRNA concentrations were determined using the Qubit™ RNA assay kit (ThermoFisher Scientific). At least three technical replicates were performed for each standard. 4.2.6 Protein Quantification Following each CFPS run, 2 µL of each sample was extracted and added to 4 µL of water. And an equal amount of 2x Laemmli sample buffer (with 2- mercaptoethanol as a reducing agent) was added to these. An SDS-PAGE of 10µL of each of these diluted samples along with a Protein ladder of sam- ples with known concentrations was performed at 300V until the dye fronts reached the bottom of the gel. This was followed by staining the gel with Bio- Safe™ Coomassie Stain (Bio-Rad Laboratories). The SDS-PAGE was performed 54 in a Mini-PROTEAN Tetra Cell (Bio-Rad Laboratories) using Any kD™ Mini- PROTEAN® TGX Stain-Free™ Protein Gels, 15 well, 15 µl gels (Bio-Rad Labo- ratories). The stained gel was visualized using the ChemiDoc MP and then the concentrations were measured using Image Lab software. 4.3 Results and Discussion (a) (b) Figure 4.3: Temporal variation of N mRNA and protein when expressed along- side RdRp. The blue curves represent the expression when the model doesn’t account for the effect of RdRp, while the orange curves account for the RdRp feedback to amplify mRNA and thereby amplify protein expression We used the developed model to simulate the effect of RdRp on an arbitrarily chosen SARS-CoV-2 protein with a comparatively low half-life - the nucleocap- sid (N) protein of the SARS-CoV-2. The simulation used 5 nM each of DNA coding for nsp1 and RdRp. The simulation was run for a 30hr reaction time with two different models - One assumed a normal reaction where the effect of RdRp wasn’t accounted for, and the other accounted for the RdRp’s ability to amplify mRNA and thereby amplify protein expression. From Figure 4.3, we can observe that including the effect of RdRp on mRNA amplification does sig- 55 nificantly boost mRNA and thereby protein production. When we need to fit the model with experimental measurements, this model can easily be verified by replacing the chosen protein with a reporter protein such as deGFP. We were able to synthesize the nsp7 (∼ 10.2 kDa) and nsp8 (∼ 22.8 kDa) proteins using the myTXTL cell-free system. With a starting concentration of 9.38nM of the nsp7/nsp8 DNA, we were able to obtain up to an average of 29.98 µM of nsp7 protein and 6.96 µM of nsp8 protein after a CFPS run of 16hr. The mRNA concentration peaked at the 4hr measurement point and continues to decline after that for both (Figures 4.4 and 4.5). (a) (b) Figure 4.4: Gene expression of nsp7. The points denote the mean, while the error bars denote one standard deviation calculated from three replicates 56 (a) (b) Figure 4.5: Gene expression of nsp8. The points denote the mean, while the error bars denote one standard deviation calculated from three replicates However, we were not able to synthesize the larger nsp12 (∼ 106.2 kDa) and nsp13 (∼ 67.8 kDa) subunits. Quantitative RT-PCR mRNA measurements of nsp12 and nsp13 CFPS samples after a 12 hour run revealed that only 0.58 nM of nsp12 mRNA and 2.97 nM of nsp13 mRNA was present. This explains the absence of protein and helps us narrow down the cause. We suspect the issue to be due to the presence of RNases (Ribonucleases) that are causing a higher degradation rate for mRNA than expected. 4.3.1 Future Direction The model is still in its nascent stages and requires a lot of tweaking in order to make it more accurate. For one, currently, the model only takes into consid- eration the effect of RdRp and doesn’t account for the formation of the RdRp complex along with nsp7, nsp8, and nsp13. The model will need to account for the parameters involved in the formation of this complex and the stability of the complex in addition to the stability of the individual subunits. We could 57 also explore building a model using FBA (Flux Balance Analysis) instead of the ODE-based model. Secondly, several model parameters need to be determined using experi- mental data. The viral non-structural proteins involved with RdRp have al- ready been successfully synthesized in E. coli cells [22,27,194]; So, it follows that it should still be theoretically possible to synthesize them in a E. coli-based cell- free extract. However, since we were unable to synthesize the larger nsp12 and nsp13 in our myTXTL system, we need to inspect other methods to synthesize them. Towards this aim, we can explore the use of other systems using extracts from other organisms such as the mammalian HeLa cell-free extract. The suc- cessful production of certain therapeutic proteins is often based on a synthesis setup that is closely related to in vivo conditions [199]. In the case it proves to be futile to synthesize the proteins in vivo, we could make use of traditional in vivo systems for the production of these proteins, but later assemble them for study in a cell-free system. Lastly, once the binding parameters for the system are determined, it would be feasible to apply this model to cell-based systems to predict the spread of viral infection in human cells. If this approach is successful, it will shed light on the viral replication kinetics of all ssRNA viruses. 58 CHAPTER 5 CONCLUSION We have constructed a promising transcriptional unit based on a gluconate- responsive allosteric transcription factor, GntR, that can be utilized to ”sense” glucose via its proxy, D-gluconate. The E. coli GntR combined with the mP70 promoter construct showed promising repression and de-repression results in the PURE system, which did not have metabolic side reactions that consumed D-gluconate. Taken together we have constructed a promising transcriptional unit that can ”sense” glucose via its proxy, D-gluconate. We aim to integrate this unit in a more complex genetic circuit to fully utilize its glucose-sensing po- tential to direct downstream synthesis of proteins such as insulin and glucagon for the management of diabetes. Second, we expressed the SARS-CoV-2 host shutoff factor, nsp1 in the myTXTL cell-free system and demonstrated that it does not bind the E.coli ribosomes. Thereby rendering its host translation inhi- bition ineffective in the E.coli-based cell-free system. Lastly, we also developed a mathematical model to study the effect of the SARS-CoV-2 RNA-dependent RNA polymerase (RdRp) on the production of a desired protein in the myTXTL cell-free system. Through simulations using the model built on a JuGRN back- bone, we demonstrated that the presence of RdRp can amplify the mRNA in the system, thereby boosting protein production. We also postulate that RdRp can be harnessed to improve the production of desired proteins. 59 BIBLIOGRAPHY [1] Diabetes facts and figures. https://idf.org/aboutdiabetes/ what-is-diabetes/facts-figures.html, 2019. [2] Abhinav Adhikari, Michael Vilkhovoy, Sandra Vadhin, Ha Eun Lim, and Jeffrey D Varner. Effective biophysical modeling of cell free transcription and translation processes. Frontiers in bioengineering and biotechnology, 8, 2020. [3] Syed Ovais Aftab, Muhammad Zubair Ghouri, Muhammad Umer Ma- sood, Zeshan Haider, Zulqurnain Khan, Aftab Ahmad, and Nayla Mu- nawar. Analysis of sars-cov-2 rna-dependent rna polymerase as a poten- tial therapeutic drug target using a computational approach. Journal of translational medicine, 18(1):1–15, 2020. [4] Dmitry E Agafonov, Vyacheslav A Kolb, Igor V Nazimov, and Alexan- der S Spirin. A protein residing at the subunit interface of the bacterial ribosome. Proceedings of the National Academy of Sciences, 96(22):12345– 12349, 1999. [5] Dae-Gyun Ahn, Jin-Kyu Choi, Deborah R Taylor, and Jong-Won Oh. Biochemical characterization of a recombinant sars coronavirus nsp12 rna-dependent rna polymerase capable of copying viral rna templates. Archives of virology, 157(11):2095–2104, 2012. [6] Cem Albayrak and James R. Swartz. Cell-free co-production of an orthog- onal transfer rna activates efficient site-specific non-natural amino acid incorporation. Nucleic Acids Research, 41(11):5949–5963, 2013. [7] Timothy E Allen and Bernhard Ø Palsson. Sequence-based analysis of metabolic demands for protein synthesis in prokaryotes. Journal of theoret- ical biology, 220(1):1–18, 2003. [8] Jorge Alonso-Gutierrez, Eun-Mi Kim, Tanveer S. Batth, Nathan Cho, Qi- jun Hu, Leanne Jade G. Chan, Christopher J. Petzold, Nathan J. Hillson, Paul D. Adams, Jay D. Keasling, Hector Garcia Martin, and Taek Soon Lee. Principal component analysis of proteomics (pcap) as a tool to direct metabolic engineering. Metabolic Engineering, 28:123–133, 2015. [9] Maria Anastasina, Ilya Terenin, Sarah J Butcher, and Denis E Kainov. A technique to increase protein yield in a rabbit reticulocyte lysate transla- tion system. Biotechniques, 56(1):36–39, 2014. 60 [10] JC Atlas, EV Nikolaev, ST Browning, and ML Shuler. Incorporating genome-wide dna sequence information into a dynamic whole-cell model of escherichia coli: application to dna replication. IET systems biology, 2(5):369–382, 2008. [11] Abhik K Banerjee, Mario R Blanco, Emily A Bruce, Drew D Honson, Lin- lin M Chen, Amy Chow, Prashant Bhat, Noah Ollikainen, Sofia A Quin- odoz, Colin Loney, et al. Sars-cov-2 disrupts splicing, translation, and protein trafficking to suppress host defenses. Cell, 183(5):1325–1339, 2020. [12] Lacramioara Bintu, Nicolas E Buchler, Hernan G Garcia, Ulrich Gerland, Terence Hwa, Jané Kondev, and Rob Phillips. Transcriptional regula- tion by the numbers: models. Current opinion in genetics & development, 15(2):116–124, 2005. [13] Mahamaya Biswal, Stephen Diggs, Duo Xu, Nelli Khudaverdyan, Jiuwei Lu, Jian Fang, Gregor Blaha, Rong Hai, and Jikui Song. Two conserved oligomer interfaces of nsp7 and nsp8 underpin the dynamic assembly of sars-cov-2 rdrp. Nucleic acids research, 49(10):5956–5966, 2021. [14] Igor W. Bogorad, Tzu-Shyang Lin, and James C. Liao. Synthetic non- oxidative glycolysis enables complete carbon conservation. Nature, 502(7473):693–697, 2013. [15] Henry Borsook. Protein turnover and incorporation of labeled amino acids into tissue proteins in vivo and in vitro. Physiological reviews, 30(2):206–219, 1950. [16] Shelton S Bradrick, Simardeep Nagyal, and Hilary Novatt. A mirna- responsive cell-free translation system facilitates isolation of hepatitis c virus mirnp complexes. Rna, 19(8):1159–1169, 2013. [17] Sabine Brantl and E Gerhart H Wagner. Antisense rna-mediated transcrip- tional attenuation: an in vitro study of plasmid pt181. Molecular microbiol- ogy, 35(6):1469–1482, 2000. [18] Kara A. Calhoun and James R. Swartz. Energizing cell-free protein synthesis with glucose metabolism. Biotechnology and Bioengineering, 90(5):606–613, 2005. [19] Mariajose Castellanos, David B Wilson, and Michael L Shuler. A modular minimal cell model: purine and pyrimidine transport and metabolism. Proceedings of the National Academy of Sciences, 101(17):6681–6686, 2004. 61 [20] Roger L Chang, Kathleen Andrews, Donghyuk Kim, Zhanwen Li, Adam Godzik, and Bernhard O Palsson. Structural systems biology evaluation of metabolic thermotolerance in escherichia coli. Science, 340(6137):1220– 1223, 2013. [21] James Chappell, Melissa K Takahashi, and Julius B Lucks. Creating small transcription activating rnas. Nature chemical biology, 11(3):214–220, 2015. [22] James Chen, Brandon Malone, Eliza Llewellyn, Michael Grasso, Patrick MM Shelton, Paul Dominic B Olinares, Kashyap Maruthi, Ed- ward T Eng, Hasan Vatandaslar, Brian T Chait, et al. Structural basis for helicase-polymerase coupling in the sars-cov-2 replication-transcription complex. Cell, 182(6):1560–1573, 2020. [23] A Daddaoua, A Corral-Lugo, J-L Ramos, and Tino Krell. Identification of gntr as regulator of the glucose metabolism in pseudomonas aeruginosa. Environmental microbiology, 19(9):3721–3733, 2017. [24] David Dai, Nicholas Horvath, and Jeffrey Varner. Dynamic sequence spe- cific constraint-based modeling of cell-free protein synthesis. Processes, 6(8):132, 2018. [25] Samar Damiati, Rami Mhanna, Rimantas Kodzius, and Eva-Kathrin Ehmoser. Cell-free approaches in synthetic biology utilizing microflu- idics. Genes, 9(3):144, 2018. [26] Denis Daneman. Type 1 diabetes. The Lancet, 367(9513):847–858, 2006. [27] Tyler L Dangerfield, Nathan Z Huang, and Kenneth A Johnson. Expres- sion and purification of tag-free sars-cov-2 rna-dependent rna polymerase in escherichia coli. STAR protocols, 2(1):100357, 2021. [28] Ulysses Amancio de Frias, Greicy Kelly Bonifacio Pereira, Marı́a-Eugenia Guazzaroni, and Rafael Silva-Rocha. Boosting secondary metabolite pro- duction and discovery through the engineering of novel microbial biosen- sors. BioMed Research International, 2018:7021826–7021826, 2018. [29] Mark R Denison, Rachel L Graham, Eric F Donaldson, Lance D Eckerle, and Ralph S Baric. Coronaviruses: an rna proofreading machine regulates replication fidelity and diversity. RNA biology, 8(2):270–279, 2011. 62 [30] Devasenan Devendra, Edwin Liu, and George S Eisenbarth. Type 1 dia- betes: recent developments. Bmj, 328(7442):750–754, 2004. [31] MM Domach, SK Leung, RE Cahn, GG Cocks, and ML Shuler. Computer model for glucose-limited growth of a single cell of escherichia coli b/r-a. Biotechnology and bioengineering, 26(9):1140–1140, 1984. [32] Quentin M Dudley, Ashty S Karim, and Michael C Jewett. Cell-free metabolic engineering: biomanufacturing beyond the cell. Biotechnology journal, 10(1):69–82, 2015. [33] Quentin M. Dudley, Connor J. Nash, and Michael Christopher Jewett. Cell-free biosynthesis of limonene using enzyme-enriched escherichia coli lysates. Synthetic Biology, 4(1), 2019. [34] Jeremy S Edwards and Bernhard O Palsson. Metabolic flux balance anal- ysis and the in silico analysis of escherichia coli k-12 gene deletions. BMC bioinformatics, 1(1):1–10, 2000. [35] Toru Ezure, Takashi Suzuki, Shoken Higashide, Eiichi Shintani, Kohki Endo, Shin-ichiro Kobayashi, Masamitsu Shikata, Masaaki Ito, Koji Tani- mizu, and Osamu Nishimura. Cell-free protein synthesis system prepared from insect cells by freeze-thawing. Biotechnology progress, 22(6):1570– 1577, 2006. [36] Anthony R Fehr and Stanley Perlman. Coronaviruses: an overview of their replication and pathogenesis. Coronaviruses, pages 1–23, 2015. [37] Yaara Finkel, Orel Mizrahi, Aharon Nachshon, Shira Weingarten-Gabbay, David Morgenstern, Yfat Yahalom-Ronen, Hadas Tamir, Hagit Achdout, Dana Stein, Ofir Israeli, et al. The coding capacity of sars-cov-2. Nature, 589(7840):125–130, 2021. [38] AG Fredrickson. Formulation of structured growth models. Biotechnology and bioengineering, 18(10):1481–1486, 1976. [39] Y Fujita and Y Miwa. Identification of an operator sequence for the bacil- lus subtilis gnt operon. Journal of Biological Chemistry, 264(7):4201–4206, 1989. [40] Ernest F Gale, Joan P Folkes, et al. Effect of nucleic acids on protein syn- 63 thesis and amino-acid incorporation in disrupted staphylococcal cells. Na- ture, 173:1223–7, 1954. [41] Yan Gao, Liming Yan, Yucen Huang, Fengjiang Liu, Yao Zhao, Lin Cao, Tao Wang, Qianqian Sun, Zhenhua Ming, Lianqi Zhang, et al. Struc- ture of the rna-dependent rna polymerase from covid-19 virus. Science, 368(6492):779–782, 2020. [42] Jonathan Garamella, Ryan Marshall, Mark Rustad, and Vincent Noireaux. The all e. coli tx-tl toolbox 2.0: a platform for cell-free synthetic biology. ACS synthetic biology, 5(4):344–355, 2016. [43] V. Georgi, Leopold Georgi, Martin Blechert, Merlin Bergmeister, Michael Zwanzig, Doreen A. Wüstenhagen, Frank F. Bier, Erik Jung, and Stefan Kubick. On-chip automation of cell-free protein synthesis: new opportu- nities due to a novel reaction mode. Lab on a Chip, 16(2):269–281, 2016. [44] Aaron R Goerke and James R Swartz. Development of cell-free protein synthesis platforms for disulfide bonded proteins. Biotechnology and bio- engineering, 99(2):351–367, 2008. [45] Calvin J Gordon, Egor P Tchesnokov, Joy Y Feng, Danielle P Porter, and Matthias Götte. The antiviral compound remdesivir potently inhibits rna- dependent rna polymerase from middle east respiratory syndrome coro- navirus. Journal of Biological Chemistry, 295(15):4773–4779, 2020. [46] Nicole E Gregorio, Max Z Levine, and Javin P Oza. A user’s guide to cell-free protein synthesis. Methods and protocols, 2(1):24, 2019. [47] Cassandra Guarino and Matthew P DeLisa. A prokaryote-based cell-free translation system that efficiently synthesizes glycoproteins. Glycobiology, 22(5):596–601, 2012. [48] Weihua Guo, Jiayuan Sheng, and Xueyang Feng. Mini-review: in vitro metabolic engineering for biomanufacturing of high-value products. Computational and Structural Biotechnology Journal, 15:161–167, 2017. [49] Jan-Karl Guterl, Daniel Garbe, Jörg Carsten, Fabian Steffler, Bettina Som- mer, Steven Reiße, Anja Philipp, Martina Haack, Broder Rühmann, Andre Koltermann, Ulrich Kettling, Thomas Brück, and Volker Sieber. Cell-free metabolic engineering: production of chemicals by minimized reaction cascades. Chemsuschem, 5(11):2165–2172, 2012. 64 [50] Andras Gyorgy and Richard M Murray. Quantifying resource competi- tion and its effects in the tx-tl system. In 2016 IEEE 55th Conference on Decision and Control (CDC), pages 3363–3368. IEEE, 2016. [51] Shogo Hamada, Kenneth Gene Yancey, Yehudah Pardo, Mingzhe Gan, Max Vanatta, Duo An, Yue Hu, Thomas L. Derrien, Roanna Ruiz, Peifeng Liu, Jenny Sabin, and Dan Luo. Dynamic dna material with emergent locomotion behavior powered by artificial metabolism. Science Robotics, 4(29), 2019. [52] Joshua J Hamilton, Vivek Dwivedi, and Jennifer L Reed. Quantitative as- sessment of thermodynamic constraints on the solution space of genome- scale metabolic models. Biophysical journal, 105(2):512–522, 2013. [53] Ryan L. Hartman, Jonathan P. McMullen, and Klavs F. Jensen. Deciding whether to go with the flow: evaluating the merits of flow reactors for synthesis. Angewandte Chemie, 50(33):7502–7519, 2011. [54] Hartmut Hengel, Ulrich H Koszinowski, and Karl-Klaus Conzelmann. Viruses know it all: new insights into ifn networks. Trends in immunol- ogy, 26(7):396–401, 2005. [55] Christopher S Henry, Linda J Broadbelt, and Vassily Hatzimanikatis. Thermodynamics-based metabolic flux analysis. Biophysical journal, 92(5):1792–1805, 2007. [56] Jason R Hillebrecht and Shaorong Chong. A comparative study of protein synthesis in in vitro systems: from the prokaryotic reconstituted to the eukaryotic extract-based. BMC biotechnology, 8(1):1–9, 2008. [57] Hauke S Hillen, Goran Kokic, Lucas Farnung, Christian Dienemann, Dim- itry Tegunov, and Patrick Cramer. Structure of replicating sars-cov-2 poly- merase. Nature, 584(7819):154–156, 2020. [58] Yu HongFeng, Ding YongQian, Liu HaiTao, Zhu WenQian, Liu Guo- Qiang, Fu XiuQing, and Ding WeiMin. Optimization design and appli- cation of variable rate fertilization system for small-scaled fields. Transac- tions of the Chinese Society of Agricultural Engineering, 34(3):35–41, 2018. [59] Nicholas Horvath, Michael Vilkhovoy, Joseph A Wayman, Kara Calhoun, James Swartz, and Jeffrey D Varner. Toward a genome scale sequence specific dynamic model of cell-free protein synthesis in escherichia coli. Metabolic engineering communications, 10:e00113, 2020. 65 [60] Chelsea Y Hu, Jeffrey D Varner, and Julius B Lucks. Generating effec- tive models and parameters for rna genetic circuits. ACS synthetic biology, 4(8):914–926, 2015. [61] Cheng Huang, Kumari G Lokugamage, Janet M Rozovics, Krishna Narayanan, Bert L Semler, and Shinji Makino. Sars coronavirus nsp1 protein induces template-dependent endonucleolytic cleavage of mrnas: viral mrnas are resistant to nsp1-induced rna cleavage. PLoS pathogens, 7(12):e1002433, 2011. [62] Isabelle Imbert, Jean-Claude Guillemot, Jean-Marie Bourhis, Cécile Bus- setta, Bruno Coutard, Marie-Pierre Egloff, François Ferron, Alexander E Gorbalenya, and Bruno Canard. A second, non-canonical rna-dependent rna polymerase in sars coronavirus. The EMBO journal, 25(20):4933–4942, 2006. [63] Konstantin A Ivanov, Volker Thiel, Jessika C Dobbe, Yvonne Van Der Meer, Eric J Snijder, and John Ziebuhr. Multiple enzymatic activities associated with severe acute respiratory syndrome coronavirus helicase. Journal of virology, 78(11):5619–5632, 2004. [64] Konstantin A Ivanov and John Ziebuhr. Human coronavirus 229e non- structural protein 13: characterization of duplex-unwinding, nucleoside triphosphatase, and rna 5-triphosphatase activities. Journal of virology, 78(14):7833–7838, 2004. [65] Thapakorn Jaroentomeechai, Jessica C Stark, Aravind Natarajan, Cameron J Glasscock, Laura E Yates, Karen J Hsu, Milan Mrksich, Michael C Jewett, and Matthew P DeLisa. Single-pot glycoprotein biosyn- thesis using a cell-free transcription-translation system enriched with gly- cosylation machinery. Nature communications, 9(1):1–11, 2018. [66] Andrew R Jauregui, Dhruti Savalia, Virginia K Lowry, Cara M Farrell, and Marc G Wathelet. Identification of residues of sars-cov nsp1 that differen- tially affect inhibition of gene expression and antiviral signaling. PloS one, 8(4):e62416, 2013. [67] Michael C Jewett, Kara A Calhoun, Alexei Voloshin, Jessica J Wuu, and James R Swartz. An integrated cell-free metabolic platform for protein production and synthetic biology. Molecular systems biology, 4(1):220, 2008. [68] Michael C Jewett and James R Swartz. Mimicking the escherichia coli cy- 66 toplasmic environment activates long-lived and efficient cell-free protein synthesis. Biotechnology and bioengineering, 86(1):19–26, 2004. [69] Michael C Jewett and James R Swartz. Substrate replenishment extends protein synthesis with an in vitro translation system designed to mimic the cytoplasm. Biotechnology and bioengineering, 87(4):465–471, 2004. [70] Guoqiang Jiang and Bei B Zhang. Glucagon and regulation of glucose metabolism. American Journal of Physiology-Endocrinology And Metabolism, 284(4):E671–E678, 2003. [71] Yi Jiang, Wanchao Yin, and H Eric Xu. Rna-dependent rna polymerase: Structure, mechanism, and drug discovery for covid-19. Biochemical and biophysical research communications, 538:47–53, 2021. [72] Jaeyoung K Jung, Khalid K Alam, Matthew S Verosloff, Daiana A Capdev- ila, Morgane Desmau, Phillip R Clauer, Jeong Wook Lee, Peter Q Nguyen, Pablo A Pastén, Sandrine J Matiasek, et al. Cell-free biosensors for rapid detection of water contaminants. Nature biotechnology, 38(12):1451–1459, 2020. [73] Wataru Kamitani, Cheng Huang, Krishna Narayanan, Kumari G Lokuga- mage, and Shinji Makino. A two-pronged strategy to suppress host pro- tein synthesis by sars coronavirus nsp1 protein. Nature structural & molec- ular biology, 16(11):1134–1140, 2009. [74] Eyal Karzbrun, Jonghyeon Shin, Roy H Bar-Ziv, and Vincent Noireaux. Coarse-grained dynamics of protein synthesis in a cell-free system. Phys- ical review letters, 106(4):048104, 2011. [75] Rozhgar A Khailany, Muhamad Safdar, and Mehmet Ozaslan. Genomic characterization of a novel sars-cov-2. Gene reports, 19:100682, 2020. [76] Akram T Kharroubi and Hisham M Darwish. Diabetes mellitus: The epi- demic of the century. World Journal of Diabetes, 6(6):850–867, 2015. [77] Dong-Myung Kim and James R Swartz. Regeneration of adenosine triphosphate from glycolytic intermediates for cell-free protein synthesis. Biotechnology and bioengineering, 74(4):309–316, 2001. [78] Dongwan Kim, Joo-Yeon Lee, Jeong-Sun Yang, Jun Won Kim, V Narry 67 Kim, and Hyeshik Chang. The architecture of sars-cov-2 transcriptome. Cell, 181(4):914–921, 2020. [79] Robert N Kirchdoerfer and Andrew B Ward. Structure of the sars-cov nsp12 polymerase bound to nsp7 and nsp8 co-factors. Nature communica- tions, 10(1):1–9, 2019. [80] Mathilde Koch, Amir Pandi, Olivier Borkowski, Angelo Cardoso Batista, and Jean Loup Faulon. Custom-made transcriptional biosensors for metabolic engineering. Current Opinion in Biotechnology, 59:78–84, 2019. [81] Goran Kokic, Hauke S Hillen, Dimitry Tegunov, Christian Dienemann, Florian Seitz, Jana Schmitzova, Lucas Farnung, Aaron Siewert, Claudia Höbartner, and Patrick Cramer. Mechanism of sars-cov-2 polymerase stalling by remdesivir. Nature communications, 12(1):1–7, 2021. [82] Borimas Krutsakorn, Kohsuke Honda, Xiaoting Ye, Takashi Imagawa, Xi- aoyu Bei, Kenji Okano, and Hisao Ohtake. In vitro production of n- butanol from glucose. Metabolic Engineering, 20:84–91, 2013. [83] Christopher P Lapointe, Rosslyn Grosely, Alex G Johnson, Jinfan Wang, Israel S Fernández, and Joseph D Puglisi. Dynamic competition between sars-cov-2 nsp1 and mrna on the human ribosome inhibits translation ini- tiation. Proceedings of the National Academy of Sciences, 118(6), 2021. [84] Muriel Lederman and Geoffrey Zubay. Dna-directed peptide synthesis i. a comparison of t2 and escherichia coli dna-directed peptide synthesis in two cell-free systems. Biochimica et Biophysica Acta (BBA)-Nucleic Acids and Protein Synthesis, 149(1):253–258, 1967. [85] Na-Ra Lee, Hyun-Mi Kwon, Kkothanahreum Park, Sangtaek Oh, Yong- Joo Jeong, and Dong-Eun Kim. Cooperative translocation enhances the unwinding of duplex dna by sars coronavirus helicase nsp13. Nucleic acids research, 38(21):7626–7636, 2010. [86] Kathleen C Lehmann, Anastasia Gulyaeva, Jessika C Zevenhoven-Dobbe, George MC Janssen, Mark Ruben, Hermen S Overkleeft, Peter A Van Vee- len, Dmitry V Samborskiy, Alexander A Kravchenko, Andrey M Leon- tovich, et al. Discovery of an essential nucleotidylating activity associ- ated with a newly delineated conserved domain in the rna polymerase- containing protein of all nidoviruses. Nucleic acids research, 43(17):8416– 8434, 2015. 68 [87] Roberta Lentini, Michele Forlin, Laura Martini, Cristina Del Bianco, Amy C Spencer, Domenica Torino, and Sheref S Mansy. Fluorescent pro- teins and in vitro genetic organization for cell-free synthetic biology. ACS synthetic biology, 2(9):482–489, 2013. [88] Nathan E Lewis, Harish Nagarajan, and Bernhard O Palsson. Constrain- ing the metabolic genotype–phenotype relationship using a phylogeny of in silico methods. Nature Reviews Microbiology, 10(4):291–305, 2012. [89] Dongsheng Li, Ting Wei, Catherine M Abbott, and David Harrich. The unexpected roles of eukaryotic translation elongation factors in rna virus replication and pathogenesis. Microbiol. Mol. Biol. Rev., 77(2):253–266, 2013. [90] Jun Li, Liangcai Gu, John Aach, and George M Church. Improved cell-free rna and protein synthesis system. PLoS One, 9(9):e106232, 2014. [91] Di Liu, Trent Evans, and Fuzhong Zhang. Applications and advances of metabolite biosensors for metabolic engineering. Metabolic Engineering, 31:35–43, 2015. [92] Yuan Lu. Cell-free synthetic biology: Engineering in an open world. Syn- thetic and systems biotechnology, 2(1):23–27, 2017. [93] Yuan Lu, John P Welsh, and James R Swartz. Production and stabiliza- tion of the trimeric influenza hemagglutinin stem domain for potentially broadly protective influenza vaccines. Proceedings of the National Academy of Sciences, 111(1):125–130, 2014. [94] Julius B Lucks, Lei Qi, Vivek K Mutalik, Denise Wang, and Adam P Arkin. Versatile rna-sensing transcriptional regulators for engineering genetic networks. Proceedings of the National Academy of Sciences, 108(21):8617– 8622, 2011. [95] Guangxiang Luo, Robert K Hamatake, Danielle M Mathis, Jason Racela, Karen L Rigat, Julie Lemm, and Richard J Colonno. De novo initiation of rna synthesis by the rna-dependent rna polymerase (ns5b) of hepatitis c virus. Journal of virology, 74(2):851–863, 2000. [96] Rolf Lutz and Hermann Bujard. Independent and tight regulation of tran- scriptional units in escherichia coli via the lacr/o, the tetr/o and arac/i1-i2 regulatory elements. Nucleic acids research, 25(6):1203–1210, 1997. 69 [97] Duo Ma, Luhui Shen, Kaiyue Wu, Chris W Diehnelt, and Alexander A Green. Low-cost detection of norovirus using paper-based cell-free sys- tems and synbody-based viral enrichment. Synthetic Biology, 3(1), 2018. [98] Brandon Malone, James Chen, Qi Wang, Eliza Llewellyn, Young Joo Choi, Paul Dominic B Olinares, Xinyun Cao, Carolina Hernandez, Edward T Eng, Brian T Chait, et al. Structural basis for backtracking by the sars-cov- 2 replication–transcription complex. Proceedings of the National Academy of Sciences, 118(19), 2021. [99] Rey W. Martin, Benjamin J. Des Soye, Yong Chan Kwon, Jennifer Kay, Roderick G. Davis, Paul M. Thomas, Natalia I. Majewska, Cindy X. Chen, Ryan D. Marcum, Mary Grace Weiss, Ashleigh E. Stoddart, Miriam Ami- ram, Arnaz K. Ranji Charna, Jaymin R. Patel, Farren J. Isaacs, Neil L. Kelleher, Seok Hoon Hong, and Michael C. Jewett. Cell-free protein syn- thesis from genomically recoded bacteria enables multisite incorporation of noncanonical amino acids. Nature Communications, 9(1):1203, 2018. [100] J Heinrich Matthaei and Marshall W Nirenberg. Characteristics and sta- bilization of dnaase-sensitive protein synthesis in e. coli extracts. Pro- ceedings of the National Academy of Sciences of the United States of America, 47(10):1580, 1961. [101] William R McClure. Rate-limiting steps in rna chain initiation. Proceedings of the National Academy of Sciences, 77(10):5634–5638, 1980. [102] D. T. McQuade and Peter H. Seeberger. Applying flow chemistry: Meth- ods, materials, and multistep synthesis. Journal of Organic Chemistry, 78(13):6384–6389, 2013. [103] Satoshi Mikami, Tominari Kobayashi, and Hiroaki Imataka. Cell-free pro- tein synthesis systems with extracts from cultured human cells. In Cell- Free Protein Production, pages 43–52. Springer, 2010. [104] Ron Milo, Paul Jorgensen, Uri Moran, Griffin Weber, and Michael Springer. Bionumbers—the database of key numbers in molecular and cell biology. Nucleic acids research, 38(suppl 1):D750–D753, 2010. [105] Ekaterina Minskaia, Tobias Hertzig, Alexander E Gorbalenya, Valérie Campanacci, Christian Cambillau, Bruno Canard, and John Ziebuhr. Dis- covery of an rna virus 3→ 5 exoribonuclease that is critically involved in coronavirus rna synthesis. Proceedings of the National Academy of Sciences, 103(13):5108–5113, 2006. 70 [106] Simon J. Moore, James T. MacDonald, Sarah Wienecke, Alka Ishwarb- hai, Argyro Tsipa, Rochelle Aw, Nicolas Kylilis, David J. Bell, David W. McClymont, Kirsten Jensen, Karen M. Polizzi, Rebekka Biedendieck, and Paul S. Freemont. Rapid acquisition and model-based analysis of cell- free transcription–translation reactions from nonmodel bacteria. Proceed- ings of the National Academy of Sciences of the United States of America, 115(19):201715806, 2018. [107] Travis W. Murphy, Jiayuan Sheng, Lynette B. Naler, Xueyang Feng, and Chang Lu. On-chip manufacturing of synthetic proteins for point-of-care therapeutics. Microsystems Nanoengineering, 5(1):13, 2019. [108] Keisuke Nakagawa, Krishna Narayanan, Masami Wada, Vsevolod L Popov, Maria Cajimat, Ralph S Baric, and Shinji Makino. The endonucle- olytic rna cleavage function of nsp1 of middle east respiratory syndrome coronavirus promotes the production of infectious virus particles in spe- cific human cell lines. Journal of virology, 92(21):e01157–18, 2018. [109] Ahmad Abu Turab Naqvi, Kisa Fatima, Taj Mohammad, Urooj Fatima, Indrakant K Singh, Archana Singh, Shaikh Muhammad Atif, Gururao Hariprasad, Gulam Mustafa Hasan, and Md Imtaiyaz Hassan. Insights into sars-cov-2 genome, structure, evolution, pathogenesis and thera- pies: Structural genomics approach. Biochimica et Biophysica Acta (BBA)- Molecular Basis of Disease, 1866(10):165878, 2020. [110] Krishna Narayanan, Cheng Huang, Kumari Lokugamage, Wataru Kami- tani, Tetsuro Ikegami, Chien-Te K Tseng, and Shinji Makino. Severe acute respiratory syndrome coronavirus nsp1 suppresses host gene expression, including that of type i interferon, in infected cells. Journal of virology, 82(9):4471–4479, 2008. [111] Krishna Narayanan, Sydney I Ramirez, Kumari G Lokugamage, and Shinji Makino. Coronavirus nonstructural protein 1: Common and dis- tinct functions in the regulation of host and viral gene expression. Virus research, 202:89–100, 2015. [112] Katerina Naydenova, Kyle W Muir, Long-Fei Wu, Ziguo Zhang, Francesca Coscia, Mathew J Peet, Pablo Castro-Hartmann, Pu Qian, Kasim Sader, Kyle Dent, et al. Structure of the sars-cov-2 rna-dependent rna polymerase in the presence of favipiravir-rtp. Proceedings of the Na- tional Academy of Sciences, 118(7), 2021. [113] Patrick P Ng, Ming Jia, Kedar G Patel, Joshua D Brody, James R Swartz, 71 Shoshana Levy, and Ronald Levy. A vaccine directed to b cells and pro- duced by cell-free protein synthesis generates potent antilymphoma im- munity. Proceedings of the National Academy of Sciences, 109(36):14526– 14531, 2012. [114] Pamela N. Nge, Chad I. Rogers, and Adam T. Woolley. Advances in mi- crofluidic materials, functions, integration, and applications. Chemical Re- views, 113(4):2550–2583, 2013. [115] Henrike Niederholtmeyer, Ling Xu, and Sebastian J Maerkl. Real-time mrna measurement during an in vitro transcription and translation reac- tion using binary probes. ACS synthetic biology, 2(8):411–417, 2013. [116] Alexander Nieß, Jurek Failmezger, Maike Kuschel, Martin Siemann- Herzberg, and Ralf Takors. Experimentally validated model enables de- bottlenecking of in vitro protein synthesis and identifies a control shift under in vivo conditions. ACS synthetic biology, 6(10):1913–1921, 2017. [117] Marshall W Nirenberg and J Heinrich Matthaei. The dependence of cell-free protein synthesis in e. coli upon naturally occurring or syn- thetic polyribonucleotides. Proceedings of the National Academy of Sciences, 47(10):1588–1602, 1961. [118] Vincent Noireaux, Roy Bar-Ziv, and Albert Libchaber. Principles of cell- free genetic circuit assembly. Proceedings of the National Academy of Sciences, 100(22):12672–12677, 2003. [119] Edward J O’brien, Joshua A Lerman, Roger L Chang, Daniel R Hyduke, and Bernhard Ø Palsson. Genome-scale models of metabolism and gene expression extend and refine growth phenotype prediction. Molecular sys- tems biology, 9(1):693, 2013. [120] Paul H Opgenorth, Tyler P Korman, and James U Bowie. A synthetic bio- chemistry molecular purge valve module that maintains redox balance. Nature communications, 5(1):1–8, 2014. [121] Keith Pardee and James J. Collins. Paper-based synthetic gene networks, 2014. [122] Keith Pardee, Alexander A. Green, Melissa K. Takahashi, Dana Braff, Guillaume Lambert, Jeong Wook Lee, Tom Ferrante, Duo Ma, Nina Donghia, Melina Fan, Nichole M. Daringer, Irene Bosch, Dawn M. Dud- ley, David H. O’Connor, Lee Gehrke, and James J. Collins. Rapid, low-cost 72 detection of zika virus using programmable biomolecular components. Cell, 165(5):1255–1266, 2016. [123] Susan Payne. Introduction to rna viruses. Viruses, page 97, 2017. [124] Norbert Peekhaus and T. Conway. Positive and negative transcriptional regulation of the escherichia coli gluconate regulon gene gntt by gntr and the cyclic amp (camp)-camp receptor protein complex. Journal of Bacteri- ology, 180(7):1777–1785, 1998. [125] Clara C Posthuma, Aartjan JW Te Velthuis, and Eric J Snijder. Nidovirus rna polymerases: complex enzymes handling exceptional rna genomes. Virus research, 234:58–73, 2017. [126] Aditya Pratapa, Amogh P Jalihal, Jeffrey N Law, Aditya Bharadwaj, and TM Murali. Benchmarking algorithms for gene regulatory network infer- ence from single-cell transcriptomic data. Nature methods, 17(2):147–154, 2020. [127] JM Pratt. Coupled transcription-translation in prokaryotic cell-free sys- tems. Transcription and translation: a practical approach, 1984. [128] Sephra Rampersad and Paula Tennant. Replication and expression strate- gies of viruses. Viruses, page 55, 2018. [129] Jan Rehwinkel and Michaela U Gack. Rig-i-like receptors: their regulation and roles in rna sensing. Nature Reviews Immunology, 20(9):537–551, 2020. [130] A. Reizer, J. Deutscher, M. H. Saier, and J. Reizer. Analysis of the glu- conate (gnt) operon of bacillus subtilis. Molecular Microbiology, 5(5):1081– 1089, 1991. [131] Maria Romano, Alessia Ruggiero, Flavia Squeglia, Giovanni Maga, and Rita Berisio. A structural view of sars-cov-2 rna replication machinery: Rna synthesis, proofreading and final capping. Cells, 9(5):1267, 2020. [132] Gabriel Rosenblum and Barry S Cooperman. Engine out of the chassis: cell-free protein synthesis and its uses. FEBS letters, 588(2):261–268, 2014. [133] Ryota Sakamoto, Vincent Noireaux, and Yusuke T. Maeda. Anomalous scaling of gene expression in confined cell-free reactions. Scientific Reports, 8(1):7364–7364, 2018. 73 [134] Shun Sakuraba, Xie Qilin, Kota Kasahara, Junichi Iwakiri, and Hidetoshi Kono. Modeling sars-cov-2 nsp1-5’-utr complex via the extended ensem- ble simulations. bioRxiv, 2021. [135] Claudia Sánchez, Juan Carlos Quintero, and Silvia Ochoa. Flux bal- ance analysis in the production of clavulanic acid by streptomyces clavuligerus. Biotechnology progress, 31(5):1226–1236, 2015. [136] Inga-Marie Schaefer, Robert F Padera, Isaac H Solomon, Sanjat Kanjilal, Mark M Hammer, Jason L Hornick, and Lynette M Sholl. In situ detec- tion of sars-cov-2 in lungs and airways of patients with covid-19. Modern Pathology, 33(11):2104–2114, 2020. [137] Christophe H Schilling, David Letscher, and Bernhard Ø Palsson. Theory for the systemic definition of metabolic pathways and their use in inter- preting metabolic function from a pathway-oriented perspective. Journal of theoretical biology, 203(3):229–248, 2000. [138] Katharina Schubert, Evangelos D Karousis, Ahmad Jomaa, Alain Scaiola, Blanca Echeverria, Lukas-Adrian Gurzeler, Marc Leibundgut, Volker Thiel, Oliver Mühlemann, and Nenad Ban. Sars-cov-2 nsp1 binds the ri- bosomal mrna channel to inhibit translation. Nature structural & molecular biology, 27(10):959–966, 2020. [139] Stefan Schuster, David A Fell, and Thomas Dandekar. A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks. Nature biotechnology, 18(3):326–332, 2000. [140] Anja Seybert, Annette Hegyi, Stuart G Siddell, and John Ziebuhr. The human coronavirus 229e superfamily 1 helicase has rna and dna duplex- unwinding activities with 5-to-3 polarity. Rna, 6(7):1056–1068, 2000. [141] Jian Shang, Yushun Wan, Chuming Luo, Gang Ye, Qibin Geng, Ashley Auerbach, and Fang Li. Cell entry mechanisms of sars-cov-2. Proceedings of the National Academy of Sciences, 117(21):11727–11734, 2020. [142] Yoshihiro Shimizu, Akio Inoue, Yukihide Tomari, Tsutomu Suzuki, Takashi Yokogawa, Kazuya Nishikawa, and Takuya Ueda. Cell-free translation reconstituted with purified components. Nature biotechnology, 19(8):751–755, 2001. [143] Jonghyeon Shin and Vincent Noireaux. Efficient cell-free expression with 74 the endogenous e. coli rna polymerase and sigma factor 70. Journal of biological engineering, 4(1):1–9, 2010. [144] Jonghyeon Shin and Vincent Noireaux. An e. coli cell-free expression tool- box: application to synthetic gene circuits and artificial cells. ACS synthetic biology, 1(1):29–41, 2012. [145] Takehiro Shinoda, Naoko Shinya, Kaori Ito, Yoshiko Ishizuka-Katsura, Noboru Ohsawa, Takaho Terada, Kunio Hirata, Yoshiaki Kawano, Masaki Yamamoto, Taisuke Tomita, Yohei Ishibashi, Yoshio Hirabayashi, Tomomi Kimura-Someya, Mikako Shirouzu, and Shigeyuki Yokoyama. Cell-free methods to produce structurally intact mammalian membrane proteins. Scientific Reports, 6(1):30442, 2016. [146] Bo Shu and Peng Gong. Structural basis of viral rna-dependent rna poly- merase catalysis and translocation. Proceedings of the National Academy of Sciences, 113(28):E4005–E4014, 2016. [147] Robert W Siegel, Scott Adkins, and C Cheng Kao. Sequence-specific recognition of a subgenomic rna promoter by a viral rna polymerase. Pro- ceedings of the National Academy of Sciences, 94(21):11238–11243, 1997. [148] Nicolas Sierro, Yuko Makita, Michiel J. L. de Hoon, and Kenta Nakai. Dbtbs: a database of transcriptional regulation in bacillus subtilis contain- ing upstream intergenic conservation information. Nucleic Acids Research, 36:93–96, 2008. [149] Adam D. Silverman, Ashty S. Karim, and Michael C. Jewett. Cell-free gene expression: an expanded repertoire of applications. Nature Reviews Genetics, 21(3):151–170, 2020. [150] Matthieu Simeoni, Théo Cavinato, Daniel Rodriguez, and David Gatfield. I (nsp1) ecting sars-cov-2–ribosome interactions. Communications biology, 4(1):1–5, 2021. [151] Everett Clinton Smith, Nicole R Sexton, and Mark R Denison. Thinking outside the triangle: replication fidelity of the largest rna viruses. Annual review of virology, 1:111–132, 2014. [152] EJ Snijder, E Decroly, and J Ziebuhr. The nonstructural proteins direct- ing coronavirus rna synthesis and processing. Advances in virus research, 96:59–126, 2016. 75 [153] Mehran Soltani, Brady R. Davis, Hayley Ford, J. Andrew D. Nelson, and Bradley C. Bundy. Reengineering cell-free protein synthesis as a biosen- sor: Biosensing with transcription, translation, and protein-folding. Bio- chemical Engineering Journal, 138:165–171, 2018. [154] Konstantin MJ Sparrer and Michaela U Gack. Intracellular detection of viral nucleic acids. Current opinion in microbiology, 26:1–9, 2015. [155] Alexander S Spirin, Vladimir I Baranov, Lubov A Ryabova, Sergey Yu Ovodov, and Yuly B Alakhov. A continuous cell-free translation system capable of producing polypeptides in high yield. Science, 242(4882):1162– 1164, 1988. [156] DE Steinmeyer and ML Shuler. Structured model for saccharomyces cere- visiae. Chemical engineering science, 44(9):2017–2030, 1989. [157] Tobias Stögbauer, Lukas Windhager, Ralf Zimmer, and Joachim O Rädler. Experiment and mathematical modeling of gene expression dynamics in a cell-free system. Integrative Biology, 4(5):494–501, 2012. [158] Lorenzo Subissi, Clara C Posthuma, Axelle Collet, Jessika C Zevenhoven- Dobbe, Alexander E Gorbalenya, Etienne Decroly, Eric J Snijder, Bruno Canard, and Isabelle Imbert. One severe acute respiratory syndrome coronavirus protein complex integrates processive rna polymerase and exonuclease activities. Proceedings of the National Academy of Sciences, 111(37):E3900–E3909, 2014. [159] Rahul K Suryawanshi, Raghuram Koganti, Alex Agelidis, Chan- drashekhar D Patil, and Deepak Shukla. Dysregulation of cell signaling by sars-cov-2. Trends in microbiology, 2020. [160] James R Swartz. Expanding biological applications using cell-free metabolic engineering: an overview. Metabolic engineering, 50:156–172, 2018. [161] Stanley Tabor. Expression using the t7 rna polymerase/promoter system. Current protocols in molecular biology, 11(1):16–2, 1990. [162] Melissa K Takahashi, James Chappell, Clarmyra A Hayes, Zachary Z Sun, Jongmin Kim, Vipul Singhal, Kevin J Spring, Shaima Al-Khabouri, Christopher P Fall, Vincent Noireaux, et al. Rapidly characterizing the fast dynamics of rna genetic circuitry with cell-free transcription–translation (tx-tl) systems. ACS synthetic biology, 4(5):503–515, 2015. 76 [163] Melissa K Takahashi, Clarmyra A Hayes, James Chappell, Zachary Z Sun, Richard M Murray, Vincent Noireaux, and Julius B Lucks. Characterizing and prototyping genetic networks with cell-free transcription–translation reactions. Methods, 86:60–72, 2015. [164] Kazuyuki Takai, Tatsuya Sawasaki, and Yaeta Endo. Practical cell-free protein synthesis system using purified wheat embryos. Nature protocols, 5(2):227–238, 2010. [165] Tomohisa Tanaka, Wataru Kamitani, Marta L DeDiego, Luis Enjuanes, and Yoshiharu Matsuura. Severe acute respiratory syndrome coronavirus nsp1 facilitates efficient propagation in cells through a specific transla- tional shutoff of host mrna. Journal of virology, 86(20):11128–11137, 2012. [166] Matthew Zirui Tay, Chek Meng Poh, Laurent Rénia, Paul A MacAry, and Lisa FP Ng. The trinity of covid-19: immunity, inflammation and inter- vention. Nature Reviews Immunology, 20(6):363–374, 2020. [167] Noah D Taylor, Alexander S Garruss, Rocco Moretti, Sum Chan, Mark A Arbing, Duilio Cascio, Jameson K Rogers, Farren J Isaacs, Sriram Kosuri, David Baker, et al. Engineering an allosteric transcription factor to re- spond to new ligands. Nature methods, 13(2):177–183, 2016. [168] Aartjan JW Te Velthuis, Sjoerd HE Van Den Worm, and Eric J Snijder. The sars-coronavirus nsp7+ nsp8 complex is a unique multimeric rna poly- merase capable of both de novo initiation and primer extension. Nucleic acids research, 40(4):1737–1747, 2012. [169] Matthias Thoms, Robert Buschauer, Michael Ameismeier, Lennart Koepke, Timo Denk, Maximilian Hirschenberger, Hanna Kratzat, Manuel Hayn, Timur Mackens-Kiani, Jingdong Cheng, et al. Structural basis for translational shutdown and immune evasion by the nsp1 protein of sars- cov-2. Science, 369(6508):1249–1255, 2020. [170] Aidan Tinafar, Katariina Jaenes, and Keith Pardee. Synthetic biology goes cell-free. BMC biology, 17(1):1–14, 2019. [171] Masaru Tomita, Kenta Hashimoto, Koichi Takahashi, Thomas Simon Shimizu, Yuri Matsuzaki, Fumihiko Miyoshi, Kanako Saito, Sakura Tanida, Katsuyuki Yugi, J Craig Venter, et al. E-cell: software environ- ment for whole-cell simulation. Bioinformatics (Oxford, England), 15(1):72– 84, 1999. 77 [172] Chien-Te K Tseng, Jennifer Tseng, Lucy Perrone, Melissa Worthy, Vsevolod Popov, and Clarence J Peters. Apical entry and release of severe acute respiratory syndrome-associated coronavirus in polarized calu-3 lung epithelial cells. Journal of virology, 79(15):9470–9479, 2005. [173] Olga Tsypik, Roman Makitrynskyy, Agnieszka Bera, Lijiang Song, Wolf- gang Wohlleben, Victor Fedorenko, and Bohdan Ostash. Role of gntr family regulatory gene sco1678 in gluconate metabolism in streptomyces coelicolor m145. BioMed Research International, 2017:9529501–9529501, 2017. [174] Meaghan A. Valliere, Tyler P. Korman, Nicholas B. Woodall, Gregory A. Khitrov, Robert E. Taylor, David Baker, and James U. Bowie. A cell- free platform for the prenylation of natural products and application to cannabinoid production. Nature Communications, 10(1):565–565, 2019. [175] Amit Varma and Bernhard O Palsson. Stoichiometric flux balance mod- els quantitatively predict growth and metabolic by-product secretion in wild-type escherichia coli w3110. Applied and environmental microbiology, 60(10):3724–3731, 1994. [176] Jeffrey D. Varner. Gene Regulatory Network Generator in Julia (JuGRN), April 2020. [177] Jeffrey D Varner. Lectures notes in advanced biomolecular engineering, 2020. [178] Sangita Venkataraman, Burra VLS Prasad, and Ramasamy Selvarajan. Rna dependent rna polymerases: insights from structure, function and evolution. Viruses, 10(2):76, 2018. [179] Michael Vilkhovoy, Abhinav Adhikari, Sandra Vadhin, and Jeffrey D Varner. The evolution of cell free biomanufacturing. Processes, 8(6):675, 2020. [180] Michael Vilkhovoy, David Dai, Sandra Vadhin, Abhinav Adhikari, and Jeffrey D Varner. Absolute quantification of cell-free protein synthesis metabolism by reversed-phase liquid chromatography-mass spectrome- try. JoVE (Journal of Visualized Experiments), (152):e60329, 2019. [181] Michael Vilkhovoy, Nicholas Horvath, Che-Hsiao Shih, Joseph A Way- man, Kara Calhoun, James Swartz, and Jeffrey D Varner. Sequence spe- 78 cific modeling of e. coli cell-free protein synthesis. ACS synthetic biology, 7(8):1844–1857, 2018. [182] Peter L. Voyvodic and Jerome Bonnet. Cell-free biosensors for biomedical applications. Current Opinion in Biomedical Engineering, 13:9–15, 2020. [183] Philip V’kovski, Annika Kratzel, Silvio Steiner, Hanspeter Stalder, and Volker Thiel. Coronavirus biology and replication: implications for sars- cov-2. Nature Reviews Microbiology, 19(3):155–170, 2021. [184] Huihui Wang, Xuemei Li, Tao Li, Shubing Zhang, Lianzi Wang, Xian Wu, and Jiaqing Liu. The genetic sequence, origin, and diagnosis of sars-cov- 2. European Journal of Clinical Microbiology & Infectious Diseases, pages 1–7, 2020. [185] Xiaoqiang Wang, Karolina Corin, Philipp Baaske, Christoph J Wienken, Moran Jerabek-Willemsen, Stefan Duhr, Dieter Braun, and Shuguang Zhang. Peptide surfactants for cell-free production of functional g protein-coupled receptors. Proceedings of the National Academy of Sciences, 108(22):9049–9054, 2011. [186] Joseph A Wayman, Adithya Sagar, and Jeffrey D Varner. Dynamic model- ing of cell-free biochemical networks using effective kinetic models. Pro- cesses, 3(1):138–160, 2015. [187] LA Weber, ER Feman, and C Baglioni. Cell free system from hela cells active in initiation of protein synthesis. Biochemistry, 14(24):5315–5321, 1975. [188] Michael Weinfeld, Jane Lee, Gu Ruiqi, Feridoun Karimi-Busheri, Davis Chen, and Joan Allalunis-Turner. Use of a postlabelling assay to examine the removal of radiation-induced dna lesions by purified enzymes and human cell extracts. Mutation Research/Fundamental and Molecular Mecha- nisms of Mutagenesis, 378(1-2):127–137, 1997. [189] Wolfgang Wiechert. 13c metabolic flux analysis. Metabolic engineering, 3(3):195–206, 2001. [190] T Winnick et al. Studies on the mechanism of protein, synthesis in em- bryonic and tumor tissues. 1. evidence relating to the incorporation of la- beled amino acids into protein structure in homogenates. Arch. Biochem., 27, 1950. 79 [191] Aiping Wu, Yousong Peng, Baoying Huang, Xiao Ding, Xianyue Wang, Peihua Niu, Jing Meng, Zhaozhong Zhu, Zheng Zhang, Jiangyuan Wang, et al. Genome composition and divergence of the novel coronavirus (2019- ncov) originating in china. Cell host & microbe, 27(3):325–328, 2020. [192] P Wu, NG Ray, and ML Shuler. A single-cell model for cho cells a. Annals of the New York Academy of Sciences, 665(1):152–187, 1992. [193] Raghavendra Yadavalli and Tobili Sam-Yellowe. Hela based cell free ex- pression systems for expression of plasmodium rhoptry proteins. JoVE (Journal of Visualized Experiments), (100):e52772, 2015. [194] Liming Yan, Ying Zhang, Ji Ge, Litao Zheng, Yan Gao, Tao Wang, Zhihui Jia, Haofeng Wang, Yucen Huang, Mingyu Li, et al. Architecture of a sars- cov-2 mini replication and transcription complex. Nature communications, 11(1):1–6, 2020. [195] Wanchao Yin, Chunyou Mao, Xiaodong Luan, Dan-Dan Shen, Qingya Shen, Haixia Su, Xiaoxi Wang, Fulai Zhou, Wenfeng Zhao, Minqi Gao, et al. Structural basis for inhibition of the rna-dependent rna polymerase from sars-cov-2 by remdesivir. Science, 368(6498):1499–1504, 2020. [196] Ken-Ichi Yoshida, Yasutaro Fujita, and Akinori Sarai. Missense mutations in the bacillus subtilis gnt repressor that diminish operator binding ability, 1993. [197] Ken-ichi Yoshida, Hisanobu Ohmori, Yasuhiko Miwa, and Yasutaro Fu- jita. Bacillus subtilis gnt repressor mutants that diminish gluconate- binding ability. Journal of bacteriology, 177(16):4813–4816, 1995. [198] Francis K Yoshimoto. The proteins of severe acute respiratory syndrome coronavirus-2 (sars cov-2 or n-cov19), the cause of covid-19. The protein journal, 39:198–216, 2020. [199] Anne Zemella, Lena Thoring, Christian Hoffmeister, and Stefan Kubick. Cell-free protein synthesis: pros and cons of prokaryotic and eukaryotic systems. ChemBioChem, 16(17):2420, 2015. [200] Yang Zhang. Genome wide structure and function modeling of sars-cov- 2. https://zhanglab.ccmb.med.umich.edu/COVID-19/, 2020. [201] Ying Zhang, Ines Thiele, Dana Weekes, Zhanwen Li, Lukasz Jaroszewski, 80 Krzysztof Ginalski, Ashley M Deacon, John Wooley, Scott A Lesley, Ian A Wilson, et al. Three-dimensional structural view of the central metabolic network of thermotoga maritima. Science, 325(5947):1544–1549, 2009. [202] Wei Zhu, Catherine Z Chen, Kirill Gorshkov, Miao Xu, Donald C Lo, and Wei Zheng. Rna-dependent rna polymerase as a target for covid-19 drug discovery. SLAS DISCOVERY: Advancing the Science of Drug Discovery, 25(10):1141–1151, 2020. 81