Team:Calgary/Project/Transcription

From 2010.igem.org

Transcription/Translation Circuit

Overview

Transcription and translation are essential processes for protein expression. Problems that arise during these processes could lead to improper protein formation. Issues that can occur include shortage in length, folding problems, low or no expression, etc. These issues are accentuated in synthetic biology as foreign genes are implemented into prokaryotes such as Escherichia coli. The transcription translation detector circuit was developed in order to test whether or not a gene of interest is being correctly transcribed and translated.


How the Circuit Works

The gene of interest is fused to a mutant RFP. Downstream of this is GFP with its own ribosomal binding site. If transcription is occurring, the transcript would include the gene of interest, RFP as well as GFP. Because GFP has its own ribosomal biding site, it should be translated if transcription is happening.


If the gene of interest is also being translated, then RFP should also be translated because it is fused to the GOI.the RFP was specially selected from Dr. Lewenza’s lab. This RFP (nicknamed sRFP or special red fluorescent protein) can fold in the cytoplasm, periplasm and the cellular membrane.



Details

Assumptions:

For this circuit to work, there are several assumptions that must be made. The first of which is a result of a limitation within the design which is that sRFP will not affect the stability of the protein of interest. Both positives and negatives are not ideal because the circuit functions as an indicator, any assistance could lead to false positives or vice versa. Second, AraC is the right promoter for this circuit. Although there are many benefits for using an arabinose inducible promoter, however evolutionary conditions have established optimal expression in natural promoters. Third is folding properties in the periplasm and cytoplasm (Lewenza, et al., 2006) had to be the same such that a sRFP in the cytoplasm will give the same absorbance as one in the periplasm. Fourth would be that the GOI does not contain a “rut” site (Rho utilization site) which would prematurely stop transcription using Rho dependent termination. Fifth would be that E.coli would be the most compatible cell available for protein expression. Much like the second assumption, genes are optimally expressed in its natural host. Transferring these genes into E.coli might decrease the efficiency of protein expression. These are considerations that must be made in order to ensure the success of this circuit towards its utilization within our testing kit. It is definitely more “artificial” compared to the other two mostly because it overrides the necessity for the natural systems within. However if all limitations are accounted for, this could be a very useful tool if coupled with our other systems.

Helpful tips with understanding the circuit: With the way the circuit is developed, a failure of transcription will lead to a failure of translation. Therefore it is impossible to see only red cells, but possible to see green cells. If a brownish color is expressed (a mixture of red and green), this indicates both transcription and translation. Also if only green cells are noticed, then to definitively test whether or not there is something wrong with translation, a user must employ the other two circuits. Meaning positive in the folding circuits indicates the translation mechanism works. However, due to the design of attaching sRFP with the GOI, the GOI misfolding will affect the stability of sRFP.



Problems that can arise during transcription / translation:

There are numerous problems that can arise in the transcription and translation, especially when trying to turn E.coli into a factory for foreign proteins. Each category of transcription and translation can be broken down into pre, during, and post. Although some aspects between post-transcription and pre-translation are slightly grey, there are parts of it that are quite clear. For example, the attachment of the 30S subunit from rRNA would be considered pre-translational but not post-transcriptional. This section describes some of the possible transcription/translation issues and the following responses by the system.


Transcription

Pre-transcription

Transcription factors

One of the main ideas of synthetic biology is the expression of proteins from foreign enzymes. For example, GFP comes from Aequorea Victoria (Andersen, et al., 1998) . One of the considerations is whether or not foreign circuits have the corresponding transcriptions within E.coli. If these transcription factors have a profound effect on whether or not transcription can occur (Kleinert, et al., 2003) , then natural promoters might be hindered or lack the necessary transcription factors for expression. Therefore it is necessary to include an arabinose promoter (pBad/araC), a well characterized and working promoter in E.coli. (iGEM registry,2003)

If the problem of the foreign circuit lies in the promoter, the circuit can be used to detect this simply through inserting the RBS+GOI into the circuit and compare this with inserting the foreign promoter + RBS + GOI. If there is expression without the foreign promoter, and no expression with it, then there could be a repressor bounded to the operon of the circuit. If there is expression in both then a third circuit can be constructed with just the foreign promoter + RBS + GOI without the arabinose promoter. If there is no expression in the third, then the foreign promoter lacks the necessary transcription factors to operate in the host E.coli.

Promoter strength

This is not a problem with natural promoters however this is an issue faced by many synthetic biologist when matching a promoter with a GOI. More is not always better, over expression of protein could lead to higher amounts of aggregation and longer folding time.(Brock, 2010) Choosing the pBAD/araC promoter is beneficial because induction varies with arabinose concentrations. Therefore it is possible to use a 96 well plate with varying levels of arabinose to promote induction at various strengths. A plate reader can then be used to read absorbance levels to find the optimal amount of indicator expressed.


Transcription

Repressor/amount of inducer

The ratio of inducer to plasmid copy number would be a problem when trying to express a foreign protein in E.coli. Much like issue with transcription factors, the circuit was designed to include an arabinose promoter that way it is possible to control the concentration of the inducer arabinose. In that case there will be no shortage in the concentration of inducer because the promoter is well characterized meaning its induction is known.

Hair pin loop/rho dependent termination

The formation of premature hair pin loops and rho utilization sites formed from within the gene are potential methods of premature stops to transcription. Hair pin loops are typically 7 to 20 amino acids long (Lewin, 2007) and ruts sites are 22-116 base pairs.(Banerjee, et al., 2007) The more likely of the two when forming an accidental termination site would be a hair pin loop. This relies on the palindrome formation with high concentrations of guanine and cysteine which results in a RNA pulling from the DNA. Our system would detect premature termination of RNA would result in no signal with our GFP signal.


Post-transcription

mRNA shape degradation

Although transcription occurs, mRNA instability results in the degradation of the mRNA. The circuit would suggest that the GFP report was not expressed. Despite transcription occurring completely, the most logical approach would be to group this under issues with transcription, also because pre-translational steps have not occurred yet.


Translation

Pre-translation

No current issues arise from this step.


Translation

Multi codon usage

When inserting foreign genes into E.coli, the ratios of tRNAs in E.coli in comparison to the foreign source can vary. Shortages in tRNA can lead to problems with rate and accuracy of translation. (Ran and Higgs, 2010) Kinetics is a factor of rate of protein formation, decreased concentrations of necessary tRNAs results in slower formation of proteins. Based on the research by Drummond and Wilke, lack of accuracy is caused by mistranslation causing higher amount of misfolding.(Drummond and Wilke, 2008) If the GOI’s multi codon usage disagrees with the host E.coli, there would be aggregation which will inhibit protein expression. The circuit can detect that there are problems with translation because sRFP would be form aggregate bodies with the protein of interest (POI).

Premature stop codon

Stop codons inhibit translation. The circuit would indicate the presence of a premature stop codon because the sRFP would not be translated therefore no signal would be present.

RBS compatibility

The ribosome binding site allows the attachment of the ribosome. Differences in ribosome strength could change the translation frequencies. This leaves room for protein misfolding. Because of the specificity of the RBS to the expression of the gene, as well as the potential of affecting the triple nucleotide site which could shift the reading frame. The circuit was designed in such a way that the user is capable of attaching their own RBS.

Copy number

Copy number refers to the number of plasmids that can exist within on E.coli cell.(iGEM Registry, 2009) Although this does not change the rate of transcription (polymerase per second, PoPs) like promoter strength, the effects are similar. Increasing the concentration of slower folding proteins could result in aggregation due to exposed hydrophobic segments . (Ran and Higgs, 2010) The circuit will detect this as an issue with translation as this could affect the protein.


Post-translation

Lack of chaperones

The lack of essential chaperones could result in protein misfolding. E.coli may not have the necessary chaperones to correct the conformation of the POI. The formation of misfolded protein will cause the aggregation of sRFP, therefore indicating an error in translation.



Design of the Circuit

Photobucket

pBad/araC Promoter- This promoter was chosen because it allow for variable strength without replacing the promoter (if the circuit had a promoter library). Because more is not always better, the user can customize optimal levels of promoter strength in protein expression. This part is also highly characterized (iGEM Registry, 2003).

Multiple Cloning Sites- We are using modified biobrick prefix and suffix. What this means is that these sites are not separating the biobrick parts from the sequences, rather they are located between the arabinose promoter and the sRFP.

ccdB- A place holder that was added for selection in addition to antibiotic selection. The circuit will contain a suicide ccdB gene as a placeholder for the GOI. If this is not removed, the cell which has this transformed plasmid will die. This will ensure that the only cells present on the plate will only express the genes intended to be there.

RBS- We have decided not include a RBS within this sequence to allow customizability. Natural RBS are known to indicate optimal PoPs plus issues with this would indicate compatibility problems on the part of the RBS and GOI. This would also eliminate any issues regard reading frame shifts of the RBS to the GOI for those that are biobricking new parts.

sRFP (special red fluorescent protein)- This is part of the translation portion of the circuit. This indicator was chosen because it can fold in the cytoplasm, periplasm and membranes.(Lewenza, et al., 2006) One of the limitations of this circuit is that the GOI must be fused to sRFP in order for translation detection to occur. This means additional time on the part of the user to rebiobrick the end portion of the GOI such that the stop codons are removed. Current studies by Lewenza, et al. reveals that RFP can be localized in the cytoplasm as well as the outer membrane.

Reference:

1. Andersen, J. B., Sternberg, C., Poulsen, L. K., Bjorn, S. P., Givskov, M., Molin, S., et al. (1998). New Unstable Variants of Green Fluorescent Protein for Studies of Transient Gene Expression in Bacteria. Appl. Envir. Microbiol., 64(6), 2240-2246. Retrieved from http://aem.asm.org/cgi/content/abstract/64/6/2240.

2. Banerjee, S., Chalissery, J., Bandey, I., & Sen, R. (2006). Rho-dependent transcription termination: more questions than answers. Journal of microbiology (Seoul, Korea), 44(1), 11-22. Retrieved from: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1838574&tool=pmcentrez&rendertype=abstract.

3. Drummond, D. A., & Wilke, C. O. (2008). Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution. Cell, 134(2), 341-52. doi: 10.1016/j.cell.2008.05.042.

4. Help:plasmid backbones/features. (2008). Retrieved from http://partsregistry.org/Help:Plasmid_backbones/Features

5. Kleinert, H., Schwarz, P. M., & Förstermann, U. (n.d.). Regulation of the expression of inducible nitric oxide synthase. Biological chemistry, 384(10-11), 1343-64. doi: 10.1515/BC.2003.152.

6. Kosuri, S. (2003, December 5). Part:bba_i0500. Retrieved from http://partsregistry.org/Part:BBa_I0500

7. Lewin, Benjamin (2007). Genes IX. Sudbury, MA: Jones and Bartlett Publishers.

8. Lewenza, S., Vidal-Ingigliardi, D., & Pugsley, A. P. (2006). Direct visualization of red fluorescent lipoproteins indicates conservation of the membrane sorting rules in the family Enterobacteriaceae. Journal of bacteriology, 188(10), 3516-24. doi: 10.1128/JB.188.10.3516-3524.2006.

9. Madigan, M.T., Martinko, J.M., Dunlap, P.V., & Clark, D.P. (2008). Brock biology of microorganisms (12th edition). San Francisco, California: Benjamin Cummings.

10. Ran, W., & Higgs, P. G. (2010). The influence of anticodon-codon interactions and modified bases on codon usage bias in bacteria. Molecular biology and evolution, 27(9), 2129-40. doi: 10.1093/molbev/msq102.