Team:KAIST-Korea/Project/References
From 2010.igem.org
Luftschloss (Talk | contribs) |
Luftschloss (Talk | contribs) |
||
Line 6: | Line 6: | ||
==Data sources== | ==Data sources== | ||
During our project, we required many data. We used the data of protein-protein interactions for signal transduction pathway deciding, protein sequence and domain description for modified FGFR design, gene sequence for modified FGFR sequence synthesis, structures or sequences of antibodies and FGF binding domain of FGFR for structural alignment. These data are from Uniprot, PID, NCBI and RCSB PDB. | During our project, we required many data. We used the data of protein-protein interactions for signal transduction pathway deciding, protein sequence and domain description for modified FGFR design, gene sequence for modified FGFR sequence synthesis, structures or sequences of antibodies and FGF binding domain of FGFR for structural alignment. These data are from Uniprot, PID, NCBI and RCSB PDB. | ||
+ | |||
===Uniprot=== | ===Uniprot=== | ||
[[Image:UniProt.png|200px|right]] | [[Image:UniProt.png|200px|right]] | ||
Uniprot provides sequences of many proteins and domain description of well-researched proteins. Key advantage to use Uniprot is domain description. Without knowledge of function of each parts of protein, to design engineered protein for detecting ''Mycobacterium. Tubeculosis'' antigen MPT51 is impossible. Uniprot provided the information of location of FGF binding domain of FGFR which is replaced by our single-chain antibody 16A1. Uniprot also provided sequences of some antibodies to make single chain antibody sequences. | Uniprot provides sequences of many proteins and domain description of well-researched proteins. Key advantage to use Uniprot is domain description. Without knowledge of function of each parts of protein, to design engineered protein for detecting ''Mycobacterium. Tubeculosis'' antigen MPT51 is impossible. Uniprot provided the information of location of FGF binding domain of FGFR which is replaced by our single-chain antibody 16A1. Uniprot also provided sequences of some antibodies to make single chain antibody sequences. | ||
* Site URL : http://www.uniprot.org/ | * Site URL : http://www.uniprot.org/ | ||
+ | |||
===PID=== | ===PID=== | ||
[[Image:PID.png|200px|right]] | [[Image:PID.png|200px|right]] | ||
PID(Pathway Interaction Database) provides the interaction networks between protein and protein or protein and DNA through certain signal transduction pathway. This Protein-protein interaction(PPI) and protein-DNA interaction(PDI) data helped us to port Human signal transduction pathway which is activated by FGF to fission yeast. Without knowledge of PPI and PDI through FGF signal pathway, we must undergo many trial and error of adding and removing of proteins and promoters to form working signal transduction pathway. With the data from PID, we decided to port the FGF->FGFR1->STAT1->GAS pathway from human to fission yeast. | PID(Pathway Interaction Database) provides the interaction networks between protein and protein or protein and DNA through certain signal transduction pathway. This Protein-protein interaction(PPI) and protein-DNA interaction(PDI) data helped us to port Human signal transduction pathway which is activated by FGF to fission yeast. Without knowledge of PPI and PDI through FGF signal pathway, we must undergo many trial and error of adding and removing of proteins and promoters to form working signal transduction pathway. With the data from PID, we decided to port the FGF->FGFR1->STAT1->GAS pathway from human to fission yeast. | ||
* Site URL : http://pid.nci.nih.gov/ | * Site URL : http://pid.nci.nih.gov/ | ||
+ | |||
===NCBI=== | ===NCBI=== | ||
[[Image:NCBI.png|200px|right]] | [[Image:NCBI.png|200px|right]] | ||
NCBI provides many data for biologists. We used protein sequence and DNA sequence from NCBI nucleotide database and protein database. DNA sequence is very important to us because gene sequence of original protein is required to synthesize novel engineered protein. Even we don't synthesize gene, we should know the sequence of gene because biobrick require not only nucleotide material, but also its sequence information. NCBI also provided sequences of some antibodies to make single chain antibody sequence, and many Journals through PubMed service. | NCBI provides many data for biologists. We used protein sequence and DNA sequence from NCBI nucleotide database and protein database. DNA sequence is very important to us because gene sequence of original protein is required to synthesize novel engineered protein. Even we don't synthesize gene, we should know the sequence of gene because biobrick require not only nucleotide material, but also its sequence information. NCBI also provided sequences of some antibodies to make single chain antibody sequence, and many Journals through PubMed service. | ||
* Site URL : http://ncbi.nlm.nih.gov/ | * Site URL : http://ncbi.nlm.nih.gov/ | ||
+ | |||
===RCSB PDB=== | ===RCSB PDB=== | ||
[[Image:RCSB-PDB.png|200px|right]] | [[Image:RCSB-PDB.png|200px|right]] | ||
RCSB PDB(Protein Data Bank) provides the data of structure of protein or other biomolecules. Key feature of data from PDB is the structure. NCBI or Uniprot provide the sequence of proteins, but it don't shows us the 3D-structure of them. With the structural similarity between FGF binding domain of FGFR and single chain antibody 16A1, we can sure that the replacement of FGF binding domain with 16A1 to detect MPT51 is appropriate. | RCSB PDB(Protein Data Bank) provides the data of structure of protein or other biomolecules. Key feature of data from PDB is the structure. NCBI or Uniprot provide the sequence of proteins, but it don't shows us the 3D-structure of them. With the structural similarity between FGF binding domain of FGFR and single chain antibody 16A1, we can sure that the replacement of FGF binding domain with 16A1 to detect MPT51 is appropriate. | ||
* Site URL : http://www.pdb.org/ | * Site URL : http://www.pdb.org/ | ||
+ | |||
==Tools== | ==Tools== | ||
During our project, we processed many bioinformations. It is not easy to process many informations manually is not easy. So we used many bioinformation tools for our projects. We marked restriction sites to select proper restriction enzyme, searched the nucleotide which coding query peptide sequence to find the coding region of certain genes, virtual-translated given nucleotide sequence to check our sequence coding expected protein, and predicted and align structure of single chain antibodies with FGF binding domain of FGFR to check single chain antibodies are structurally similar to confirm that replacement of FGF binding domain with single chain antibodies are appropriate. We used BioEdit to mark restriction site, BLAST to find the coding region of certain genes and to compare similar proteins, Transeq for virtual-translation, Modeller for structural prediction of single chain antibody, and Matt for structural alignment between single chain antibodies and FGF binding domain of FGFR. | During our project, we processed many bioinformations. It is not easy to process many informations manually is not easy. So we used many bioinformation tools for our projects. We marked restriction sites to select proper restriction enzyme, searched the nucleotide which coding query peptide sequence to find the coding region of certain genes, virtual-translated given nucleotide sequence to check our sequence coding expected protein, and predicted and align structure of single chain antibodies with FGF binding domain of FGFR to check single chain antibodies are structurally similar to confirm that replacement of FGF binding domain with single chain antibodies are appropriate. We used BioEdit to mark restriction site, BLAST to find the coding region of certain genes and to compare similar proteins, Transeq for virtual-translation, Modeller for structural prediction of single chain antibody, and Matt for structural alignment between single chain antibodies and FGF binding domain of FGFR. | ||
+ | |||
===BioEdit=== | ===BioEdit=== | ||
[[Image:BioEdit.png|200px|right]] | [[Image:BioEdit.png|200px|right]] | ||
Line 30: | Line 36: | ||
* License : Freeware | * License : Freeware | ||
* Download : http://www.mbio.ncsu.edu/BioEdit/bioedit.html | * Download : http://www.mbio.ncsu.edu/BioEdit/bioedit.html | ||
+ | |||
===BLAST=== | ===BLAST=== | ||
[[Image:BLAST.png|200px|right]] | [[Image:BLAST.png|200px|right]] | ||
Line 55: | Line 62: | ||
* License : Free fore non-profit academic institutions | * License : Free fore non-profit academic institutions | ||
* Download : http://www.salilab.org/modeller/download_installation.html | * Download : http://www.salilab.org/modeller/download_installation.html | ||
+ | |||
===Matt=== | ===Matt=== | ||
[[Image:Matt.png|200px|right]] | [[Image:Matt.png|200px|right]] |
Revision as of 06:47, 10 August 2010
Data sourcesDuring our project, we required many data. We used the data of protein-protein interactions for signal transduction pathway deciding, protein sequence and domain description for modified FGFR design, gene sequence for modified FGFR sequence synthesis, structures or sequences of antibodies and FGF binding domain of FGFR for structural alignment. These data are from Uniprot, PID, NCBI and RCSB PDB. UniprotUniprot provides sequences of many proteins and domain description of well-researched proteins. Key advantage to use Uniprot is domain description. Without knowledge of function of each parts of protein, to design engineered protein for detecting Mycobacterium. Tubeculosis antigen MPT51 is impossible. Uniprot provided the information of location of FGF binding domain of FGFR which is replaced by our single-chain antibody 16A1. Uniprot also provided sequences of some antibodies to make single chain antibody sequences.
PIDPID(Pathway Interaction Database) provides the interaction networks between protein and protein or protein and DNA through certain signal transduction pathway. This Protein-protein interaction(PPI) and protein-DNA interaction(PDI) data helped us to port Human signal transduction pathway which is activated by FGF to fission yeast. Without knowledge of PPI and PDI through FGF signal pathway, we must undergo many trial and error of adding and removing of proteins and promoters to form working signal transduction pathway. With the data from PID, we decided to port the FGF->FGFR1->STAT1->GAS pathway from human to fission yeast.
NCBINCBI provides many data for biologists. We used protein sequence and DNA sequence from NCBI nucleotide database and protein database. DNA sequence is very important to us because gene sequence of original protein is required to synthesize novel engineered protein. Even we don't synthesize gene, we should know the sequence of gene because biobrick require not only nucleotide material, but also its sequence information. NCBI also provided sequences of some antibodies to make single chain antibody sequence, and many Journals through PubMed service.
RCSB PDBRCSB PDB(Protein Data Bank) provides the data of structure of protein or other biomolecules. Key feature of data from PDB is the structure. NCBI or Uniprot provide the sequence of proteins, but it don't shows us the 3D-structure of them. With the structural similarity between FGF binding domain of FGFR and single chain antibody 16A1, we can sure that the replacement of FGF binding domain with 16A1 to detect MPT51 is appropriate.
ToolsDuring our project, we processed many bioinformations. It is not easy to process many informations manually is not easy. So we used many bioinformation tools for our projects. We marked restriction sites to select proper restriction enzyme, searched the nucleotide which coding query peptide sequence to find the coding region of certain genes, virtual-translated given nucleotide sequence to check our sequence coding expected protein, and predicted and align structure of single chain antibodies with FGF binding domain of FGFR to check single chain antibodies are structurally similar to confirm that replacement of FGF binding domain with single chain antibodies are appropriate. We used BioEdit to mark restriction site, BLAST to find the coding region of certain genes and to compare similar proteins, Transeq for virtual-translation, Modeller for structural prediction of single chain antibody, and Matt for structural alignment between single chain antibodies and FGF binding domain of FGFR. BioEditBioEdit is the program for display of biological sequences. It displays different amino acids or nucleotides with different colors to check the change between sequences. It have many simple but useful functions. We used BioEdit for marking the restriction sites on given sequence to select proper restriction enzyme which don't restrict coding region of gene. Other functions of BioEdit like phylogeny making or front-end of ClustalW is not used for our projects but they are also useful.
BLASTBLAST is the alignment search tools for protein or nucleotide sequences. There are five modes of BLAST; blastn(nucleotide to nucleotide), blastp(Protein to Protein), blastx(nucleotide to protein). tblastn(protein to nucleotide), and tblastx(translated nucleotide to translated nucleotide). We used tblastn to find the location of coding region of given protein, blastn to find the differences between transcription variants of same genes.
TranseqIt is not difficult work to make protein sequence from nucleotide sequence . With codon table, we can make protein sequence manually without any special ability. But to translate long nucleotide sequence is not easy way. So we used Transeq to virtual-translate given nucleotides sequence. Transeq also do virtual-translation with shifted reading frame or non-transitional translation code tables like mitochondrial translation code table.
ModellerModeller is the program to predict the structure of protein with given peptide sequence based on the homology model. Modeller search the similar sequence from database sequence whose structure is known yet with given query sequence. And assume that similar sequence have similar structure then predict the structure of query protein as the combination of known structure with similar sequence. This method is very useful for prediction of single chain antibodies because the structures of many original antibodies are known.
MattIt is also possible to compare the structures between FGF binding domain of FGFR and single chain antibodies from prediction of Modeller manually. But is is not quantitative and estimated by rule of thumb. So the result is not useful for further analysis. We used Matt to compare the structures of FGF binding domain of FGFR and single chain antibodies. Matt uses the algorithm to maximize shared structure with small translation and rotations. Matt provide the quantitative result to estimate similarity and aligned structures of proteins to visualize the alignment.
PyMolStructure of complex of protein and other biomolecules are often saved as format of “*.PDB”. And to visualize and analysis of that structure and sequence is also important to design novel engineered proteins. PyMol is used for this works. We used PyMol for two processes. At first, it is used to confirm iG-like regions of FGFR is really FGF binding domain. We downloaded the FGF binding domain of FGFR from RCSB PDB and checked the sequence binds to the FGF is really marked as iG-lie regions. (Interleukin receptors have iG-like regions but they don't bind to its signal molecules.) Other process using PyMol is to visualize the structural alignment result made by Matt. PyMol saves the image of biomolecules as png format which is usable for many image processing programs.
References
|