Team:VT-ENSIMAG/Introduction
From 2010.igem.org
Line 16: | Line 16: | ||
[[#top|top]] | [[#top|top]] | ||
- | ==Sequence Alignment== | + | ==Sequence Alignment: BLAST== |
- | In order to make a sequence screening software, we have to perform many sequence alignment. It consists in comparing sequences in order to tell how near they are. The tool we used for that, is, as suggested in the federal guideline, [ | + | In order to make a sequence screening software, we have to perform many sequence alignment. It consists in comparing sequences in order to tell how near they are. The tool we used for that, is, as suggested in the federal guideline, BLAST: Basic Local Alignement Search Tool. |
+ | |||
+ | Blast is a software available on the ncbi web site ([http://blast.ncbi.nlm.nih.gov/Blast.cgi NCBI Blast website]). BLAST performed local alignments on a query sequence against the Genbank database. Given an input sequence, BLAST give us a list of the most similar known sequences, with statistical scores to measure how near are the matching sequences. | ||
[[#top|top]] | [[#top|top]] |
Revision as of 14:27, 27 September 2010
Sequence screening
|
Sequence screenig: Why and How?IntroductionGene synthesis technology gives scientists an unparalleled capability to manipulate genomes. Over the past several decades, an entire commercial industry has developed to inexpensively produce genes on a large scale. It is this industry which provides the manufactured genes and standardized parts to make synthetic biology, and iGEM, possible. Synthetic genomics, like synthetic biology, has the potential to act as both a great benefit and a great detriment to public health and national security. A precedence for the dual use of synthetic genomics is the reconstruction of the virus responsible for the pandemic 1918 Spanish Flu in 2005 by researchers at the CDC ([http://en.wikipedia.org/wiki/1918_flu_pandemic See more]). This highly infectious strain is estimated to have killed as many as 50,000,000 people worldwide. Although the 1918 Spanish flu genes were synthesized for legitimate research purposes, they could have just as easily been used to reconstruct a biological weapon. It should be noted that the reconstructed strain was partially attenuated (1). This, however, does not preclude the possibility of more virulent forms being engineered in the future. Although such engineering is difficult at the moment, advances in this technology over the next decade could make it easier for bioterrorists to harm the Public. According to a 2004 report by the U.S. National Intelligence Council, its greatest security concern over the coming years is that terrorists will acquire biological agents for use as weapons of mass destruction (NIC, 2004). Many nucleotide sequences encoding for or derived from dangerous toxins or pathogens can be freely accessed on the U.S. National Center for Biotechnology Information GenBank (NCBI-GenBank). The ease with which dangerous sequences can be located and synthesized presents novel threats to both public health and national security. To prevent illicit activities by end users of de novo synthesized genes, it is crucial to stop their manufacture at the source: gene synthesis companies. Therefore, effective and efficient screening measures must be developed to identify sequences of concern within a synthesis order. The United States government recognizes its responsibility to protect the public and in November, 2009, published a draft guidance for sequence screening. As part of our iGEM 2010 project, we are implementing the draft Government guidance for sequence screening, characterizing its performance, and suggesting improvements. Sequence Alignment: BLASTIn order to make a sequence screening software, we have to perform many sequence alignment. It consists in comparing sequences in order to tell how near they are. The tool we used for that, is, as suggested in the federal guideline, BLAST: Basic Local Alignement Search Tool. Blast is a software available on the ncbi web site ([http://blast.ncbi.nlm.nih.gov/Blast.cgi NCBI Blast website]). BLAST performed local alignments on a query sequence against the Genbank database. Given an input sequence, BLAST give us a list of the most similar known sequences, with statistical scores to measure how near are the matching sequences. Current screening statePresentation of the problem and of current proposed guidance: State of art GenoTHREAT, our sequence screening softwareThe software we have implemented is called GenoTHREAT. Given a DNA sequence, GenoTHREAT indicates if this sequence may be one of concern or not. The algorithm and the implementation of this software are detailled in Genothreat. Tests and ResultsIn order to characterize the government guideline and the software we created, we have implemented and executed different tests. See Tests and Results. ConclusionWe have succeed in implementing a functioning sequence screening software. The main issue raised is the cost of doing a good software, rapid and efficient. We have also shown the influence of the software parameters as the keyword list or the BLAST parameters. The guideline we have followed was not precise enough and left a great place to interpretation. A new version must be edited in the next years. But we have showed that in following it, the result was an efficient sequence screening software, but expensive, and that must raised to many false hits. |