Team:VT-ENSIMAG/Table6
From 2010.igem.org
(One intermediate revision not shown) | |||
Line 3: | Line 3: | ||
- | An integral feature of the sequence screening process is the keyword and anti-keyword lists used to identify whether the “Best Match” is to a Select Agent or Toxin. | + | An integral feature of the sequence screening process is the keyword and anti-keyword lists used to identify whether the “Best Match” is to a Select Agent or Toxin. The more limited key word list is only composed of words found on the CDC Select Agent and Toxin List. Our keyword list includes alternative names for, and words related to, the entries on the CDC Select Agent and Toxin List. In the case of toxins, related words include the names of enzymes which are intimately associated with the toxin’s production and function as well as organisms which directly produce the toxin. For organism and virus entries, related words include the names of diseases associated with the entries in addition to any toxins or pathogenic agents uniquely produced by the entry. Discretion was used when developing the key world list because an overly inclusive list could increase the number of false positive results. |
[[Image:VTIMAG_Keyword.png|center|frame|<br>''Extract of the Keyword database'']] | [[Image:VTIMAG_Keyword.png|center|frame|<br>''Extract of the Keyword database'']] | ||
<br> | <br> | ||
Line 12: | Line 12: | ||
<br> | <br> | ||
- | Since the keyword list identifies dangerous sequences as such, the success of the screening software relies heavily upon its content. An advantage keyword finding is that it can be automated and the keyword list can be refined over time. A drawback is that it sees in black and white; it cannot make judgment calls as a human can. To test the | + | Since the keyword list identifies dangerous sequences as such, the success of the screening software relies heavily upon its content. An advantage keyword finding is that it can be automated and the keyword list can be refined over time. A drawback is that it sees in black and white; it cannot make judgment calls as a human can. |
+ | <br> | ||
+ | Our keyword list contains 338 keyword, and we have 37 anti-keywords. To test the effeciency of our keyword list, we developed a second keyword list, which was just the basic one that one can extract fron the CCL list, and we compared different combinations of the limited and extensive keyword lists with the anti-keyword lists were set as parameters for the program. | ||
<br> | <br> |
Latest revision as of 14:44, 27 September 2010
Keyword List
|
Since the keyword list identifies dangerous sequences as such, the success of the screening software relies heavily upon its content. An advantage keyword finding is that it can be automated and the keyword list can be refined over time. A drawback is that it sees in black and white; it cannot make judgment calls as a human can.
|