|
|
(19 intermediate revisions not shown) |
Line 1: |
Line 1: |
| {{Team:USTC_Software/Header}} | | {{Team:USTC_Software/Header}} |
- | | + | {| |
- | ==Overall project==
| + | |
- | One of the objectives of synthetic biology is the design and construction of new biological devices and systems that deliver useful applications. CAD (computer-aided design) tools, such as TinkerCell, have been developed to model and simulate biological systems and to report their behavior as a guide for design. However, they all require users to provide details of the system network, such as the activation and repression of transcription and translation reactions in a genetic regulatory network. Even professionals find it difficult to construct a detailed network of a complex system from memory alone. In this sense, modeling through CAD tools does not reduce the work of modeling: users still have to model manually and enter their model in detail. To solve this problem, we designed a new method for automatic modeling: our program needs only the assembly of parts and the initial environment parameters to discover the network and generate the model automatically. We believe this will greatly relieve users from handling the complex interactions of species in a biological system.
| + | |
- | | + | |
- | It is impossible to construct a database that explicitly lists every possible reaction for every system.
| + | |
- | To enable automation, we assume that the behavior of a complex is determined by its composite structure, including both its overall profile and its individual parts. We therefore propose a novel '''Chain-Node''' model for complexes with multiple chains bound together. We also extend the concept of a Part so that all species in the biological system can be described structurally by the '''Chain-Node''' model.
| + | |
- | | + | |
- | Our program is modular and has three major components. The first is the user interface (link). Users assemble parts by drag-and-drop and set up initial conditions there, and the system behavior is displayed there as output. The second is the database, written in '''MoDeL'''. The last is our core program, compatible with '''MoDeL''', which searches for and discovers the system network based on the user's input and the data stored in our database.
| + | |
- | | + | |
- | == MoDeL: Modeling Database Language ==
| + | |
- | <br />
| + | |
- | === Bring Biological Modeling to the Next Level ===
| + | |
- | ----
| + | |
- | | + | |
- | {| cellpadding="10" cellspacing="0" | + | |
- | |-
| + | |
- | |[https://2010.igem.org/Team:USTC_Software/model_features '''Chain-Node Model'''] (Figure 1) is a new ''Complex Modeling Concept'' that combines detailed structural description with universal applicability. Instead of treating a complex as a whole and ignoring its composition and structure, the Chain-Node Model views a complex as a construction of its basic [https://2010.igem.org/Team:USTC_Software/model_features '''Parts''']. As its name implies, the model has two components: ''Chain'' and ''Node''. Like a natural polymer chain, each ''Chain'' consists of an arrangement of its basic unit, the ''Part'', a concept we have greatly extended. Parts include, but are not limited to, ''Biobrick Parts''.
| + | |
- | |rowspan="2"|[[Image:Ustcs cnmodel 0.jpg|thumb|400px|Figure 1: Logo of Chain-Node Model]]
| + | |
- | |-
| + | |
- | |The ''Node'' component has no natural counterpart. It is an abstract concept that describes the binding state of two or more parts: each binding creates a ''Node''. These abstract nodes may in turn bind with other parts or nodes to form a tree structure. However, a part or node that is already bound cannot bind again. With chains and nodes, it is possible to model a complex of arbitrary architecture. Simple and inaccurate modeling of biological processes cannot keep pace with the development of synthetic biology, and our ''Chain-Node'' model offers a possible solution to this imbalance.
| + | |
- | |-
| + | |
- | |A simple example, the tetR dimer, illustrates our modeling idea (TODO: build a tetR2 model to place on the right as an illustration; otherwise the right side is too empty). It has two chains, each containing only one part, tetR. Dimerization of tetR creates a node that indicates the bound state of the two parts. For clarity, bound parts are also treated as nodes, so in this example all nodes are organized in a tree with two child (leaf) nodes and one parent node. We follow this convention throughout our wiki.
| + | |
- | |[[Image:Ustcs cnmodel 0.jpg|thumb|400px|Figure 1: Logo of Chain-Node Model]]
| + | |
- | | + | |
- | |-
| + | |
- | |colspan="2"|To learn more, read this [https://2010.igem.org/Team:USTC_Software/MoDeL One-Minute Introduction] for an intuitive idea of our modeling system.
| + | |
- | |}
| + | |
- | <br />
| + | |
- | | + | |
- | === Modeling with Templates ===
| + | |
- | ----
| + | |
- | {| cellpadding="10" cellspacing="0"
| + | |
| |- | | |- |
- | |Manual modeling of biological systems is widely used in synthetic biology, but it requires an overall understanding of the biological network. Even professionals find it difficult to provide such a large amount of data. The underlying reason is that manually provided data are redundant, because different reactions may occur through different mechanisms. For this reason, we have been looking for feasible ways to implement automatic modeling. ''Automatic'' does not mean modeling without any information provided; rather, there is a minimal data set that makes the automation possible. That minimal data set is the set of ''Templates''. As in the C++ programming language, we introduce templates to allow a generic description of species and reactions that share a certain structural pattern or reaction mechanism. | + | |style="padding: 20px 20px 20px 20px;"| |
- | | + | =Overall Project= |
- | There are two kinds of templates: species templates and reaction templates. A <font color='red'>template species</font> behaves like a species, except that it can contain unknown parts of many different types. In other words, a species template represents a family of species. To apply this idea, we designed a special part, ''ANY'', of class [https://2010.igem.org/Team:USTC_Software/model_features '''Substituent'''], to represent unknown parts of any length on one chain. For example, a species with structure ''ANY'' - pTetR - ''ANY'' represents any species that contains the part pTetR. A <font color='red'>reaction template</font> provides a specification for generating reactions that share the same mechanism. The species in a reaction template are all templates, too. This is easier to understand if the known parts of a species template are viewed as functional groups: a reaction template describes the interaction mechanism of these functional groups. For example (TODO: make a suitable example), the pTetR promoter is deactivated in the presence of the TetR dimer, which usually occupies the RNA polymerase binding site of the pTetR sequence. The template species are the pTetR template and the TetR dimer template (see Figure), and the functional groups are the pTetR promoter and the TetR dimer. Any pair of species in which one contains pTetR DNA and the other contains the TetR protein dimer will bind as described by this reaction template. Modeling with templates lets users define species and reactions only once for a given family, without rewriting them in the database.
| + | |
- | | + | |
- | |[[Image:Ustcs Template ptetr.jpg|thumb|400px|Figure 2: Template of pTetR DNA]] | + | |
- | |}
| + | |
- | <br />
| + | |
- | | + | |
- | === Automatic Modeling Database Language === | + | |
- | ----
| + | |
| {| cellpadding="10" cellspacing="0" | | {| cellpadding="10" cellspacing="0" |
| |- | | |- |
- | |The data used for automatic modeling should be well organized. For this purpose, we constructed a database in a unified, machine-readable format. Since everyone is allowed to add, delete, or modify data stored in the database, each component of the database must follow a clearly specified format, which defines a standard database language. We call it [https://2010.igem.org/Team:USTC_Software/model_features '''MoDeL''']''': Mo'''deling '''D'''atabas'''e''' '''L'''anguage, picking out characters from the three words. MoDeL is based on XML, which makes it flexible and extensible. For the full specification of MoDeL, click [https://2010.igem.org/Team:USTC_Software/model_lang here]. | + | |The ultimate goal of the USTC_Software 2010 team is to promote synthetic biology throughout the world. To attract people without a biology background to this area, we plan to develop a modeling-and-simulation game designed specifically for synthetic biology. Users learn the basics of the field by constructing genetic circuits as input to our software and trying to understand the system behavior it outputs. This is the basic functional module. We also plan to develop a rating system that grades users' designs for given tasks, and a friendlier game interface that makes the input process easy and enjoyable. Though more functions are planned, we focus on the basic module, modeling and simulation, in the first stage of our project. |
- | |rowspan="5"|[[Image:Ustcs dblang.jpg|thumb|400px|Figure 3: A peek at our database]]
| + | |
| |- | | |- |
- | |<br style="clear:both;"> | + | |In practice, many CAD (Computer-Aided Design) tools, such as ''TinkerCell'' and ''SynBioSS'', have been developed to model and simulate biological systems and to report system behavior as a guide for design. However, they all require users to provide details of the system network, such as the activation and repression of transcription and translation reactions in a genetic regulatory network. Even professionals find it difficult to construct a detailed network of a complex system from memory alone. In this sense, modeling through CAD tools does not reduce the work of modeling: users still have to model manually and enter their model in detail. |
| |- | | |- |
- | |<br style="clear:both;"> | + | |However, since our software is developed for users without a biology background, it is unrealistic to expect them to model their designs manually. To solve this problem, the USTC_Software 2010 team attempts to take synthetic biology modeling one step further by introducing new methods for automatic modeling of biological systems. As the word ''automation'' implies, users only need to submit their assembly of parts, and our program generates the biological model automatically. As the first team attempting to develop an automatic modeling tool for synthetic biology, we focus on genetic regulatory networks in the first year and develop our software tool, '''iGaME''', which will assist the design of genetic function modules for biological systems in synthetic biology. We believe this will greatly relieve users from handling the complex interactions of species in a biological system. |
| |- | | |- |
- | |<br style="clear:both;"> | + | |Several novel concepts were proposed during our development. The first is our <font size="5">[[Team:USTC_Software/model_features|''Chain-Node'']]</font> model for complex structures with multiple chains bound together. It is necessary for automation because the behavior of a species (such as how it reacts with other species) must be determined by its structure rather than its name: it is impossible to construct a universal name-based reaction database that applies to different systems. The next is the <font size="5">[[Team:USTC_Software/model_features#Modeling_with_Templates|''Template'']]</font>. A species with a certain structure is a template species, and a reaction occurring between several template species is a template reaction. The introduction of the ''Template'' makes it possible to describe a group of reactions that share the same structure-determined reaction mechanism. Finally, based on our ''Chain-Node'' and ''Template'' concepts, we propose an XML-based <font size="5">[[Team:USTC_Software/model_features#A_Standard_Language|''Standard Biological Part Automatic Modeling Database Language'']] </font>(''MoDeL'' for short) to fully characterize ''Species'' and ''Reaction'' in templates, with clear definitions of elements and attributes in XML fashion. It is a database language for the next generation, when automatic modeling is widely used throughout the world. To learn more, read this <font size="5">[[Team:USTC_Software/MoDeL| ''One-Minute Introduction'' ]]</font> for an intuitive idea of our modeling system. |
| |- | | |- |
- | |<br style="clear:both;"> | + | |Although we did not have enough time to achieve our ultimate goal completely, we have successfully developed a ''MoDeL''-based automatic modeling and simulation software. Our program has three major components. The first component is the <font size="5">[[Team:USTC_Software/User_Interface| ''User Interface'' ]]</font>. Users assemble parts by drag-and-drop and set up initial conditions there, and the system behavior is displayed there as output. The second component is the database written in <font size="5">[[Team:USTC_Software/MoDeL| ''MoDeL'' ]]</font>, which is the kernel of our automatic modeling idea. The last component is our <font size="5">[[Team:USTC_Software/Simulation| ''Core Program'' ]]</font>, designed to support the ''MoDeL'' language. It functions as a driver: it completes the system network based on the user's input and the data stored in our database, and gives dynamic analysis as output. |
| |} | | |} |
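
The ''Chain-Node'' and ''Template'' ideas described above can be sketched in a few lines of Python. This is only an illustration, not the team's implementation and not MoDeL syntax. The class names <code>Node</code> and <code>Species</code>, the function <code>chain_matches</code>, and the part names <code>RBS</code>, <code>gfp</code> and <code>pLac</code> are hypothetical. The sketch also assumes that ''ANY'' may match zero parts, which the text does not state explicitly.

```python
from dataclasses import dataclass, field
from typing import List

# Wildcard part standing for unknown parts on one chain.
# Assumption: it may match zero or more parts.
ANY = "ANY"


@dataclass
class Node:
    """One node of a binding tree: a leaf is a bound part,
    an inner node is the binding of its children."""
    label: str
    children: List["Node"] = field(default_factory=list)

    def leaves(self) -> List["Node"]:
        """Return the bound parts (leaf nodes) under this node."""
        if not self.children:
            return [self]
        found: List["Node"] = []
        for child in self.children:
            found.extend(child.leaves())
        return found


@dataclass
class Species:
    """A species as chains of part names plus the roots of its binding trees."""
    chains: List[List[str]]
    trees: List[Node] = field(default_factory=list)


def tetr_dimer() -> Species:
    """tetR dimer: two one-part chains joined by a single parent node."""
    parent = Node("tetR2", [Node("tetR"), Node("tetR")])
    return Species(chains=[["tetR"], ["tetR"]], trees=[parent])


def chain_matches(pattern: List[str], chain: List[str]) -> bool:
    """Check whether a chain fits a template pattern that may contain ANY."""
    if not pattern:
        return not chain
    head, rest = pattern[0], pattern[1:]
    if head == ANY:
        # Let ANY absorb 0, 1, 2, ... leading parts and match the remainder.
        return any(chain_matches(rest, chain[i:]) for i in range(len(chain) + 1))
    return bool(chain) and chain[0] == head and chain_matches(rest, chain[1:])
```

Under these assumptions, the pattern <code>[ANY, "pTetR", ANY]</code> accepts any chain that contains pTetR, which is the behavior the template species ''ANY'' - pTetR - ''ANY'' is meant to capture. A real matcher would also have to compare the binding trees, which this sketch leaves out.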