Team:Paris Liliane Bettencourt/Project/SIP/Downloads

From 2010.igem.org


Revision as of 20:02, 27 October 2010



SIP Wiki Analyser : Downloads





Team List

Wiki Data
  • [http://www.lsdlive.org/misc/wdata_2007.zip Wiki data 2007 (ZIP)]
  • [http://www.lsdlive.org/misc/wdata_2008.zip Wiki data 2008 (ZIP)]
  • [http://www.lsdlive.org/misc/wdata_2009.zip Wiki data 2009 (ZIP)]
SIP Database
  • SIP words database 2007 (SQLITE3)
  • [http://www.lsdlive.org/misc/wsip_2008.db.zip SIP words database 2008 (SQLITE3)]
  • [http://www.lsdlive.org/misc/wsip_2009.db.zip SIP words database 2009 (SQLITE3)]

To read the databases, use [http://www.sqlite.org/ sqlite3].
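
The table layout of the SIP databases is not documented on this page, so a reasonable first step is to list the tables and columns they contain. The following is a minimal sketch using Python's standard sqlite3 module. The function name list_tables and the file name wsip_2009.db (the assumed result of unzipping wsip_2009.db.zip) are illustrative assumptions, not part of SIP.

```python
import sqlite3


def list_tables(db_path):
    """Return {table name: [column names]} for every table in a SQLite file.

    Hypothetical helper: it makes no assumption about the SIP schema,
    it only reports whatever tables the file contains.
    """
    conn = sqlite3.connect(db_path)
    try:
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master "
                "WHERE type = 'table' ORDER BY name"
            )
        ]
        schema = {}
        for name in names:
            # Quote the identifier so unusual table names are handled safely.
            quoted = '"' + name.replace('"', '""') + '"'
            # PRAGMA table_info yields one row per column; index 1 is its name.
            columns = conn.execute(f"PRAGMA table_info({quoted})").fetchall()
            schema[name] = [column[1] for column in columns]
        return schema
    finally:
        conn.close()


# Example call (assumed file name, adjust to the file you unzipped):
#     for table, columns in list_tables("wsip_2009.db").items():
#         print(table, columns)
```

Once the table and column names are known, ordinary SELECT queries can be run with the same connection API or from the sqlite3 command-line shell.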

Warning: these files were generated using "links -dump" to strip the HTML, which speeds up processing. This step is optional, because SIP removes the HTML later anyway. With links, some pages whose names contain special characters such as '(', ')' and ':' are not converted. We do not consider this very important, because it affects only a small number of pages, but you can regenerate the database without the HTML parsing step.
You can also use html2text, but if it encounters special characters, it does not remove the HTML.
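
For anyone regenerating the data, the "links -dump" step mentioned in the warning could be driven from a script roughly as follows. This is only a sketch: it assumes the links browser is installed and on the PATH, and the helper names links_dump_command and html_to_text are hypothetical, not part of SIP.

```python
import subprocess


def links_dump_command(source):
    """Build the argument list for `links -dump <source>`.

    `source` is a local HTML file or a URL. Hypothetical helper.
    """
    return ["links", "-dump", source]


def html_to_text(source):
    """Run links and return its plain-text rendering of the page.

    Raises subprocess.CalledProcessError if links exits with an error,
    and FileNotFoundError if links is not installed.
    """
    result = subprocess.run(
        links_dump_command(source),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Example call (assumed file name):
#     text = html_to_text("Team_Paris.html")
```

The arguments are passed as a list, so no shell is involved in interpreting the file name. Whether that changes the behaviour for page names containing '(', ')' or ':' has not been checked here.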


Notes about filters: these files are unfiltered, so you can apply whatever filters you want: remove common nouns, keep only [http://www.nlm.nih.gov/mesh/ MeSH] terms, etc. Choose what suits your needs!
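
As an illustration of such filters, here is a minimal sketch that drops words from an exclusion list and optionally keeps only words from an allowed vocabulary (for example, a list of MeSH terms you have prepared yourself). The function filter_words, the input format (a word-to-count dictionary) and the example word sets are all assumptions made for illustration. They do not reflect how the SIP databases are actually structured.

```python
def filter_words(counts, stop_words=frozenset(), allowed=None):
    """Return a filtered copy of a {word: count} dictionary.

    stop_words: lowercase words to drop (e.g. common words).
    allowed:    if given, keep only words whose lowercase form is in this
                set (e.g. a vocabulary such as MeSH terms).
    Matching is case-insensitive; the original spelling is kept.
    """
    kept = {}
    for word, count in counts.items():
        key = word.lower()
        if key in stop_words:
            continue
        if allowed is not None and key not in allowed:
            continue
        kept[word] = count
    return kept


# Made-up example data, for illustration only.
counts = {"the": 10, "Plasmid": 3, "robot": 1}
without_common = filter_words(counts, stop_words={"the"})
vocabulary_only = filter_words(counts, allowed={"plasmid"})
```

Both filters can be combined in a single call by passing stop_words and allowed together.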