Similarity Matrix of Proteins

Similarity Matrix of Proteins (SIMAP) is a database of protein similarities created using volunteer computing.^[1]^[2] It is freely accessible for scientific purposes. SIMAP uses the FASTA algorithm to precalculate protein similarity, while another application uses hidden Markov models to search for protein domains. SIMAP is a joint project of the Technical University of Munich, the Helmholtz Zentrum München, and the University of Vienna.

Project

The project usually got new work units at the beginning of each month. More recently, (2010), inclusion of environmental sequences into the database has required longer periods of activity, several months of continuous work for example. Typically, these updates occurred twice each year.^{[citation needed]}

In the fourth quarter of 2010, the project relocated to the University of Vienna due to the failing electrical infrastructure at the Technical University of Munich. Part of this exercise involved the creation of a project specific URL requiring existing volunteers and users to detach/reattach to the project.

On May 30, 2014, it was announced by project administrators that after a 10-year history, SIMAP would be leaving BOINC by the end of 2014. SIMAP research, however, will go forward with the use of local hardware consisting of "ordinary multi-core CPUs (some hundreds), crunching a SSE-optimized version of the Smith-Waterman algorithm."

Computing platform

SIMAP used the Berkeley Open Infrastructure for Network Computing (BOINC) distributed computing platform.

Application performance notes

Work unit CPU times varied widely, ranging between 15 minutes and 3 hours. Work units varied in size from 1.5 to 2.2 MB each, averaging around 2 MB. SIMAP provided client software optimized for SSE enabled processors and x86-64 processors. For older processors non SSE applications are provided but require manual installation steps to be taken. Operating Systems supported by SIMAP are Linux, Windows, Mac OS, Android, and other UNIX platforms. Since the database had sometimes been completed with all publicly known protein sequences and metagenomes having been precalculated by the project, the work available consisted of newly published protein sequences and metagenomes that needed to be precomputed for SIMAP.

References

^ Arnold, R.; Rattei, T.; Tischler, P.; Truong, M.-D.; Stümpflen, V.; Mewes, H. W. (2005). "SIMAP--The similarity matrix of proteins". Bioinformatics. 21 (Suppl 2): ii42 – ii46. doi:10.1093/bioinformatics/bti1107. ISSN 1367-4803. PMC 1347468. PMID 16204123.
^ Rattei, T.; Arnold, R.; Tischler, P.; Lindner, D.; Stümpflen, V.; Mewes, H. W. (2006). "SIMAP: the similarity matrix of proteins". Nucleic Acids Research. 34 (90001): D252 – D256. doi:10.1093/nar/gkj106. ISSN 0305-1048. PMC 1347468. PMID 16381858.

External links

Official website

This scientific software article is a stub. You can help Wikipedia by expanding it.

[ArnoldRattei2005-1] Arnold, R.; Rattei, T.; Tischler, P.; Truong, M.-D.; Stümpflen, V.; Mewes, H. W. (2005). "SIMAP--The similarity matrix of proteins". Bioinformatics. 21 (Suppl 2): ii42 – ii46. doi:10.1093/bioinformatics/bti1107. ISSN 1367-4803. PMC 1347468. PMID 16204123.

[Rattei2006-2] Rattei, T.; Arnold, R.; Tischler, P.; Lindner, D.; Stümpflen, V.; Mewes, H. W. (2006). "SIMAP: the similarity matrix of proteins". Nucleic Acids Research. 34 (90001): D252 – D256. doi:10.1093/nar/gkj106. ISSN 0305-1048. PMC 1347468. PMID 16381858.

[1]

[2]

v t e Berkeley Open Infrastructure for Network Computing (BOINC) projects
Active	Amicable Numbers Asteroids@home climateprediction.net Collatz Conjecture Einstein@Home Gerasim@home GPUGRID.net iThena LHC@home LODA MilkyWay@home Minecraft@home MindModeling@Home Moo! Wrapper NFS@Home NumberFields@home ODLK ODLK1 PrimeGrid QuChemPedIA@home RakeSearch Ramanujan Machine Rosetta@home SIDock@home SRBase Universe@Home World Community Grid (subprojects Clean Energy Project, Discovering Dengue Drugs – Together, FightAIDS@Home, Fiocruz Genome Comparison Project, Help Defeat Cancer, Help Conquer Cancer, Help Cure Muscular Dystrophy, Human Proteome Folding Project, Help Fight Childhood Cancer, Smash Childhood Cancer) WUProp@Home yoyo@home
Beta	RNA World (beta) WEP-M+2 Project
Alpha	nanoHUB@home RADIOACTIVE@HOME RALPH@home YAFU
Technology, tools	BOINC client–server technology BOINC Credit System Gridcoin Charity Engine GridRepublic Science United
Terminated or inactive	ABC@Home AQUA@home Artificial Intelligence System BBC Climate Change Experiment Big and Ugly Rendering Project CAS@home Cell Computing Citizen Science Grid Correlizer Cosmology@Home DistrRTgen Docking@Home EDGeS@Home Enigma@Home eOn Evolution@Home (yoyo@home subproject) FreeHAL HashClash Ibercivis Kryptos@Home The Lattice Project Leiden Classical uFluids@Home Malaria Control Project MLC@Home OProject@Home orbit@home POEM@Home Pirates@Home Predictor@home proteins@home Riesel Sieve (merged with PrimeGrid) QMC@Home SAT@home Seasonal Attribution Project SETI@home (subproject Astropulse) SETI@home beta SIMAP SLinCA@Home Spinhenge@home SZTAKI Desktop Grid TANPAKU theSkyNet TN-Grid VGTU@Home XtremLab

Project

Computing platform

Application performance notes

See also

References

External links