PA-GOSUB: A searchable Database of Model Organism Protein Sequences With Their Predicted GO Molecular Function and Subcellular Localization

Full Text: 2005-Paper-Lu-PA-GOSUB-NARDB.pdf (PDF)

PA-GOSUB (Proteome Analyst: Gene Ontology Molecular Function and Subcellular Localization) is a publicly available, web-based, searchable and downloadable database that contains the sequences, predicted GO molecular functions and predicted subcellular localizations of more than 107 000 proteins from 10 model organisms (and growing), covering the major kingdoms and phyla for which annotated proteomes exist (http://www.cs.ualberta.ca/~bioinfo/PA/GOSUB). The PA-GOSUB database effectively expands the coverage of subcellular localization and GO function annotations by a significant factor (already over five for subcellular localization, compared with Swiss-Prot v42.7), and more model organisms are being added to PA-GOSUB as their sequenced proteomes become available. PA-GOSUB can be used in three main ways. First, a researcher can browse the pre-computed PA-GOSUB annotations on a per-organism and per-protein basis using annotation-based and text-based filters. Second, a user can perform BLAST searches against the PA-GOSUB database and use the annotations from the homologs as simple predictors for the new sequences. Third, the whole of PA-GOSUB can be downloaded in either FASTA or comma-separated values (CSV) formats.
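The second mode of use, taking annotations from BLAST homologs as simple predictors for a new sequence, can be illustrated with a minimal sketch. It assumes the BLAST results are in the standard 12-column tabular format (`-outfmt 6`) and that the PA-GOSUB annotations have already been loaded into a dictionary keyed by protein identifier. The function names, the dictionary layout, the best-hit rule and the E-value cutoff are all hypothetical choices made for illustration; they are not taken from the paper or from the PA-GOSUB site.

```python
import csv
import io

# Column order of standard BLAST tabular output (-outfmt 6).
BLAST_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def best_hits(blast_tsv, max_evalue=1e-10):
    """Map each query ID to the subject ID of its highest-bitscore hit.

    Hits with an E-value above max_evalue (an assumed cutoff) are ignored.
    """
    best = {}
    reader = csv.DictReader(
        io.StringIO(blast_tsv), fieldnames=BLAST_COLUMNS, delimiter="\t"
    )
    for row in reader:
        if float(row["evalue"]) > max_evalue:
            continue
        query, score = row["qseqid"], float(row["bitscore"])
        if query not in best or score > best[query][1]:
            best[query] = (row["sseqid"], score)
    return {query: subject for query, (subject, _) in best.items()}


def transfer_annotations(blast_tsv, annotations, max_evalue=1e-10):
    """Predict annotations for each query by copying those of its best hit.

    `annotations` is a hypothetical {protein_id: annotation} mapping built
    from a database download; queries without a usable hit are omitted.
    """
    hits = best_hits(blast_tsv, max_evalue)
    return {q: annotations[s] for q, s in hits.items() if s in annotations}
```

Copying from the single best hit is only one possible rule; a vote over several homologs would be an equally reasonable reading of "simple predictors".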

Citation

P. Lu, D. Szafron, R. Greiner, D. Wishart, A. Fyshe, B. Pearcy, B. Poulin, R. Eisner, D. Ngo, N. Lamb. "PA-GOSUB: A searchable Database of Model Organism Protein Sequences With Their Predicted GO Molecular Function and Subcellular Localization". Nucleic Acids Research (NAR), 33 (Database issue), pp. D147--D153, January 2005.

Keywords: bioinformatics, proteome analyst, database, machine learning, medical informatics
Category: In Journal

BibTeX

@article{Lu+al:NAR05,
  author = {Paul Lu and Duane Szafron and Russ Greiner and David S. Wishart and
    Alona Fyshe and Brandon Pearcy and Brett Poulin and Roman Eisner and D Ngo
    and N Lamb},
  title = {PA-GOSUB: A searchable Database of Model Organism Protein Sequences
    With Their Predicted GO Molecular Function and Subcellular Localization},
  volume = {33 (Database issue)},
  pages = {D147--D153},
  journal = {Nucleic Acids Research (NAR)},
  year = 2005,
}

Last Updated: October 13, 2013
Submitted by Russ Greiner