Not Logged In

Protein Subcellular Localization Prediction with Associative Classification and Multi-class SVM

Full Text: acmbcb11-short.pdf PDF

Protein subcellular localization prediction is the problem of predicting where a protein functions within a living cell. In this paper, we apply associative classifications (CMAR and CPAR) and multi-class Support Vector Machines to tackle the problem of protein subcellular localization prediction. We use classification feature sources generated from a protein’s SwissProt annotation record. We visualize the applied classification rules in an explain graph for domain experts to interpret. We compare the performance of our approaches to those of Proteome Analyst 3.0, using the same set of classification features; we find that all three classification algorithms outperform Proteome Analyst. Multi-class SVM achieves overall F-measures [0.934 ∼ 0.991], while CPAR and CMAR achieve overall F-measures [0.922 ∼ 0.989] and [0.880 ∼ 0.989], respectively. Our result shows that despite multi-class SVM is still the most accurate prediction algorithm with overall F-measures, CPAR and CMAR achieve very similar accuracy. In most cases, CPAR outperforms CMAR, especially when the feature space is large. Our result indicates that associative classification algorithms, especially CPAR, is a good alternative to SVM with similar accuracy but much better transparency in classification models.

Citation

Y. Liu, Z. Guo, X. Ke, O. Zaiane. "Protein Subcellular Localization Prediction with Associative Classification and Multi-class SVM". ACM Conference on Bioinformatics, Computational Biology and Biomedicine, Chicago, United States, pp 493-495, August 2011.

Keywords: protein annotation, associative classification, support vector machine, text mining, bioinformatics
Category: In Conference
Web Links: ACM Digital Library

BibTeX

@incollection{Liu+al:ACMBCB11,
  author = {Yifeng Liu and Zhaochen Guo and Xiaodi Ke and Osmar R. Zaiane},
  title = {Protein Subcellular Localization Prediction with Associative
    Classification and Multi-class SVM},
  Pages = {493-495},
  booktitle = {ACM Conference on Bioinformatics, Computational Biology and
    Biomedicine},
  year = 2011,
}

Last Updated: January 14, 2020
Submitted by Sabina P

University of Alberta Logo AICML Logo