Not Logged In

Cost sensitive Ranking Support Vector Machine for Multi-label Data Learning

Full Text: HIS2016_48.pdf PDF

Multi-label data classification has become an important and active research topic, where the classification algorithm is required to deal with prediction of sets of label indicators for instances simultaneously. Label powerset (LP) method reduces the multi-label classification problem to a single-label multi-class classification problem by treating each distinct combination of labels. However, the predictive performance of LP is challenged with imbalanced distribution among the labelsets, deteriorating the performance of traditional classifiers. In this paper, we study the problem of multi-label imbalanced data classification and propose a novel solution, called CSRankSVM (Cost sensitive Ranking Support Vector Machine), which assigns a different misclassification cost for each labelset to effectively tackle the problem of imbalance for Multilabel data. Empirical studies on popular benchmark datasets with various imbalance ratios of labelsets demonstrate that the proposed CSRankSVM approach can effectively boost classification performances in multi-label datasets.

Citation

P. Cao, X. Liu, D. Zhao, O. Zaiane. "Cost sensitive Ranking Support Vector Machine for Multi-label Data Learning". International Conference on Hybrid Intelligent Systems, Marrakech, Morocco, November 2016.

Keywords: Multi-label learning, Imbalanced data, Classification, Rank SVM
Category: In Conference
Web Links: Webdocs

BibTeX

@incollection{Cao+al:16,
  author = {Peng Cao and Xiaoli Liu and Danzhe Zhao and Osmar R. Zaiane},
  title = {Cost sensitive Ranking Support Vector Machine for Multi-label Data
    Learning},
  booktitle = {International Conference on Hybrid Intelligent Systems},
  year = 2016,
}

Last Updated: November 05, 2019
Submitted by Sabina P

University of Alberta Logo AICML Logo