Not Logged In

An optimized cost-effective SVM for imbalanced data learning

Full Text: pakdd13-1.pdf PDF

Class imbalance is one of the challenging problems for machine learning in many real-world applications. Cost-sensitive learning has attracted significant attention in recent years to solve the problem, but it is difficult to determine the precise misclassification costs in practice. There are also other factors that influence the performance of the classification including the input feature subset and the intrinsic parameters of the classifier. This paper presents an effective wrapper framework incorporating the evaluation measure (AUC and G-mean) into the objective function of cost sensitive SVM directly to improve the performance of classification by simultaneously optimizing the best pair of feature subset, intrinsic parameters and misclassification cost parameters. Experimental results on various standard benchmark datasets and real-world data with different ratios of imbalance show that the proposed method is effective in comparison with commonly used sampling techniques.

Citation

P. Cao, D. Zhao, O. Zaiane. "An optimized cost-effective SVM for imbalanced data learning". Proceeding of the Pacific Asia Conference on Knowledge Discovery and Data Mining, (ed: Pei, J., Tseng, V.S., Cao, L., Motoda, H., Xu, G.), pp 280-292, April 2013.

Keywords:  
Category: In Conference
Web Links: Springer Link

BibTeX

@incollection{Cao+al:PAKDD13,
  author = {Peng Cao and Dazhe Zhao and Osmar R. Zaiane},
  title = {An optimized cost-effective SVM for imbalanced data learning},
  Editor = {Pei, J., Tseng, V.S., Cao, L., Motoda, H., Xu, G.},
  Pages = {280-292},
  booktitle = {Proceeding of the Pacific Asia Conference on Knowledge Discovery
    and Data Mining},
  year = 2013,
}

Last Updated: January 13, 2020
Submitted by Sabina P

University of Alberta Logo AICML Logo