Not Logged In

Communities Validity: Methodical Evaluation of Community Mining Algorithms

Full Text: SNAMjournal13.pdf PDF

Grouping data points is one of the fundamental tasks in data mining, which is commonly known as clustering if data points are described by attributes. When dealing with interrelated data that is represented in the form of nodes and their relationships and the grouping is based on these relationships but not the node attributes, this task is also referred to as community mining. There has been a considerable number of approaches proposed in recent years for mining communities in a given network. However, little work has been done on how to evaluate the community mining algorithms. The common practice is to evaluate the algorithms based on their performance on standard benchmarks for which we know the ground-truth. This technique is similar to external evaluation of attribute-based clustering methods. The other two well-studied clustering evaluation approaches are less explored in the community mining context; internal evaluation to statistically validate the clustering result, and relative evaluation to compare alternative clustering results. These two approaches enable us to validate communities discovered in a real world application, where the true community structure is hidden in the data. In this article, we investigate different clustering quality criteria applied for relative and internal evaluation of clustering data points with attributes, and also different clustering agreement measures used for external evaluation; and incorporate proper adaptations to make them applicable in the context of interrelated data. We further compare the performance of the proposed adapted criteria in evaluating community mining results in different settings through extensive set of experiments.

Citation

R. Rabbany, M. Takaffoli, J. Fagnan, O. Zaiane, R. Campello. "Communities Validity: Methodical Evaluation of Community Mining Algorithms". Social Network Analysis and Mining, 3(4), pp 1039-1062, December 2013.

Keywords: Evaluation Approaches , Quality Measures, Clustering Evaluation, Clustering Objective Function , Community Mining
Category: In Journal
Web Links: Webdocs

BibTeX

@article{Rabbany+al:13,
  author = {Reihaneh Rabbany and Mansoreh Takaffoli and Justin Fagnan and Osmar
    R. Zaiane and Ricardo J. G. B. Campello},
  title = {Communities Validity: Methodical Evaluation of Community Mining
    Algorithms},
  Volume = "3",
  Number = "4",
  Pages = {1039-1062},
  journal = {Social Network Analysis and Mining},
  year = 2013,
}

Last Updated: October 31, 2019
Submitted by Sabina P

University of Alberta Logo AICML Logo