Export Publication

The publication can be exported in the following formats: APA (American Psychological Association) reference format, IEEE (Institute of Electrical and Electronics Engineers) reference format, BibTeX and RIS.

Export Reference (APA)
Amorim, M. J. & Cardoso, M. G. M. S. (2015). Clustering stability and ground truth: numerical experiments. International Journal of Artificial Intelligence and Knowledge Discovery. 5 (4), 27-33
Export Reference (IEEE)
M. J. Amorim and M. M. Cardoso,  "Clustering stability and ground truth: numerical experiments", in Int. Journal of Artificial Intelligence and Knowledge Discovery, vol. 5, no. 4, pp. 27-33, 2015
Export BibTeX
@article{amorim2015_1732204989813,
	author = "Amorim, M. J. and Cardoso, M. G. M. S.",
	title = "Clustering stability and ground truth: numerical experiments",
	journal = "International Journal of Artificial Intelligence and Knowledge Discovery",
	year = "2015",
	volume = "5",
	number = "4",
	pages = "27-33",
	url = "http://www.journals.rgsociety.org/index.php/ijai/article/view/728"
}
Export RIS
TY  - JOUR
TI  - Clustering stability and ground truth: numerical experiments
T2  - International Journal of Artificial Intelligence and Knowledge Discovery
VL  - 5
IS  - 4
AU  - Amorim, M. J.
AU  - Cardoso, M. G. M. S.
PY  - 2015
SP  - 27-33
SN  - 2231-2021
UR  - http://www.journals.rgsociety.org/index.php/ijai/article/view/728
AB  - Stability has been considered an important property for evaluating clustering solutions. Nevertheless, there are no conclusive studies on the relationship between this property and the capacity to recover clusters inherent to data (“ground truth”). This study focuses on this relationship, resorting to experiments on synthetic data generated under diverse scenarios (controlling relevant factors) and experiments on real data sets. Stability is evaluated using a weighted cross-validation procedure. Indices of agreement (corrected for agreement by chance) are used both to assess stability and external validity. The results obtained reveal a new perspective so far not mentioned in the literature. Despite the clear relationship between stability and external validity when a broad range of scenarios is considered, the within-scenarios conclusions deserve our special attention: faced with a specific clustering problem (as we do in practice), there is no significant relationship between clustering stability and the ability to recover data clusters
ER  -