Mojena and Wishart criteria are methods of selecting the optimal grouping result of agglomerative cluster analysis methods (hierarchical). Two criteria were proposed by Mojena in the 70’s of the 20th century: the upper tail rule and moving average quality control rule, both based on an analysis of the fusion levels of objects in the dendrogram with the aim to determine the cut-off point of it, i.e. to choose the optimal clustering result. The third criterion: tree validation was created by Wishart and evaluates the randomness of the objects clustering in the dendrogram. The purpose of this paper is to present the results of the empirical analysis of the effectiveness of Mojena and Wishart criteria for the number of clusters selection, in comparison to other applicable criteria in this area, including those proposed by: Baker and Hubert, Calinski and Harabasz, Davies and Bouldin, Hubert and Levine. The empirical analysis has been carried out in ClustanGraphics 8 Program and selected packages in R environment for the generated data sets.
upper tail rule, moving average quality control rule, Mojena criteria, Wishart criterion (tree validation), ClustanGraphics 8.
GAN, G., MA C., WU, J., Data clustering: theory, algorithms, and applications, SIAM, Philadelphia 2007.
GATNAR, E., WALESIAK, M. (ed.), Statystyczna analiza danych z wykorzy staniem programu R, Wydawnictwo PWN, Warsaw 2009.
MIKULEC, A., Metody oceny wyniku grupowania w analizie skupień, [in:] Jajuga K., Walesiak M. (ed.), Taksonomia 19. Klasyfikacja i analiza danych –teoria i zastosowania, Wydawnictwo Uniwersytetu Ekonomicznego we Wrocławiu, Wrocław 2012.
MOJENA, R., Hierarchical grouping methods and stopping rules: an evaluation, „Computer Journal” 1977, vol. 20 (4), p. 359-363.
WISHART, D., Clustangraphics primer: a guide to cluster analysis, (the 4thedition), Edinburgh 2006.
www.clustan.com