Comparing partitions
@article{Hubert1985ComparingP, title={Comparing partitions}, author={Lawrence J. Hubert and Phipps Arabie}, journal={Journal of Classification}, year={1985}, volume={2}, pages={193-218}, url={https://api.semanticscholar.org/CorpusID:189915041} }
A measure based on the comparison of object triples having the advantage of a probabilistic interpretation in addition to being corrected for chance is proposed and bounded between ยฑ1.5 and ยฑ2.5.
2,067 Citations
sARI: a soft agreement measure for class partitions incorporating assignment probabilities
- 2018
Computer Science, Mathematics
An extension of the ARI, the soft adjusted Rand index (sARI), with similar intuition and interpretation but also incorporating information from one or two soft partitions is proposed, supporting the intuition that in general, mapping to hard partitions tends to increase the measure of similarity between partitions.
Investigating cluster validation metrics for optimal number of clusters determination
- 2021
Computer Science, Mathematics
An extensive set of well-known validity indices, based on the approach of the so-called relative criteria, are examined comparatively and a total of 26 cluster validation measures were investigated in two distinct case studies: one in real-world and one in artificially generated data.
An Extension of the Infinite Relational Model Incorporating Interaction between Objects
- 2013
Computer Science, Mathematics
This paper proposes an extension of the IRM by introducing a subset mechanism that selects a part of the data according to the interaction among objects and presents posterior probabilities for running collapsed Gibbs sampling to learn the model from the given data.
Comparison of modularity-based approaches for nodes clustering in hypergraphs
- 2024
Computer Science, Mathematics
A comparative analysis of the performance of modularity-based methods for clustering nodes in binary hypergraphs using different quality measures, including true clustering recovery, running time, (local) maximization of the objective, and the number of clusters detected.
A modification of the k-means method for quasi-unsupervised learning
- 2013
Computer Science, Mathematics
Unsupervised extra trees: a stochastic approach to compute similarities in heterogeneous data
- 2020
Computer Science
The empirical study shows that the approach based on UET outperforms existing methods in some cases and reduces the amount of preprocessing typically needed when dealing with real-world datasets.
Adaptive clustering with a variance-aware criterion: An alternative to k-means
- 2016
Computer Science, Mathematics
A generalized version of the incremental one-by-one clustering algorithm of MacQueen (1967) is introduced and evaluated that is suitable for general variance-based criteria, whether convex or not and performs remarkably well with a variety of heterogeneous data sets.
Suboptimal Comparison of Partitions
- 2019
Computer Science, Mathematics
It is concluded that no standard measures of either internal or external validation can guarantee a correspondence with optimal clustering performance, and recommendations are provided for the suboptimal evaluation of clusteringperformance.
CENTRAL PARTITION FOR A PARTITION- DISTANCE AND STRONG PATTERN GRAPH
- 2004
Computer Science, Mathematics
This paper considers the problem of finding a consensus partition between the set of these partitions, called central partition, and defines a new graph where the nodes are the strong patterns, by using the concept of strong patterns.