Papers
Topics
Authors
Recent
Search
2000 character limit reached

Core Stability in Non-Centroid Clustering

Updated 1 December 2025
  • The paper introduces a formal framework defining the α-core using max-loss objectives to quantify cluster robustness in non-centroid settings.
  • It demonstrates impossibility results, showing that no k-clustering can achieve the 1-core under certain conditions, and provides tight α-bound analyses.
  • The study proposes algorithmic relaxations such as Fully Justified Representation and spectral stability methods to enhance practical robustness in clustering.

Core stability in non-centroid clustering refers to the robustness of cluster assignments under potential group deviations, quantified by whether a coalition of agents can jointly improve their losses by switching to a different clustering configuration. In non-centroid clustering, loss is not determined by distance to a representative point (centroid), but by some function of pairwise distances or graph interactions among cluster members. Formal definitions, impossibility results, algorithmic frameworks, and empirical findings demonstrate the complexity and limitations of achieving core stability, especially under the max-loss objective, where cluster assignment is determined by the worst pairwise distance within a cluster.

1. Formal Framework of Core Stability in Non-Centroid Clustering

A finite metric space (N,d)(N, d) consists of nn agents with a symmetric distance function dd. A non-centroid kk-clustering defines a partition C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}, where Cj≠∅C_j \neq \emptyset and ⋃jCj=N\bigcup_j C_j = N. The max-loss objective assigns to agent i∈Ni \in N in cluster S⊆NS \subseteq N the loss

$\loss_i(S) = \max_{j \in S} d(i, j).$

For clustering nn0, we write nn1, where nn2 is the cluster containing nn3.

nn4-blocking coalition: Subset nn5 of size nn6 nn7-blocks nn8 if for every nn9,

dd0

dd1-core: dd2 is in the dd3-core if there is no dd4-blocking coalition of size dd5. The dd6-core is referred to as the core (Caragiannis et al., 2024, Bredereck et al., 24 Nov 2025).

2. Impossibility Theorems and Core Emptiness

For dd7 and dd8 divisible by dd9, there exist metric instances such that no kk0-clustering lies in the kk1-core for any kk2. The construction, detailed in (Bredereck et al., 24 Nov 2025), uses specially structured coalitions kk3 achieving distinct internal max-losses:

  • kk4
  • kk5
  • kk6
  • kk7
  • kk8

Any clustering forces at least one such coalition to strictly improve by factor kk9, rendering the core empty. The bound is tight: at C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}0, cluster assignments can precisely meet all coalition thresholds.

A computer-assisted construction for 2D Euclidean space (C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}1, C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}2) yields a lower bound C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}3 where no core clustering exists. This demonstrates that the core (1-core) can be empty for non-centroid, max-loss clustering—a result previously unresolved (Bredereck et al., 24 Nov 2025).

3. Algorithmic Relaxations: FJR and Approximate Cores

Given the restrictive nature and possible emptiness of the core, Fully Justified Representation (FJR) provides a relaxation. A clustering C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}4 is in the C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}5-FJR if no coalition C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}6 of size C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}7 can reduce everyone's loss below C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}8 by a factor more than C={C1,…,Ck}\mathcal C = \{C_1, \dots, C_k\}9:

Cj≠∅C_j \neq \emptyset0

Algorithmically, the GreedyCohesiveClustering framework (using exact or approximate oracle for cohesive cluster selection) constructs clusterings satisfying FJR precisely or up to a constant factor. The efficient GreedyCapture algorithm achieves:

  • Cj≠∅C_j \neq \emptyset1-core (max-loss), Cj≠∅C_j \neq \emptyset2-FJR (average loss), running in Cj≠∅C_j \neq \emptyset3 time
  • Core and FJR violation metrics computed in practice reveal GreedyCapture is consistently fairer than Cj≠∅C_j \neq \emptyset4-means++ or Cj≠∅C_j \neq \emptyset5-medoids, sacrificing only moderate increases in standard clustering cost (Caragiannis et al., 2024).
Algorithm Objective Core/FJR Guarantee Runtime
GreedyCohesiveClustering Arbitrary loss Exact FJR (factor-Cj≠∅C_j \neq \emptyset6) Inefficient
GreedyCapture Average/max-loss Cj≠∅C_j \neq \emptyset7-core (average), Cj≠∅C_j \neq \emptyset8-core (max-loss) Cj≠∅C_j \neq \emptyset9

4. Statistical and Probabilistic Core Notions

Beyond worst-case coalition deviations, statistical core stability quantifies the robustness of cluster membership under stochasticity. For sample-dependent clustering methods (hierarchical, density-based, spectral), core clusters are the largest subsets where every pair ⋃jCj=N\bigcup_j C_j = N0 co-occurs in the same cluster with probability ⋃jCj=N\bigcup_j C_j = N1:

⋃jCj=N\bigcup_j C_j = N2

Estimating ⋃jCj=N\bigcup_j C_j = N3 via bootstrapping and finding core clusters reduces to max-clique identification in a co-occurrence graph. Empirical results indicate non-centroid methods (e.g., hierarchical linkage) tend to have smaller and less pure cores than centroid-based methods, emphasizing the instability of assignments near cluster boundaries (Henelius et al., 2016).

5. Stability in Graph-Based Non-Centroid Clustering

Maximum-likelihood mixture models for graphs (e.g., the NL-EM model) enable a node-centric notion of core stability. Stabilizer nodes are those whose connection patterns strictly exclude membership in all but one group for their neighbors, making the classification "crisp." Extraction involves solving set-cover instances among neighbor excluded-connection sets. The abundance and redundancy of stabilizers corresponds to resilience of the classification under structural perturbations and noise. Real-world examples (U.S. Senate co-voting, food webs) identify stabilizers as information-rich backbones reflective of core community structure (0809.1398).

6. Spectral Stability: Core Assessment via Eigenvalue Gaps

For spectral clustering, stability is assessed via the ⋃jCj=N\bigcup_j C_j = N4-th spectral gap ⋃jCj=N\bigcup_j C_j = N5 and the structured distance to ambiguity ⋃jCj=N\bigcup_j C_j = N6. The latter is the minimum Laplacian perturbation under admissible changes such that the ⋃jCj=N\bigcup_j C_j = N7-th gap collapses. A two-level iterative algorithm, combining constrained gradient flow (inner) and root-finding (outer), computes ⋃jCj=N\bigcup_j C_j = N8 robustly in sparse graphs. Structured stability indicators can yield different optimal cluster numbers compared to unstructured spectral gaps, especially for real data with ambiguous community separation (Andreotti et al., 2019).

Stability Indicator Definition Use Case
Spectral gap ⋃jCj=N\bigcup_j C_j = N9 i∈Ni \in N0 Rapid practical screening
Structured distance i∈Ni \in N1 Min. Frobenius norm i∈Ni \in N2 for vanishing gap Robustness assessment

7. Research Directions and Open Problems

  • The exact i∈Ni \in N3-core often fails to exist under max-loss; i∈Ni \in N4-core existence is currently the best general guarantee. Structured metric properties or alternative losses (sum-loss, i∈Ni \in N5-loss) might admit stronger results but remain open.
  • Efficient auditing procedures enable estimation of FJR violation; analogous constant-factor core audits remain an open technical challenge.
  • Statistical core notions allow fine-grained stability assessments, but scale is limited by max-clique complexity and bootstrap instance size.
  • Extensions to richer and non-pairwise loss functions, adaptive choice of i∈Ni \in N6, and coalition-formation models may yield further insights into tractable core stability for non-centroid clustering (Bredereck et al., 24 Nov 2025, Caragiannis et al., 2024, Henelius et al., 2016, 0809.1398, Andreotti et al., 2019).

A plausible implication is that the conceptual shift from exact core stability to relaxations (FJR, statistical cores, stabilizers, structured spectral stability) provides a necessary framework for achieving practically robust, interpretable, and fair clusterings in complex metric and graph-based domains.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Core Stability in Non-Centroid Clustering.