Partition Method for BlockRR
- The paper introduces a principled, weight-based partition of label sets to optimize the trade-off between accuracy and privacy in label-differentially private randomized response algorithms.
- The methodology employs a weight matrix derived from private prior estimates and a tunable sharpness parameter to distinguish between majority and minority label blocks.
- Empirical results indicate improved per-class accuracy and effective noise regulation, particularly under conditions of imbalanced class distributions.
The Partition Method for BlockRR is a framework for partitioning a label set—based on prior information about label frequencies—to optimize the trade-off between accuracy and privacy in label-differentially private randomized response algorithms. It introduces a principled, weight-based partition of possible labels into “majority” and “minority” blocks, enabling distinct randomized response mechanisms on each and generalizing many previous approaches under one unified scheme (Liu et al., 3 Feb 2026). BlockRR’s partition method is crucial for balancing the injected noise across classes, especially when class distributions are non-uniform or imbalanced, and is mathematically constructed via a weight matrix that encodes prior probabilities and controls the degree of block separation.
1. Purpose and Integration with BlockRR
The central purpose of the partition method in BlockRR is to divide the label set into two subsets, $B_{\text{maj}}$ (majority) and $B_{\text{min}}$ (minority), such that high-prior labels receive standard (“diagonal”) randomized response (RR), while low-prior labels are handled using a more noise-uniformized mechanism. The partitioning ensures that the randomization preserves utility for common classes without sacrificing privacy, and prevents excessive performance degradation due to label imbalance. After partitioning, BlockRR applies block-specific randomization rules to the four possible regions of the label–privatized label Cartesian product, adapting noise to class support (Liu et al., 3 Feb 2026).
2. Construction of the Weight Matrix
The method starts from a private estimate $\hat{p}$ of the prior distribution over labels, typically obtained with an $\varepsilon_{\text{prior}}$-differentially private mechanism (e.g., the Laplace mechanism). The partition relies on a weight matrix $W$ defined entrywise as

$$W_{y y'} = \begin{cases} \hat{p}(y) & y' = y, \\ e^{-\gamma}\,\hat{p}(y') & y' \neq y, \end{cases}$$

where $\gamma > 0$ is a tunable sharpness parameter. This assignment ensures that the diagonal element $W_{yy}$ represents the direct support for label $y$, while the off-diagonal entries are exponentially downweighted versions of the prior mass of the other labels. The matrix captures both the global class balance and label locality, making it suitable for discriminating between well-supported and rare classes (Liu et al., 3 Feb 2026).
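As a concrete illustration, the private prior estimate and the weight matrix described above can be computed as follows. This is a minimal sketch: the function names and the sensitivity constant (2, for a one-label-per-user histogram) are assumptions, not taken from the paper.

```python
import numpy as np

def private_prior(counts, eps_prior, rng):
    """Laplace-mechanism estimate of the label prior (illustrative).

    Changing one example's label moves two histogram counts by 1 each,
    so the L1 sensitivity of `counts` is taken as 2 here.
    """
    noisy = counts + rng.laplace(scale=2.0 / eps_prior, size=len(counts))
    noisy = np.clip(noisy, 1e-12, None)   # keep every class strictly positive
    return noisy / noisy.sum()            # renormalize to a distribution

def weight_matrix(p_hat, gamma):
    """W[y, y'] = p_hat[y] on the diagonal, exp(-gamma) * p_hat[y'] off it."""
    K = len(p_hat)
    W = np.exp(-gamma) * np.tile(p_hat, (K, 1))  # off-diagonal template
    np.fill_diagonal(W, p_hat)                   # direct support on the diagonal
    return W
```

Larger `gamma` suppresses the off-diagonal entries more strongly, so more rows become diagonal-dominant in the partition step that follows.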
3. Mathematical Formulation of the Partition
The partition method selects $B_{\text{maj}}$ as the set of labels whose diagonal weight dominates all off-diagonal entries in its row,

$$B_{\text{maj}} = \{\, y \in \mathcal{Y} : W_{yy} > W_{yy'} \ \text{for all } y' \neq y \,\},$$

and $B_{\text{min}} = \mathcal{Y} \setminus B_{\text{maj}}$. This formalizes the notion of “majority” labels without requiring arbitrary thresholds; the split is dictated entirely by the estimated prior $\hat{p}$ and the sharpness parameter $\gamma$. A block-ID function encodes this mapping. The resulting split determines which labels receive more protective noise injection and which are granted more accurate privatization (Liu et al., 3 Feb 2026).
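The diagonal-dominance rule can be implemented directly, since a label is majority exactly when its prior exceeds the downweighted largest competing prior. A minimal sketch, with illustrative names:

```python
import numpy as np

def partition_labels(p_hat, gamma):
    """Split labels into (majority, minority) by row-wise diagonal dominance.

    Label y is 'majority' iff p_hat[y] exceeds exp(-gamma) times every
    competitor's prior, i.e. exp(-gamma) * max over the other labels.
    """
    p_hat = np.asarray(p_hat, dtype=float)
    B_maj, B_min = [], []
    for y in range(len(p_hat)):
        competitors = np.delete(p_hat, y)
        (B_maj if p_hat[y] > np.exp(-gamma) * competitors.max() else B_min).append(y)
    return B_maj, B_min
```

With `p_hat = [0.5, 0.3, 0.1, 0.1]` and `gamma = 1.0`, labels 0 and 1 are diagonal-dominant; lowering `gamma` to 0.1 leaves only label 0 in the majority block, showing how a lower sharpness parameter broadens the minority block.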
4. Blockwise Randomized Response Mechanism
Once $B_{\text{maj}}$ and the corresponding privatized-label blocks are determined, BlockRR partitions the response mechanism over the four blocks of the label–privatized-label product (majority or minority on each axis). Transition probabilities differ across these regions:
- For the (majority, majority) block, a diagonal RR is used, concentrating probability mass on the matching privatized label and spreading a uniform residual mass otherwise.
- For the other blocks, partially uniformized transition probabilities, parameterized by block-specific constants, are assigned and computed via normalization constraints ensuring a proper probability distribution.
- A distinguished subset $S$ (of size $m$) is used for minority labels to prevent oversmoothing and class collapse; the interpolation parameter $\lambda$ interpolates between conventional RR and prior-weighted RR.
These block constants emerge from normalization equations specific to each block, yielding closed-form solutions (Liu et al., 3 Feb 2026).
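The blockwise structure can be sketched as follows. This is a deliberate simplification: majority rows use standard K-ary RR, and minority rows interpolate (parameter `lam`) toward the uniform distribution rather than the paper's prior-weighted target with closed-form block constants; every name and constant here is an assumption.

```python
import numpy as np

def blockwise_transition_matrix(K, B_maj, eps, lam):
    """Row-stochastic matrix T[y, y_tilde] = Pr[output y_tilde | true y].

    Majority rows: standard K-ary randomized response.
    Minority rows: lam * (RR row) + (1 - lam) * uniform, a partially
    uniformized row.  Both endpoints satisfy eps-label DP, and so does
    every mixture: each column's max/min ratio stays <= exp(eps).
    """
    c = 1.0 / (np.exp(eps) + K - 1)
    rr = np.full((K, K), c)
    np.fill_diagonal(rr, np.exp(eps) * c)
    uniform = np.full(K, 1.0 / K)
    return np.array([rr[y] if y in B_maj
                     else lam * rr[y] + (1 - lam) * uniform
                     for y in range(K)])

def privatize(y, T, rng):
    """Draw a privatized label from the row of the true label."""
    return int(rng.choice(len(T), p=T[y]))
```

Setting `lam = 1` recovers plain RR on every row, while `lam = 0` fully uniformizes the minority rows; intermediate values give the partially uniformized regime.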
5. Algorithmic Recipe and Complexity
The partition method is implemented as follows:
- Compute the weight matrix $W$ from $\hat{p}$ and $\gamma$.
- For each label $y$, declare $y \in B_{\text{maj}}$ if $W_{yy} > W_{yy'}$ for all $y' \neq y$; otherwise assign $y$ to $B_{\text{min}}$.
- Split $\hat{\mathcal{Y}}$ (the space of output labels) into majority and minority blocks using the block-ID mapping (e.g., top-$m$ prior labels for each $y$).
- Compute the distinguished subset $S$ as the set of the $m$ highest-prior labels.
- Solve for the block transition constants by inverting a small linear system determined by the block support sizes and the privacy parameter $\varepsilon$.
- For each privatization operation, draw the privatized label $\tilde{y}$ according to the transition probabilities determined by the block containing the true label $y$.
The dominant cost is the $O(K^2)$ construction of $W$ over a label space of size $K$, plus $O(K)$ work per privatization. Solving the small linear system and sampling are negligible in comparison (Liu et al., 3 Feb 2026).
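The steps above can be combined into one end-to-end sketch. The function name, the Laplace sensitivity constant, and the fully uniformized minority rows are simplifying assumptions; the paper instead solves for partially uniformized block constants.

```python
import numpy as np

def blockrr_privatize(labels, counts, eps, eps_prior, gamma, rng):
    """End-to-end sketch of the recipe: prior -> partition -> blockwise RR."""
    K = len(counts)
    # 1. eps_prior-DP prior estimate (label histogram, L1 sensitivity 2).
    noisy = np.clip(counts + rng.laplace(scale=2.0 / eps_prior, size=K),
                    1e-12, None)
    p_hat = noisy / noisy.sum()
    # 2. Diagonal-dominance partition of the label set.
    B_maj = {y for y in range(K)
             if p_hat[y] > np.exp(-gamma) * np.delete(p_hat, y).max()}
    # 3. Blockwise transition matrix: RR rows for majority labels,
    #    fully uniformized rows for minority labels (a simplification).
    c = 1.0 / (np.exp(eps) + K - 1)
    T = np.full((K, K), 1.0 / K)
    for y in B_maj:
        T[y] = c
        T[y, y] = np.exp(eps) * c
    # 4. O(K) sampling per privatized label.
    return [int(rng.choice(K, p=T[y])) for y in labels], sorted(B_maj)
```

Building `T` costs $O(K^2)$, matching the dominant cost noted above, and each privatized draw is $O(K)$.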
6. Theoretical Guarantees and Privacy-Utility Trade-offs
BlockRR’s partitioned mechanism rigorously satisfies $\varepsilon$-label differential privacy by construction. The composition of partitioned mechanisms is also $\varepsilon$-label DP under standard parallel composition principles, provided that the data splits are disjoint (Liu et al., 3 Feb 2026).
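For a randomized-response mechanism given as a row-stochastic matrix, label DP has a simple column-wise characterization: every column's largest entry must be at most $e^{\varepsilon}$ times its smallest. A small checker (the helper name is ours), demonstrated on standard K-ary RR, which saturates the bound:

```python
import numpy as np

def satisfies_label_dp(T, eps, tol=1e-9):
    """Check eps-label DP for a randomized-response matrix T.

    Rows index true labels, columns privatized labels.  The mechanism is
    eps-label DP iff Pr[out = j | y1] <= exp(eps) * Pr[out = j | y2]
    for every column j and every pair of true labels (y1, y2).
    """
    T = np.asarray(T, dtype=float)
    return bool((T.max(axis=0) <= np.exp(eps) * T.min(axis=0) + tol).all())
```

Standard RR calibrated to a budget of 1.0 passes at that budget but fails any strictly smaller one, since its column ratios equal $e^{\varepsilon}$ exactly.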
Utility and privacy are controlled primarily by the sharpness parameter $\gamma$ and the interpolation parameter $\lambda$:
- Lower $\gamma$ broadens $B_{\text{min}}$, increasing noise on rare classes at the expense of utility.
- Varying $\lambda$ interpolates between standard RR and prior-weighted RR, allowing practitioners to tune the privacy-utility trade-off externally.
- Empirical evaluation demonstrates that in the high- and moderate-privacy regimes (small to moderate $\varepsilon$), the partition method yields strictly better test and per-class accuracy than unpartitioned methods, especially under class imbalance. In the low-privacy regime (large $\varepsilon$), the method reduces to standard RR with no further performance loss (Liu et al., 3 Feb 2026).
7. Implications and Applicability
The partition method for BlockRR unifies a wide range of label-differentially-private randomized response mechanisms within a single parameterized framework. It provides systematic control over the partitioning of label sets, adapting flexibly to the empirical distribution of labels and allowing blockwise customization of noise. This is significant in settings with heavy class imbalance or when fine-grained control over per-class accuracy is required. The partition step is efficient for moderate to large label spaces, since its quadratic cost in the number of labels is incurred only once at setup. The approach is readily extensible to structured output domains, as the only requirement is the ability to define a block-ID mapping and candidate sets for privatization.
A plausible implication is that further refinement of the weight matrix (e.g., nonlinear weighting, dependency on other statistics) could yield even more flexible or utility-preserving variants of the BlockRR partition, potentially generalizing beyond label DP to other forms of privatization or fairness constraints (Liu et al., 3 Feb 2026).