
Randomized Feasibility Algorithm with Polyak Steps

Updated 30 January 2026
  • The paper introduces a randomized feasibility algorithm that replaces full projection onto intersected constraints with tractable, sampled Polyak subgradient updates.
  • It employs adaptive, parameter-free step-size strategies to achieve linear convergence in strongly convex cases and optimal sublinear rates for general convex functions.
  • Empirical evaluations on QCQP and SVM tasks demonstrate that the method maintains computational efficiency and competitive performance without extensive parameter tuning.

A randomized feasibility algorithm with Polyak steps is a class of iterative methods for constrained convex optimization where computationally tractable projections onto each individual constraint set are used instead of direct projection onto the intersection of all constraints. At each iteration, the algorithm randomly samples constraints and projects the current point towards feasibility using subgradient steps of Polyak type. Adaptive, problem-parameter-free step-size rules and sampled constraint selection enable linear or sublinear convergence rates according to the regularity of the objective function, while maintaining computational practicality when the full constraint projection is prohibitive (Chakraborty et al., 27 Jan 2026).

1. Problem Formulation and Notation

The central problem is of the form

$$\text{minimize} \quad f(x) \quad \text{subject to} \quad x \in X \cap Y,$$

where

  • $f: \mathbb{R}^n \to \mathbb{R}$ is convex (possibly strongly convex and/or smooth),
  • $X := \{x \in \mathbb{R}^n : g_i(x) \le 0,\ i = 1,\ldots,m\}$, with each $g_i$ convex,
  • $Y \subset \mathbb{R}^n$ is a simple closed convex set (such as a box or Euclidean ball).

Key notations:

  • $\|\cdot\|$ is the Euclidean norm,
  • $\Pi_Y[z]$ denotes the projection of $z$ onto $Y$,
  • $g^+(x) := \max\{0, g(x)\}$,
  • $\mathrm{dist}(x, X \cap Y) := \min_{y \in X \cap Y} \|x - y\|$.

A global error bound assumption is used: there exist $c > 0$ and a sampling distribution $\omega$ over $\{1,\ldots,m\}$ such that for all $x \in Y$,

$$\mathrm{dist}^2(x, X \cap Y) \le c\,\mathbb{E}_\omega\!\left[ (g_\omega^+(x))^2 \right].$$

2. Randomized Feasibility Algorithm with Polyak Steps

The algorithm performs a sequence of feasibility updates, each consisting of $N_k$ substeps at iteration $k$. Each feasibility substep involves:

  • Sampling a constraint index $\omega \in \{1,\ldots,m\}$ uniformly,
  • Computing a subgradient $d^i \in \partial g_\omega^+(z^{i-1})$,
  • Updating via the Polyak-type step $$z^i = \Pi_Y\!\left[ z^{i-1} - \beta\, \frac{g_\omega^+(z^{i-1})}{\|d^i\|^2}\, d^i \right],$$ where $\beta \in (0,2)$ is a fixed relaxation parameter.

After $N_k$ such substeps, $x_k = z^{N_k}$. This scheme avoids projection onto $X \cap Y$, replacing it with computationally tractable projections onto $Y$ and randomized selection of individual constraints.
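As a minimal sketch (not the paper's implementation), the feasibility phase can be written for linear constraints $g_i(x) = a_i^\top x - b_i$ with $Y$ a box; with $\beta = 1$, each Polyak substep is then an exact projection onto the sampled halfspace. The instance data, $\beta$, and the substep count below are illustrative assumptions.

```python
# One feasibility phase (N substeps) for linear constraints
# g_i(x) = a_i @ x - b_i on a box Y = [lo, hi]^n.
import numpy as np

def project_box(z, lo, hi):
    """Cheap projection onto the simple set Y (here a box)."""
    return np.clip(z, lo, hi)

def feasibility_phase(z, A, b, lo, hi, beta=1.0, N=50, rng=None):
    """Run N randomized Polyak substeps toward {x : A x <= b} ∩ Y."""
    if rng is None:
        rng = np.random.default_rng(0)
    m = A.shape[0]
    for _ in range(m and N):
        i = rng.integers(m)                   # sample a constraint index
        g_plus = max(0.0, A[i] @ z - b[i])    # violation g_i^+(z)
        if g_plus > 0.0:
            d = A[i]                          # subgradient of g_i^+ at z
            z = z - beta * g_plus / (d @ d) * d   # Polyak step
        z = project_box(z, lo, hi)            # projection onto Y
    return z

# Two halfspaces intersected with the box [-10, 10]^2.
A = np.array([[1.0, 1.0], [1.0, -1.0]])
b = np.array([1.0, 0.5])
z = feasibility_phase(np.array([8.0, 8.0]), A, b, -10.0, 10.0)
violation = np.maximum(A @ z - b, 0.0).max()
```

Because the constraints here are linear and $\beta = 1$, the phase drives the violation to (numerically) zero in a handful of substeps; for nonlinear $g_i$ the same loop applies with `g_plus` and `d` computed from the sampled constraint.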

Under the error-bound and bounded subgradient assumptions, the following hold:

  • Nonexpansiveness: for any feasible $x \in X \cap Y$, $\|x_k - x\| \le \|v_k - x\|$, where $v_k = z^0$ is the point entering the feasibility phase.
  • Geometric decrease in infeasibility: $$\mathbb{E}\!\left[ \mathrm{dist}^2(x_k, X \cap Y) \mid v_k \right] \le (1 - q)^{N_k}\, \mathrm{dist}^2(v_k, X \cap Y),$$ where $q = \beta(2-\beta)/(c M_g^2)$ and $M_g$ bounds the constraint subgradient norms.

3. Interleaved Objective Minimization and Feasibility Updates

The algorithm alternates or interleaves randomized feasibility updates with (sub)gradient steps for objective minimization. Two major cases are considered:

Strongly Convex, $L$-Smooth Objective

Assumptions:

  • $f$ has an $L$-Lipschitz gradient,
  • $f$ is $\mu$-strongly convex.

Algorithm steps:

  1. Compute $v_{k+1} = \Pi_Y[x_k - \alpha_k \nabla f(x_k)]$,
  2. Update $x_{k+1}$ by applying the randomized feasibility algorithm to $v_{k+1}$ with $N_{k+1}$ substeps.

Adaptive Polyak-type step size: $$\alpha_k = \min\left\{ \frac{1}{2(L-\mu)},\ \frac{1}{L},\ \frac{\epsilon}{2\|\nabla f(x_k)\|^2} \right\},$$ where $\epsilon$ is a prescribed accuracy.

Weighted averaging is used: $$\bar{x}_k = \frac{\sum_{t=1}^k (1 - \bar{\mu})^{k-t}\, \alpha_t\, x_t}{\sum_{s=1}^k (1 - \bar{\mu})^{k-s}\, \alpha_s}, \qquad \bar{\mu} = \min\left\{ \frac{1}{2(L-\mu)},\ \frac{1}{L},\ \frac{\epsilon}{2 M_f^2} \right\}.$$
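The interleaved scheme for the strongly convex, smooth case can be sketched on a toy instance. The quadratic $f$, the single linear constraint, the schedule $N_k = \lceil\sqrt{k}\rceil$, and the accuracy $\epsilon$ below are illustrative assumptions, and for simplicity the sketch tracks the last iterate rather than the paper's weighted average $\bar{x}_k$.

```python
# Interleaved scheme: projected gradient step with the adaptive alpha_k,
# followed by a randomized feasibility phase (here one linear constraint,
# so each Polyak substep with beta = 1 is an exact halfspace projection).
import math
import numpy as np

mu, L, eps = 1.0, 2.0, 1.0             # f(x) = 0.5(x1-3)^2 + (x2-3)^2
f = lambda x: 0.5 * (x[0] - 3.0) ** 2 + (x[1] - 3.0) ** 2
grad = lambda x: np.array([x[0] - 3.0, 2.0 * (x[1] - 3.0)])

a, cb = np.array([1.0, 1.0]), 1.0      # constraint g(x) = a @ x - cb <= 0
lo, hi = -10.0, 10.0                   # Y = box

x = np.array([5.0, 5.0])
for k in range(1, 501):
    gk = grad(x)
    # adaptive Polyak-type step; the 1/(2(L-mu)) term is dropped if L == mu
    cands = [1.0 / L, eps / (2.0 * (gk @ gk))]
    if L > mu:
        cands.append(1.0 / (2.0 * (L - mu)))
    alpha = min(cands)
    x = np.clip(x - alpha * gk, lo, hi)       # objective step + Pi_Y
    for _ in range(math.ceil(math.sqrt(k))):  # N_k feasibility substeps
        viol = max(0.0, a @ x - cb)
        if viol > 0.0:
            x = x - viol / (a @ a) * a        # Polyak step, beta = 1
        x = np.clip(x, lo, hi)

fstar = 75.0 / 9.0   # constrained optimum of this toy instance (via KKT)
gap = abs(f(x) - fstar)
```

On this instance the iterates settle on the constrained optimum $(-1/3, 4/3)$, illustrating that no knowledge beyond $L$, $\mu$, and the target accuracy enters the step-size rule.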

Convex, Possibly Nonsmooth Objective: Distance-over-Weighted-Subgradients (DoWS)

Assumptions:

  • $f$ is convex (possibly nondifferentiable),
  • $Y$ is convex and bounded with diameter $D$.

For $T$ iterations:

  • Maintain $r_k = \max\{ r_{k-1}, \|x_k - x_0\| \}$,
  • Accumulate $p_k = p_{k-1} + r_k^2 \|s_f(x_k)\|^2$, where $s_f(x_k) \in \partial f(x_k)$,
  • Set $\alpha_k = r_k^2 / \sqrt{p_k}$,
  • Compute $v_{k+1} = \Pi_Y[x_k - \alpha_k s_f(x_k)]$,
  • Randomized feasibility update as above.

A weighted-average output $\bar{x}_\tau$ is returned, with $\tau$ chosen to minimize $r_{k+1}^2 / \sum_{i=1}^k r_i^2$.
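The DoWS bookkeeping above can be sketched as follows. The nonsmooth objective $f(x) = \|x\|_1$, the single linear constraint, the initial radius guess $r_0$, and the horizon $T$ are illustrative assumptions, and for simplicity the sketch returns the last iterate rather than the weighted average $\bar{x}_\tau$.

```python
# DoWS (distance-over-weighted-subgradients) step sizes interleaved with
# a feasibility pass; alpha_k = r_k^2 / sqrt(p_k) needs no problem
# parameters beyond an initial radius guess r0 > 0.
import numpy as np

f = lambda x: np.abs(x).sum()          # nonsmooth objective ||x||_1
subgrad = lambda x: np.sign(x)         # a subgradient of ||.||_1
a, cb = np.array([-1.0, -1.0]), -1.0   # g(x) = 1 - x1 - x2 <= 0
lo, hi = -10.0, 10.0                   # Y = box

x0 = np.array([3.0, 3.0])
x, r, p = x0.copy(), 0.1, 0.0          # r0 = 0.1: initial radius guess
for k in range(2000):
    s = subgrad(x)
    r = max(r, np.linalg.norm(x - x0))     # running radius r_k
    p = p + r ** 2 * (s @ s)               # weighted subgradient mass p_k
    alpha = r ** 2 / np.sqrt(p)            # DoWS step size
    x = np.clip(x - alpha * s, lo, hi)     # objective step + Pi_Y
    viol = max(0.0, a @ x - cb)            # feasibility pass: one linear
    if viol > 0.0:                         # constraint, beta = 1, i.e. an
        x = x - viol / (a @ a) * a         # exact halfspace projection
    x = np.clip(x, lo, hi)

final_gap = f(x) - 1.0                 # f* = 1 on this toy instance
```

Note how the step size adapts: as the running radius $r_k$ grows, $\alpha_k$ grows with it, while the accumulated mass $p_k$ eventually enforces the $O(1/\sqrt{T})$-type decay.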

4. Convergence Guarantees and Theoretical Rates

Strongly Convex, Smooth Case

For adaptive stepsizes as above and exponential weighting,

$$\mathbb{E}\left[ |f(\bar{x}_k) - f(x^*)| \right] \le \epsilon$$

after $k = O(\log(1/\epsilon))$ outer iterations, provided the mean reduction in infeasibility per iteration meets a prescribed threshold (Chakraborty et al., 27 Jan 2026).

Convex, Possibly Nonsmooth Case

After $T$ iterations of DoWS with feasibility updates, the output $\bar{x}_\tau$ satisfies

$$\mathbb{E}\left[ |f(\bar{x}_\tau) - f^*| \right] \le \max\{A_1(T),\ \min\{A_2(\tau), A_3(T)\}\},$$

with $$\begin{aligned} A_1(T) &= \frac{2 D M_f}{\sqrt{T}}\left(\frac{D}{r}\right)^{\frac{2}{T}\ln(e D^2/r^2)},\\ A_2(\tau) &= D M_f \max_{1\le k\le\tau} \mathbb{E}\!\left[ (1-q)^{N_k/2} \right],\\ A_3(T) &= \frac{D M_f}{T} \left(\frac{D}{r}\right)^{\frac{2}{T}\ln(e D^2/r^2)} \sum_{k=1}^{\tau} \mathbb{E}\!\left[ (1-q)^{N_k/2} \right], \end{aligned}$$ yielding the optimal $O(1/\sqrt{T})$ rate as $T \to \infty$ up to sampling-determined terms.

For unbounded $Y$, a tamed (logarithmically adjusted) variant of the DoWS step size ensures bounded iterates and the same $O(1/\sqrt{T})$ expected error rate up to constants that grow logarithmically in $T$.

5. Sampling Distribution Regimes and Computational Properties

Performance and theoretical rates depend critically on the distribution of the number of feasibility substeps $N_k$ at each outer iteration. Common regimes:

  • Deterministic polynomial growth: $N_k = \lceil k^{1/p} \rceil$ ensures that the sum $\sum_k (1-q)^{k^{1/(2p)}}$ is uniformly bounded.
  • Poisson sampling: $N_k \sim \mathrm{Pois}(\lambda_k)$ with $\lambda_k \approx k^{1/p}$ yields $\mathbb{E}[(1-q)^{N_k/2}] = \exp(-\lambda_k(1-\sqrt{1-q}))$, which decays polynomially in $k$.
  • Binomial sampling: $N_k \sim \mathrm{Bin}(n_k, p)$ with $n_k \approx k^{1/p}$ gives similar decay properties.

Sub-polynomial growth of $N_k$ suffices to make the sampling-driven error negligible at polylogarithmic cost in total feasibility steps.
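The Poisson identity quoted above follows from the Poisson generating function $\mathbb{E}[s^N] = \exp(\lambda(s-1))$ with $s = \sqrt{1-q}$, and can be checked numerically; the values of $q$, $\lambda_k$, and $p$ below are illustrative.

```python
# Numeric check of the sampling-regime formulas: Poisson closed form
# vs. a truncated pmf series, and summability of the deterministic
# schedule N_k = ceil(k^(1/p)).
import math

def poisson_expectation(lam: float, q: float, terms: int = 200) -> float:
    """E[(1-q)^{N/2}] for N ~ Pois(lam), by truncating the pmf series."""
    s = math.sqrt(1.0 - q)
    term = total = math.exp(-lam)        # n = 0 term: pmf(0) * s^0
    for n in range(1, terms):
        term *= lam * s / n              # pmf(n)*s^n from pmf(n-1)*s^(n-1)
        total += term
    return total

q, lam = 0.3, 9.0                        # e.g. lam = k^{1/p} at k = 81, p = 2
series = poisson_expectation(lam, q)
closed_form = math.exp(-lam * (1.0 - math.sqrt(1.0 - q)))

# Deterministic schedule: the per-iteration factors (1-q)^{N_k/2} are summable.
p = 2
decay = [(1.0 - q) ** (math.ceil(k ** (1.0 / p)) / 2.0) for k in range(1, 501)]
```

The recursive update of `term` avoids overflowing `math.factorial` for large indices while summing the same series.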

6. Empirical Evaluation: QCQP and SVM Applications

Simulations were conducted on two canonical classes of problems:

Quadratically Constrained Quadratic Programming (QCQP)

The problem $$\min_{x \in [-10,10]^{10}}\ x^\top A x + b^\top x \quad \text{s.t.}\ \ x^\top C_i x + u_i^\top x - e_i^\top x \le 0, \quad i = 1,\ldots,m$$ was tested in three regimes:

  • (a) Strongly convex $A \succ 0$, known $f^*$,
  • (b) Strongly convex, unknown $f^*$,
  • (c) Convex $A \succeq 0$, unknown $f^*$.

Baselines included the Nedić et al. subgradient-projection method, the Arrow–Hurwicz and Alt-GDA primal-dual schemes, ACVI (ADMM plus log-barrier), and the CVXPY interior-point solver.

Key observations:

  • The adaptive Polyak-step algorithm achieved linear convergence in (a), requiring no prior knowledge of strong convexity or smoothness parameters.
  • DoWS and T-DoWS performed competitively in (b) and (c), attaining the expected $O(1/\sqrt{T})$ rate slope.
  • ACVI provided the fastest infeasibility decay but required expensive tuning.

Support Vector Machine (SVM) Soft-Margin Classification

For the SVM problem

$$\min_{w,b,\xi}\ \frac{1}{2}\|w\|^2 + C \sum_i \xi_i \quad \text{s.t.}\ \ 1 - \xi_i - y_i(w^\top z_i + b) \le 0, \quad \xi_i \ge 0,$$

the UCI Banknote, Breast-Cancer, and MNIST 3-vs-5 datasets were used. Only DoWS/T-DoWS and primal-dual (Arrow–Hurwicz/Alt-GDA) baselines were compared due to convexity.

Results:

  • DoWS/T-DoWS schemes reduced objective and infeasibility rapidly;
  • Test-set misclassification rates were competitive with cross-validated primal-dual methods;
  • Methods required no parameter tuning.
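The SVM constraints map directly into the feasibility framework: with $x = (w, b, \xi)$, both $1 - \xi_i - y_i(w^\top z_i + b) \le 0$ and $-\xi_i \le 0$ are linear in $x$, so each Polyak substep with $\beta = 1$ is an exact halfspace projection. The sketch below (synthetic data, iteration counts, and $Y = \mathbb{R}^n$ so that $\Pi_Y$ is the identity, all illustrative assumptions) shows only the feasibility phase, i.e. finding a point satisfying all margin constraints, without the objective steps.

```python
# Soft-margin SVM constraints as the g_i of the feasibility framework,
# driven to feasibility by randomized Polyak substeps (beta = 1).
import numpy as np

rng = np.random.default_rng(1)
Z = rng.normal(size=(20, 2))              # 20 toy points in R^2
y = np.where(Z[:, 0] + Z[:, 1] > 0, 1.0, -1.0)
n = 2 + 1 + len(Z)                        # dims of x = (w, b, xi)

# Stack every constraint as a row of "A_c @ x <= c":
# margin: -y_i z_i . w - y_i b - xi_i <= -1 ; nonnegativity: -xi_i <= 0.
A_rows, c = [], []
for i, (zi, yi) in enumerate(zip(Z, y)):
    row = np.zeros(n); row[:2] = -yi * zi; row[2] = -yi; row[3 + i] = -1.0
    A_rows.append(row); c.append(-1.0)
    row = np.zeros(n); row[3 + i] = -1.0
    A_rows.append(row); c.append(0.0)
A_c, c = np.array(A_rows), np.array(c)

x = np.zeros(n)
for _ in range(5000):                     # randomized Polyak substeps
    j = rng.integers(len(c))
    viol = A_c[j] @ x - c[j]
    if viol > 0.0:
        x = x - viol / (A_c[j] @ A_c[j]) * A_c[j]

max_violation = np.maximum(A_c @ x - c, 0.0).max()
```

Each substep touches a single data point, which is what keeps the per-iteration cost low on larger datasets; the full method would interleave these passes with (sub)gradient steps on $\frac{1}{2}\|w\|^2 + C\sum_i \xi_i$.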

7. Theoretical Significance and Practical Implications

Randomized feasibility algorithms with Polyak steps provide a rigorously justified, computation-efficient approach to large-scale constrained convex optimization where projection onto intersected constraints is intractable. Theoretical results guarantee:

  • Linear convergence to any prespecified tolerance for strongly convex, $L$-smooth $f$;
  • Optimal $O(1/\sqrt{T})$ rates in the convex, potentially nonsmooth setting;
  • Bounded sampling-driven error without demanding hyperparameter tuning or explicit knowledge of problem parameters.

Empirical results indicate practical competitiveness against state-of-the-art first-order and primal-dual methods, particularly when problem structure or scale make conventional projection approaches prohibitively costly (Chakraborty et al., 27 Jan 2026).
