Convex–Concave Procedure (CCP) Overview

Updated 17 January 2026

CCP is a framework that converts nonconvex optimization problems, expressed as differences of convex functions, into a sequence of tractable convex subproblems.
It employs first-order Taylor linearizations for concave terms and adaptive penalty mechanisms to handle constraints and ensure convergence.
The approach is widely used in diverse applications such as power systems, machine learning, and signal processing, with extensions like DCCP enhancing feasibility and robustness.

The Convex–Concave Procedure (CCP) is a majorization–minimization algorithmic framework for solving nonconvex optimization problems whose objectives and constraints are expressible as a difference of convex (DC) functions. CCP converts the original nonconvex program into a sequence of tractable convex subproblems via linearization of the concave terms. This meta-algorithm is widely employed across a variety of scientific and engineering domains, and underpins specialized implementations such as Disciplined Convex–Concave Programming (DCCP) (Shen et al., 2016), as well as connecting to classical optimization methods such as Frank–Wolfe (Yurtsever et al., 2022).

1. Mathematical Formulation of DC Problems and CCP

A problem is amenable to CCP if its objective and constraints can be written as sums or differences of convex and concave functions. The canonical DC program has the form: $\begin{aligned} & \underset{x \in \mathbb{R}^n}{\text{minimize}} \quad f_0(x) - g_0(x) \ & \text{subject to} \quad f_i(x) - g_i(x) \leq 0,\quad i = 1, ..., m \end{aligned}$ where $f_i$ and $g_i$ are convex functions. Equivalently, the objective and constraints may be decomposed as:

$f(x) = f_{\text{cvx}}(x) + f_{\text{ccv}}(x)$
$g_i(x) = g_{i,\text{cvx}}(x) + g_{i,\text{ccv}}(x) \leq 0$ with $f_{\text{ccv}}$ and $g_{i,\text{ccv}}$ concave.

CCP applies a first-order Taylor (affine) upper bound to each concave term at the current iterate $x^{(k)}$ : $f_{\text{ccv}}(x) \leq f_{\text{ccv}}(x^{(k)}) + \nabla f_{\text{ccv}}(x^{(k)})^T (x - x^{(k)}) \equiv \widehat{f}_{\text{ccv}}(x;x^{(k)})$ Similarly for the constraints: $g_{i,\text{ccv}}(x) \leq \widehat{g}_{i,\text{ccv}}(x;x^{(k)}) = g_{i,\text{ccv}}(x^{(k)}) + \nabla g_{i,\text{ccv}}(x^{(k)})^T(x - x^{(k)})$

The convex subproblem at iteration $f_i$ 0 is then: $f_i$ 1 Here, slack variables $f_i$ 2 with weights $f_i$ 3 penalize constraint violation and enforce feasibility (Shen et al., 2016).

2. Algorithmic Structure and CCP/DCCP Enhancements

The CCP iteration is succinctly captured as:

Compute gradients of concave terms at $f_i$ 4.
Form convex subproblem by replacing all concave terms with their affine majorizers.
Solve the subproblem for new iterate $f_i$ 5 and slacks.
If the new point is on a non-differentiable boundary, damp:

$f_i$ 6

Otherwise, take the full step.

Increase penalty weight $f_i$ 7 and repeat until convergence.

Disciplined Convex–Concave Programming (DCCP) (Shen et al., 2016) introduces two principal refinements:

Domain-aware linearization: When concave atoms have restricted domains, linearizing alone can lead to infeasible proposals. DCCP incorporates indicator functions:

$f_i$ 8

which are zero in the domain and $f_i$ 9 outside, preserving feasibility within proper convex sets.

Boundary nondifferentiability handling: When linearization yields solutions on domain boundaries with non-existent gradients, DCCP implements interior damping to move iterates away from such points.

These ensure robust convergence and automate constraint handling in generic modeling frameworks such as CVXPY/DCCP and YALMIP.

3. Theoretical Connections and Convergence Analysis

CCP is formally equivalent to the Frank–Wolfe (FW) algorithm applied to a suitable epigraphical reformulation of the DC problem (Yurtsever et al., 2022). For a program $g_i$ 0, it can be rewritten as

$g_i$ 1

Frank–Wolfe updates map exactly to CCP linearizations and, therefore, inherit non-asymptotic convergence rates. When the surrogate is concave on the feasible set, the iterates satisfy a stationarity gap of $g_i$ 2: $g_i$ 3 For practical implementations, under mild conditions (Lipschitz gradients, bounded level sets), the objective with penalty is nonincreasing and slacks decrease to zero. Accumulation points satisfy first-order KKT conditions for the original DC problem (Shen et al., 2016).

Nonconvexity precludes global optimality guarantees; CCP is a majorization–minimization heuristic rather than a globally convergent algorithm.

4. Implementation Strategies and Representative Applications

DCCP extends convex modeling libraries such as CVXPY with:

Expression domains and gradients for subdifferentiability checks.
Problem compliance checking (is_dccp).
A generic CCP loop (problem.solve(method='dccp')).

Typical usage involves expressing constraints and objectives in DCP-compliant atomic functions, automatically identifying DC structures, and solving convex surrogates at each iteration.

Representative applications include:

Circle packing: Placing $g_i$ 4 circles in the minimal bounding box, constraints expressed via nonconvex distance inequalities.
Boolean least-squares: Quadratic minimization with constraints $g_i$ 5, which is nonconvex and handled via CCP efficiently.
Morphological neural networks: Training morphological perceptrons (e.g., MPCLs or SLMPs) employs CCP to navigate non-differentiable max/min operators, outperforming greedy and gradient methods in multiclass classification tasks (Cunha et al., 2024 Cunha et al., 6 Sep 2025).
Power systems: Recovering feasible solutions for optimal power flow via penalty variant CCP, outperforming semi-definite relaxation methods in speed and solution quality (Tian et al., 2017).
Indefinite kernel learning: Handling logistic regression with indefinite kernels decomposed into positive and negative definite parts, solved efficiently by inexact CCP and stochastic CCP variants (CCICP–GD, CCICP–SGD) (Liu et al., 2017).

5. Practical Considerations and Extensions

Practical enhancements for CCP include:

Initialization from relaxed convex models (e.g., SOCP or SDP relaxations).
Penalty parameter tuning and adaptive slack mechanisms for constraint violation handling.
Warm starts and incremental model updates for high-dimensional problems and scalability (Cubuktepe et al., 2021).
Embedding exact verification/model checking within each CCP iteration, essential for hybrid symbolic-numeric frameworks (e.g., pMDPs).

Table: Key algorithmic components in CCP-based modeling frameworks

Component	Function	Modeling System
Domain indicator	Enforces variable domain constraints	DCCP/CVXPY
Gradient/subgradient	Affine majorization of concave terms	DCCP/CVXPY
Slacks + penalty weight	Maintains feasibility in surrogates	Penalty CCP
Model checking loop	Verifies candidate solution after numerical step	pMDPs/PROPhESY

6. Broader Impact and Research Directions

Viewing CCP as Frank–Wolfe on DC programs enables immediate transfer of convergence theory, rates, and algorithmic innovations from the extensive literature on conditional gradient methods (Yurtsever et al., 2022). This insight unifies MM, CCP, and FW under a single linear-oracle methodology, opening lines of research in accelerated variants, block-coordinate schemes, stochastic updates, and nonsmooth extensions.

CCP's disciplined frameworks streamline DC modeling across signal processing, robust control, learning with indefinite kernels, combinatorial NP-hard problems (QAP, graph matching via GNCGCP (Liu et al., 2013)), and logic-based model verification. The method bridges statistical heuristics with convex optimization principles and machine learning architectures, yielding scalable and robust solvers for a wide spectrum of nonconvex problems.