Papers
Topics
Authors
Recent
Search
2000 character limit reached

Proximal Alternating Linearized Minimization Algorithm

Updated 6 January 2026
  • Proximal Alternating Linearized Minimization (PALM) is a structured block-coordinate algorithm for nonconvex, nonsmooth optimization using proximal steps and linearization.
  • It alternates updates via block-specific surrogate functions, ensuring global convergence under the Kurdyka–Łojasiewicz property.
  • TiBPALM extends PALM by incorporating two-step inertia and Bregman distance regularization to accelerate convergence in applications like sparse recovery and imaging.

The Proximal Alternating Linearized Minimization (PALM) algorithm and its advanced variants, such as the Two-step Inertial Bregman PALM (TiBPALM), constitute a robust class of block-coordinate methods for structured optimization. These methods address general nonconvex, nonsmooth, and nonseparable two-block problems of the form

minxRl,yRm  L(x,y)=f(x)+Q(x,y)+g(y)\min_{x \in \mathbb{R}^l, \, y \in \mathbb{R}^m} \; L(x, y) = f(x) + Q(x, y) + g(y)

where ff and gg are proper, lower-semicontinuous functions (possibly nonconvex and nonsmooth), and QQ is C1C^1 with Lipschitz continuous gradient. The PALM framework, particularly in its inertial and Bregman generalized forms, delivers global convergence guarantees under the Kurdyka–Łojasiewicz (KL) property, and allows block-specific generalizations critical for modern large-scale applications (Guo et al., 2023).

1. Class of Problems and Block Structure

The core setting involves partitioning the optimization variables into two (or, in some extensions, more) blocks, xx and yy, over which the objective L(x,y)=f(x)+Q(x,y)+g(y)L(x, y) = f(x) + Q(x, y) + g(y) is defined. Here,

  • f:Rl(,+]f: \mathbb{R}^l \to (-\infty, +\infty] and g:Rm(,+]g: \mathbb{R}^m \to (-\infty, +\infty] are (potentially nonconvex, nonsmooth) proper, lsc functions.
  • ff0 is ff1 in ff2 with ff3 Lipschitz continuous on bounded sets.
  • No separability of ff4 is assumed.

This abstraction is broad enough to encompass constrained and penalized data fitting in signal recovery, machine learning, imaging, and quadratic fractional programming.

2. Standard PALM Methodology

The fundamental PALM scheme operates by alternating linearized majorizations of the block-coupled term ff5 and applying blockwise proximal steps:

  • At iterate ff6, pick step-sizes ff7, ff8. Perform

ff9

gg0

This translates to block coordinate descent with local quadratic majorants. The PALM iteration admits broad regularity (no convexity of gg1, gg2, or gg3 needed beyond the outlined smoothness/lsc assumptions).

3. Inertial and Bregman Extensions: TiBPALM

To accelerate convergence and handle generic geometry, the TiBPALM algorithm incorporates:

  • Two-step inertia: Extrapolations of the form gg4 allow leverage of momentum from two preceding steps, generalizing standard heavy-ball approaches.
  • Bregman distance regularization: The squared norm is replaced with a Bregman kernel gg5, typically with a strongly convex, smooth gg6. This is critical when the geometry of proximal steps is non-Euclidean or a Euclidean prox is not easily solvable.

The TiBPALM update for block gg7 (similarly for gg8) is:

gg9

Momentum and geometry are thus jointly exploited.

4. Benefit Function and Convergence Analysis

A distinctive feature of TiBPALM is the introduction of a benefit function QQ0 across three consecutive iterates:

QQ1

This legitimizes sufficient decrease arguments accounting for the inertial terms.

Under strong convexity of the Bregman kernels QQ2 (constants QQ3) and appropriate upper bounds on the inertia parameters QQ4, one establishes the key decrease inequality:

QQ5

where QQ6 and QQ7.

The KL property is invoked to conclude global convergence: the whole sequence QQ8 has finite length and converges to a critical point of QQ9. This leverages the abstract KL-descent framework for general nonconvex, nonsmooth scenarios (Guo et al., 2023).

The inertial Bregman PALM framework synthesizes and generalizes several prior advancements:

  • iPALM (Inertial PALM) incorporates single-step momentum but uses standard prox-terms rather than Bregman distances (Pock et al., 2017).
  • Stochastic and variance-reduced schemes adapt PALM to finite-sum problems using variance-reduced estimators (SAGA, SARAH), with global convergence and accelerated rates established under similar KL-type assumptions (Guo et al., 2023, Driggs et al., 2020). The stochastic two-step inertial Bregman PALM (STiBPALM) extends TiBPALM to the stochastic regime.

A comparison is summarized below:

Method Inertia Proximal Geometry Stochasticity Reference
PALM None Euclidean No (Guo et al., 2023)
iPALM 1-step (heavy-ball) Euclidean No (Pock et al., 2017)
TiBPALM 2-step Bregman No (Guo et al., 2023)
STiBPALM 2-step Bregman Yes (Guo et al., 2023)

The advantage of Bregman distances becomes pronounced when prox-subproblems associated with convex indicator or regularizer functions become tractable in non-Euclidean geometry (e.g., Kullback–Leibler distance for nonnegativity, half-shrinkage for C1C^10).

6. Empirical Performance and Applications

Numerical studies confirm TiBPALM’s superiority over both noninertial PALM and one-step inertial variants (iPALM, GiPALM) on diverse nonconvex benchmarks:

TiBPALM consistently achieved faster decrease in the objective (fewer iterations and reduced wall-clock time for the same accuracy), attributable to the combination of two-step inertial extrapolation and Bregman geometry, and enabled efficient closed-form inner updates in practical instances (Guo et al., 2023).

7. Extensions and Theoretical Significance

The PALM paradigm, especially in its inertial and Bregman-extended forms, underpins a rich landscape of modern nonconvex optimization algorithms. It readily admits:

  • Block-separable multimodal generalizations (BPALM/A-BPALM) (Ahookhosh et al., 2019).
  • Variable metric and composite proximal variants for composite nonsmooth terms (Yashtini, 2022).
  • Inexact and infeasible subsolver frameworks (PALM-I) with surrogate sequences restoring descent (Hu et al., 2022).
  • Unrolled, learned, and interpretable deep optimization networks, where the entire PALM structure is encoded in a parameter-efficient, convergence-guaranteed architecture (Chen et al., 2024).

These developments establish PALM and its advanced variants as foundational tools for scalable, structured, and theoretically grounded optimization in modern computational mathematics and machine learning.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Proximal Alternating Linearized Minimization Algorithm.