Optimal Constants in Dimension Reduction
- Optimal constants in dimension reduction quantify the precise tradeoff between embedding dimension, distortion level, and set size, and are central to the Johnson–Lindenstrauss lemma.
- These results establish asymptotically tight bounds and explicit constants for both linear and nonlinear mappings in high-dimensional data analysis.
- Innovative methods like convex hull distortion and majorizing measures underpin these findings, influencing practical strategies such as terminal embeddings.
Optimal constants in dimension reduction refer to the precise dependencies and best possible constants in the tradeoff between target embedding dimension, allowable distortion, and set size for mapping high-dimensional data into Euclidean (or other normed) spaces. This concept is central in theoretical computer science, geometric functional analysis, and high-dimensional data analysis, and is especially associated with the Johnson–Lindenstrauss (JL) lemma and its optimality boundaries. Over the past decades, sharp results have established both asymptotic optimality and specific constant factors for a variety of dimension-reduction tasks, addressing linear, nonlinear, and so-called terminal embedding regimes, as well as average- and worst-case distortion measures.
1. Classical Johnson–Lindenstrauss Bound and Optimality
The Johnson–Lindenstrauss lemma states that for any $n$-point subset $X \subset \mathbb{R}^d$ and any $\varepsilon \in (0,1)$, there exists a (typically linear) map $f : \mathbb{R}^d \to \mathbb{R}^m$ such that

$$(1-\varepsilon)\|x-y\|_2^2 \le \|f(x)-f(y)\|_2^2 \le (1+\varepsilon)\|x-y\|_2^2 \quad \text{for all } x,y \in X,$$

provided

$$m = O\!\left(\varepsilon^{-2}\log n\right).$$
Both upper and lower bounds have been sharpened to show that no method, linear or otherwise, can guarantee a smaller $m$, up to universal constants, for worst-case pairwise distortion (Larsen et al., 2014, Bartal et al., 2021). The best-known explicit constant for the classical JL Gaussian (or $\pm 1$) random projection construction is approximately 4:

$$m \approx 4\,\varepsilon^{-2}\ln n$$

to achieve distortion at most $\varepsilon$ for all pairs with high probability (0806.4422).
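A minimal Python sketch (NumPy only) illustrates the classical Gaussian construction with the explicit constant 4 quoted above: it projects $n$ points with a random Gaussian matrix of roughly $4\varepsilon^{-2}\ln n$ rows and empirically measures the worst pairwise distortion. The helper names (`jl_dimension`, `gaussian_jl`, `worst_pairwise_distortion`) are illustrative and not taken from the cited papers.

```python
import numpy as np

def jl_dimension(n, eps, const=4.0):
    """Target dimension m ~ const * eps^{-2} * ln(n) for the classical Gaussian JL map."""
    return int(np.ceil(const * np.log(n) / eps**2))

def gaussian_jl(X, eps, seed=None):
    """Project rows of X with an m x d Gaussian matrix scaled by 1/sqrt(m)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    m = jl_dimension(n, eps)
    Pi = rng.standard_normal((m, d)) / np.sqrt(m)
    return X @ Pi.T

def worst_pairwise_distortion(X, Y):
    """Max over pairs of | ||f(x)-f(y)|| / ||x-y|| - 1 |."""
    worst = 0.0
    n = len(X)
    for i in range(n):
        for j in range(i + 1, n):
            orig = np.linalg.norm(X[i] - X[j])
            emb = np.linalg.norm(Y[i] - Y[j])
            worst = max(worst, abs(emb / orig - 1.0))
    return worst

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((200, 1000))   # n = 200 points in R^1000
    eps = 0.25
    Y = gaussian_jl(X, eps, seed=1)
    print("m =", Y.shape[1], "worst distortion =", worst_pairwise_distortion(X, Y))
```

With these parameters the empirical worst-case distortion typically comes out near or below $\varepsilon$, consistent with the union-bound analysis behind the constant 4.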
2. Dimension Reduction: Lower Bounds, Hard Instances, and Average-Case Tightness
Lower bounds for dimension reduction, originally for linear maps, have been established via explicit probabilistic and geometric constructions. For linear maps preserving all pairwise distances up to distortion $1 \pm \varepsilon$, it was shown that

$$m = \Omega\!\left(\min\{d,\ \varepsilon^{-2}\log n\}\right)$$

is necessary in the worst case (Larsen et al., 2014). Explicit hard sets demonstrating tightness include unions of the standard basis and carefully constructed Gaussian clouds.
For average-case and $q$-moment distortion measures (e.g., stress, energy, and relative error), Bartal, Fandina, and Larsen proved that the JL bound is again optimal. Any (linear or nonlinear) map with $q$-moment average distortion at most $1+\varepsilon$ on $n$ points must have

$$m = \Omega\!\left(\varepsilon^{-2}\log n\right)$$

for all moments $q$, and for any practical distortion criterion used in applications (Bartal et al., 2021). These results show that no asymptotic improvement over JL is possible for any of these practical embedding objectives.
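As a quick numerical illustration of the $\min\{d,\ \varepsilon^{-2}\log n\}$ threshold, the short sketch below tabulates when a nontrivial reduction below the ambient dimension $d$ is even possible. The constant `c = 1.0` is a placeholder, since the absolute constant in the lower bound is not pinned down here.

```python
import math

def jl_lower_bound(n, d, eps, c=1.0):
    """Order-of-magnitude lower bound m = Omega(min(d, eps^{-2} log n)); c is a placeholder constant."""
    return min(d, math.ceil(c * math.log(n) / eps**2))

for n in (10**3, 10**6):
    for eps in (0.5, 0.1, 0.01):
        for d in (100, 10**4):
            m = jl_lower_bound(n, d, eps)
            status = "reduction possible" if m < d else "no reduction below d"
            print(f"n={n:>7} d={d:>6} eps={eps:<5} lower bound m >= {m:>7}  ({status})")
```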
3. Terminal Embeddings and Resolution of Open Questions
A “terminal embedding” is a strengthening of the JL lemma in which the map must preserve distances not just within a finite subset $X \subset \mathbb{R}^d$ but between every $x \in X$ and all $y \in \mathbb{R}^d$:

$$(1-\varepsilon)\|x-y\|_2 \le \|f(x)-f(y)\|_2 \le (1+\varepsilon)\|x-y\|_2 \quad \text{for all } x \in X,\ y \in \mathbb{R}^d.$$

Originally, known constructions for terminal embeddings required a larger dimension, with a worse dependence on $\varepsilon$ [MMMR18]. Narayanan and Nelson (Narayanan et al., 2018) demonstrated that the optimal JL bound persists even in this stronger regime, proving it suffices to take

$$m = O\!\left(\varepsilon^{-2}\log n\right),$$

closing an open problem. Their construction uses a nonlinear map based on random subgaussian matrices and a convex-hull distortion principle, showing that with high probability a single random matrix approximately preserves the norm of every vector in the convex hull of the normalized difference vectors of $X$, up to additive error $\varepsilon$.
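The convex-hull distortion property can be probed empirically. The sketch below is illustrative only (the name `check_convex_hull_distortion` is not from the paper): it samples random convex combinations of normalized difference vectors and measures how far a Gaussian projection moves their norms. Random sampling only probes the property; it is not a substitute for the uniform guarantee in the proof.

```python
import numpy as np

def normalized_differences(X):
    """All unit vectors (x_i - x_j)/||x_i - x_j|| for i != j."""
    diffs = []
    n = len(X)
    for i in range(n):
        for j in range(n):
            if i != j:
                d = X[i] - X[j]
                diffs.append(d / np.linalg.norm(d))
    return np.array(diffs)

def check_convex_hull_distortion(X, Pi, trials=5000, seed=None):
    """Empirical max over sampled z in conv(T) of | ||Pi z|| - ||z|| |."""
    rng = np.random.default_rng(seed)
    T = normalized_differences(X)
    worst = 0.0
    for _ in range(trials):
        idx = rng.choice(len(T), size=5, replace=False)  # random convex combination
        w = rng.random(5)
        w /= w.sum()
        z = w @ T[idx]
        worst = max(worst, abs(np.linalg.norm(Pi @ z) - np.linalg.norm(z)))
    return worst

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((50, 300))
    m = 200
    Pi = rng.standard_normal((m, 300)) / np.sqrt(m)
    print("empirical convex-hull distortion:", check_convex_hull_distortion(X, Pi, seed=1))
```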
Table: Comparison of Terminal and Classical JL Bounds

| Context | Required $m$ | Optimality Status |
|---|---|---|
| JL (within-set only) | $\Theta(\varepsilon^{-2}\log n)$ | Tight (linear/nonlinear) |
| Terminal embedding | $C\,\varepsilon^{-2}\log n$ | Tight (Narayanan et al., 2018) |
Here $C$ is an absolute constant originating in subgaussian concentration machinery and convex-geometric tail bounds, estimated to be in the range 10–100 for current proofs, with the optimal value remaining an open question (Narayanan et al., 2018).
4. Explicit and Algorithmic Aspects of Leading Constants
Precise constants in JL-type bounds are critical for applications and for understanding the information-theoretic threshold. The best-known explicit constant for the Gaussian JL random projection is approximately 4 for the classical case (0806.4422). The quantile-based estimator for distance estimation using stable random projections yields a larger leading constant (0806.4422), and is strictly weaker than the optimal arithmetic mean estimator.
Current optimality proofs for the terminal and average-case settings maintain the same $\varepsilon^{-2}\log n$ dependence but do not optimize the leading constant $C$, whose explicit value is difficult to refine due to the reliance on majorizing measure bounds and subgaussian tail estimates (Narayanan et al., 2018, Larsen et al., 2014). There remains a research direction to reduce $C$ closer to 2 or 3, which would further close the constant-factor gap relevant for practical deployment.
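To see why the leading constant matters in practice, the brief sketch below compares the target dimension under the classical constant 4 and under hypothetical constants of 10 and 100 (the range quoted above for current terminal-embedding proofs); the function name and chosen parameters are illustrative.

```python
import math

def target_dim(n, eps, const):
    """m = const * eps^{-2} * ln(n), rounded up."""
    return math.ceil(const * math.log(n) / eps**2)

n, eps = 10**6, 0.1
for const in (4, 10, 100):
    print(f"constant {const:>3}: m = {target_dim(n, eps, const):>8}")
# With n = 1e6 and eps = 0.1, the target dimension ranges from a few thousand
# to over one hundred thousand, so improving the constant is not cosmetic.
```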
5. Methodological Innovations: Net Arguments, Majorizing Measures, and Outer Extensions
Recent progress relies on several technical innovations:
- Convex hull distortion: By considering all directions in the convex hull of normalized difference vectors for , Narayanan and Nelson achieved simultaneous preservation of an exponential number of directions with only logarithmic union bound overhead (Narayanan et al., 2018).
- Refined outer extension: The map is constructed by solving a minimax optimization (von Neumann style) to extend the JL map beyond $X$ to all of $\mathbb{R}^d$ while maintaining distance preservation; a simplified numerical sketch of this step appears after this list.
- Majorizing measure ($\gamma_2$) functional: The analysis harnesses the $\gamma_2$ functional to control uniform deviations of subgaussian processes over exponentially large index sets.
These methods close the previous gap in the $\varepsilon$-exponent for the terminal regime, yielding sharp inequalities without inflating the dependence on $\varepsilon$ or $n$.
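Below is a minimal numerical sketch of the outer-extension idea, under the simplifying assumption that the min-max step can be approximated by a generic constrained solver (the actual construction handles it exactly via a von Neumann-type minimax argument). All function names here are illustrative and not from the paper.

```python
import numpy as np
from scipy.optimize import minimize

def outer_extend(X, fX, u):
    """Embed a new point u into R^{m+1}, given terminals X (rows) and their
    embedding fX (rows). Simplified stand-in for the minimax outer extension."""
    i = int(np.argmin(np.linalg.norm(X - u, axis=1)))   # nearest terminal x*
    x_star, fx_star = X[i], fX[i]
    r = np.linalg.norm(u - x_star)

    # z - f(x*) should reproduce the inner products <u - x*, x - x*> for all x in X
    targets = (X - x_star) @ (u - x_star)

    def worst_mismatch(z):
        return np.max(np.abs((fX - fx_star) @ (z - fx_star) - targets))

    # keep ||z - f(x*)|| <= ||u - x*|| so the extra coordinate below is real
    cons = [{"type": "ineq", "fun": lambda z: r**2 - np.sum((z - fx_star) ** 2)}]
    res = minimize(worst_mismatch, x0=fx_star.copy(), constraints=cons, method="SLSQP")
    z = res.x
    extra = np.sqrt(max(r**2 - float(np.sum((z - fx_star) ** 2)), 0.0))
    return np.append(z, extra)                           # point in R^{m+1}

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((30, 100))                   # terminal set
    m = 40
    Pi = rng.standard_normal((m, 100)) / np.sqrt(m)      # JL map for the terminals
    fX = X @ Pi.T
    u = rng.standard_normal(100)                         # arbitrary non-terminal point
    fu = outer_extend(X, fX, u)
    fX_lift = np.hstack([fX, np.zeros((len(X), 1))])      # terminals get a 0 last coordinate
    errs = [abs(np.linalg.norm(fu - fX_lift[j]) / np.linalg.norm(u - X[j]) - 1)
            for j in range(len(X))]
    print("max relative distance error, u to terminals:", max(errs))
```

The accuracy of this sketch depends on how well the generic solver handles the non-smooth max objective; a smooth least-squares surrogate would also serve for illustration.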
6. Implications and Open Problems
Optimality of the JL lemma for all distortion criteria (worst-case, average-case, and $q$-moment) establishes the dimension $\Theta(\varepsilon^{-2}\log n)$ as the universal information-theoretic threshold for Euclidean embedding, regardless of algorithmic approach, linearity, or practical distortion objective (Bartal et al., 2021). This remains true for both linear and nonlinear embeddings. In practical terms, JL is the gold-standard baseline for all dimension-reduction methods and heuristics.
Notable open questions include:
- Sharpening leading constants: Can the absolute constant in $m = C\,\varepsilon^{-2}\log n$ be reduced below current bounds, especially for terminal embeddings?
- Derandomization: Can one construct deterministic, polynomial-time algorithms achieving the optimal JL dimension in practical or terminal settings?
- Extensions to other norms: Extending sharp bounds and methodological tools to $\ell_p$ spaces or other embedding settings remains partly unresolved (Narayanan et al., 2018).
- Non-Euclidean or structured scenarios: Generalizing optimality results to structured or restricted isometry regimes continues to be an active topic.
A plausible implication is that any heuristic or data-dependent method for dimensionality reduction should be rigorously benchmarked against the JL threshold both asymptotically and in terms of explicit constants in regimes relevant to modern large-scale data analysis.
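As a concrete benchmarking recipe in that spirit, the sketch below measures the worst-case pairwise distortion achieved by an arbitrary candidate embedding and compares its dimension with the JL target for that distortion, using the explicit constant 4 quoted earlier. The function names (`worst_case_distortion`, `jl_target_dim`, `benchmark`) and the PCA example are illustrative, not prescribed by the cited works.

```python
import numpy as np
from scipy.spatial.distance import pdist

def worst_case_distortion(X, Y):
    """Worst relative error of pairwise distances under the embedding X -> Y."""
    dX, dY = pdist(X), pdist(Y)
    return float(np.max(np.abs(dY / dX - 1.0)))

def jl_target_dim(n, eps, const=4.0):
    """JL threshold m ~ const * eps^{-2} * ln(n) with the explicit constant above."""
    return int(np.ceil(const * np.log(n) / eps**2))

def benchmark(X, Y):
    """Compare a candidate embedding Y of X against the JL threshold for the
    distortion it actually achieves."""
    eps = worst_case_distortion(X, Y)
    m_used, m_jl = Y.shape[1], jl_target_dim(len(X), eps)
    print(f"achieved eps = {eps:.3f}, used m = {m_used}, JL target m = {m_jl}")

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((300, 500))
    # candidate data-dependent heuristic: truncated SVD / PCA to 50 coordinates
    Xc = X - X.mean(axis=0)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    benchmark(X, Xc @ Vt[:50].T)
```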
7. Summary Table: Optimal Dimension-Reduction Constants
| Setting | Dimension Bound (Asymptotic) | Leading Constant | Universality | References |
|---|---|---|---|---|
| Worst-case (JL) | $\Theta(\varepsilon^{-2}\log n)$ | $\approx 4$ (Gaussian) | Linear & nonlinear | (Larsen et al., 2014; 0806.4422) |
| Average distortion | $\Theta(\varepsilon^{-2}\log n)$ | Absolute, best known | Any method | (Bartal et al., 2021) |
| Terminal embedding | $O(\varepsilon^{-2}\log n)$ | $10$–$100$ (current proofs) | Nonlinear | (Narayanan et al., 2018) |
| Additive error | $\Theta(\varepsilon^{-2}\log n)$ | $40$ (bipartite) | JL-type | (Alon et al., 2016) |
All settings exhibit the $\varepsilon^{-2}\log n$ scaling, matching upper and lower bounds up to universal constant factors. The only residual uncertainties pertain to the precise numerical constants and to deterministic constructions. There are no known settings in Euclidean dimension reduction where the JL threshold can be beaten.