Chebyshev Polynomial Approximations

Updated 9 February 2026

Chebyshev polynomial approximations are techniques that express functions through sums of Chebyshev polynomials, ensuring near-optimal interpolation and rapid convergence on bounded intervals.
They leverage key properties such as orthogonality, a three-term recurrence, and efficient node selection to minimize errors like the Runge phenomenon in numerical integration and spectral methods.
Applications include solving differential equations, performing signal processing on graphs, and stabilizing deep network layers, with extensions to multivariate and rational approximations.

Chebyshev polynomial approximations refer to the representation and approximation of functions on bounded intervals, most classically $[-1, 1]$ , by sums or expansions involving Chebyshev polynomials. These polynomials possess favorable extremal, orthogonality, and computational properties, resulting in efficient schemes for interpolation, numerical integration, spectral methods for differential equations, signal processing, and machine learning.

1. Definition and Fundamental Properties of Chebyshev Polynomials

The Chebyshev polynomials of the first kind, $T_n(x)$ , are defined by

$T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$

They satisfy the three-term recurrence: $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ These polynomials solve the Sturm–Liouville equation: $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ exhibiting a deep connection to harmonic analysis. Orthogonality holds under the Chebyshev weight: $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ A related set, the Chebyshev polynomials of the second kind, $U_n(x)$ , are given by

$U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},$

orthogonal with respect to $\sqrt{1-x^2}$ .

The generating function is: $G(x,t) = \sum_{n=0}^\infty T_n(x) t^n = \frac{1-x\,t}{1 - 2x\,t + t^2}.$

Parseval’s identity holds: $T_n(x)$ 0 for the Chebyshev expansion coefficients $T_n(x)$ 1 (Karjanto, 2020).

2. Chebyshev Series and Interpolation

Any $T_n(x)$ 2 can be expressed as a (possibly infinite) Chebyshev series: $T_n(x)$ 3 with coefficients

$T_n(x)$ 4

Chebyshev interpolation at the so-called Chebyshev–Gauss–Lobatto nodes: $T_n(x)$ 5 gives near-minimax polynomial interpolation and mitigates the Runge phenomenon exhibited by equispaced interpolation.

The interpolant is: $T_n(x)$ 6 where coefficients $T_n(x)$ 7 are computed via a discrete cosine transform: $T_n(x)$ 8 Uniform error estimates for $T_n(x)$ 9 functions: $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 0 where $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 1 (Karjanto, 2020).

3. Convergence and Error Rates

Smooth Functions: For functions analytic in a Bernstein ellipse, Chebyshev coefficients decay exponentially, and the partial sum error satisfies $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 2 for some $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 3 (Tang et al., 2019).

Functions with Bounded Variation: When $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 4 is of bounded variation, Chebyshev coefficients $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 5 decay as $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 6, and the $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 7 error for degree $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 8 approximation satisfies $T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots$ 9 with explicit constants (Akansha, 2024).

Endpoint Singularities and Basis Choice: For $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 0 with $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 1, Chebyshev, difference, and quadratic-factor basis coefficients decay asymptotically as $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 2, $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 3, and $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 4 respectively with $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 5. Standard Chebyshev truncations incur boundary-layer errors, while bases encoding Dirichlet BC yield uniform error distribution (Zhang et al., 2021).

Taylor-like Bounds: For example, for the Chebyshev expansion of $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 6 on $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 7, explicit upper and lower polynomial approximants $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 8 for $T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).$ 9 can be derived via auxiliary inequalities involving Bessel functions and Chebyshev polynomials of both kinds (Wodecki, 2024).

4. Multivariate and Weighted Chebyshev Approximation

Bivariate Approximations: For $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 0 on $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 1, the expansion

$\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 2

converges uniformly when $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 3, and coefficients are given by double orthogonality integrals. Fast algorithms use the 2D FFT on Chebyshev nodes. The uniform remainder decays at $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 4 with $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 5 the polynomial degrees, with sharper $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 6, $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 7 coefficient decay in pure directions (Scheiber, 2015).

Hyperbolic Cross and Numerical Differentiation: High-dimensional differentiation is stabilized by truncating Chebyshev expansions to hyperbolic crosses. For $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 8 in bivariate weighted Wiener classes $\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,$ 9, specific choices of truncation parameter $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 0 minimize the total error, resulting in error bounds in weighted $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 1 norms of explicit algebraic form in terms of the noise level and smoothness parameters (Kyselov et al., 30 Jan 2026).

Adaptive Partitioning: Adaptive partition-of-unity frameworks recursively split domains, fitting low-degree tensor-product Chebyshev expansions locally and combining via smooth bump functions, yielding a global $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 2 approximation with spectral or near-spectral convergence, automatic anisotropy adaptation, and performance advantages especially in higher dimensions or for functions with localized sharp features (Aiton et al., 2018).

5. Applications and Algorithms

Spectral Methods for PDEs and BVPs: Chebyshev collocation methods solve high-order boundary value problems by reducing to first-order systems, expanding unknowns in Chebyshev series, and collocating at Chebyshev clustered nodes. The solution converges spectrally, with direct imposition of boundary conditions and efficient differentiation via sparse recurrence matrices (Bhowmik, 2014).

Distributed Signal and Graph Processing: Shifted and scaled Chebyshev polynomials approximate graph filters $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 3, avoiding spectral decompositions. With $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 4 the graph Laplacian, $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 5, so that its spectrum fits $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 6. The matrix polynomial

$\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 7

can be efficiently and fully distributedly evaluated via the three-term recurrence. Error decays rapidly for smooth filters; cost scales as $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 8 for sparse graphs (Shuman et al., 2011).

Stable Deep Networks: Chebyshev coefficient truncation yields robust function approximation layers in deep networks (ChebNets). These constructions achieve spectral accuracy with depth $\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}$ 9, width $U_n(x)$ 0, and conditioning $U_n(x)$ 1, outperforming power-series-based RePU architectures for large $U_n(x)$ 2 in both stability and accuracy (Tang et al., 2019).

Alias-free Differentiation: Least-squares constrained mock-Chebyshev operators use a subset of nodes mimicking Chebyshev-Lobatto points, combining interpolation and regression to control the operator norm and reduce the Runge phenomenon; derivative approximation (even of high order) is accurate up to $U_n(x)$ 3 derivatives for $U_n(x)$ 4 data points (Dell'Accio et al., 2022).

Rational and Hermite–Chebyshev Theories: Rational Chebyshev approximants, including (linear/nonlinear) Hermite–Chebyshev and Padé–Chebyshev constructions, extend polynomial approximation to quotient spaces, balancing uniform accuracy with specialized properties (e.g., simultaneous interpolation, endpoint constraints, or best rational approximation under shrinking domains). These approaches admit explicit determinantal formulas and connect closely to classical rational-approximation theory (Jawecki, 2024, Starovoitov et al., 21 Jul 2025).

6. Error Bounds, Filtering, and Computational Aspects

Tail Probability and Monomials: The Chebyshev expansion of $U_n(x)$ 5 provides a truncation error expressible exactly as a tail sum of binomial coefficients, with a probabilistic interpretation: the error is twice the probability that a symmetric random walk deviates by more than $U_n(x)$ 6 from $U_n(x)$ 7 steps. Using Hoeffding bounds, $U_n(x)$ 8, so error decays subexponentially in $U_n(x)$ 9 (Saibaba, 2021).

Filtered Interpolation: Applying de la Vallée Poussin (VP) filters to Chebyshev interpolation controls the Lebesgue constant and attains uniform convergence in weighted Jacobi norms. The filtered interpolants maintain near-best approximation error with explicit necessary and sufficient conditions on Jacobi weights; increasing the filter strength mitigates the Gibbs phenomenon while preserving global convergence rates (Occorsio et al., 2020).

Efficient Polynomial Evaluation and Root-Finding: The Clenshaw algorithm provides $U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},$ 0 evaluation of Chebyshev expansions. Interval ball-arithmetic variants control error growth (quadratic rather than exponential in $U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},$ 1) when evaluating on intervals, enabling rigorous root isolation schemes with complexity $U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},$ 2 in the worst case and $U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},$ 3 practical performance for well-separated roots (Ledoux et al., 2019).

Weighted and Regularized Minimax Approximation: In estimation problems (e.g., for support size), weighted Chebyshev polynomial approximation (with or without regularization) optimally trades bias and variance, yielding efficient convex programs with $U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},$ 4 dimension and matching minimax rates for suitable choices of weight (I et al., 2019).

7. Extensions and Generalizations

Generalized Chebyshev-II and Sobolev Orthogonality: The Chebyshev polynomials of the second kind and their generalizations admit expansions in the Bernstein basis, possess orthogonality under Sobolev-type measures (including point masses at endpoints), and enable interpolation and approximation results that connect to $U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},$ 5 spaces and weighted polynomial inequalities (AlQudah, 2015).

Uniform Approximation for D-finite and Complex-Valued Functions: Rigorous Chebyshev expansion methods for D-finite functions, utilizing block-Clenshaw algorithms and validated functional enclosures, provide uniform (near-minimax) approximations with explicit complexity and error bounds, covering solutions to linear ODEs with polynomial coefficients (Benoit et al., 2014).

Multiseries Hermite–Chebyshev Approximants: The theory of linear and nonlinear Hermite–Chebyshev rational approximations gives determinant-based existence and uniqueness criteria even in the case of multiple (possibly vector-valued) functions, reducing the problem to full-rank conditions on structured Hankel–Toeplitz matrices (Starovoitov et al., 21 Jul 2025).

Chebyshev polynomial approximations, encompassing both theoretical and algorithmic aspects, provide one of the most effective frameworks for the stable, rapidly convergent, and computationally efficient approximation of functions on bounded intervals. Their impact spans classical numerical analysis, numerical PDEs, signal processing on graphs, and modern machine learning architectures, with continuing extensions to multivariate domains, non-classical weights, generalized orthogonalities, and rational function approximations.