Papers
Topics
Authors
Recent
Search
2000 character limit reached

Chebyshev Polynomial Approximations

Updated 9 February 2026
  • Chebyshev polynomial approximations are techniques that express functions through sums of Chebyshev polynomials, ensuring near-optimal interpolation and rapid convergence on bounded intervals.
  • They leverage key properties such as orthogonality, a three-term recurrence, and efficient node selection to minimize errors like the Runge phenomenon in numerical integration and spectral methods.
  • Applications include solving differential equations, performing signal processing on graphs, and stabilizing deep network layers, with extensions to multivariate and rational approximations.

Chebyshev polynomial approximations refer to the representation and approximation of functions on bounded intervals, most classically [1,1][-1, 1], by sums or expansions involving Chebyshev polynomials. These polynomials possess favorable extremal, orthogonality, and computational properties, resulting in efficient schemes for interpolation, numerical integration, spectral methods for differential equations, signal processing, and machine learning.

1. Definition and Fundamental Properties of Chebyshev Polynomials

The Chebyshev polynomials of the first kind, Tn(x)T_n(x), are defined by

Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots

They satisfy the three-term recurrence: T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x). These polynomials solve the Sturm–Liouville equation: ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0, exhibiting a deep connection to harmonic analysis. Orthogonality holds under the Chebyshev weight: 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases} A related set, the Chebyshev polynomials of the second kind, Un(x)U_n(x), are given by

Un(x)=sin((n+1)arccosx)sin(arccosx),U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},

orthogonal with respect to 1x2\sqrt{1-x^2}.

The generating function is: G(x,t)=n=0Tn(x)tn=1xt12xt+t2.G(x,t) = \sum_{n=0}^\infty T_n(x) t^n = \frac{1-x\,t}{1 - 2x\,t + t^2}.

Parseval’s identity holds: Tn(x)T_n(x)0 for the Chebyshev expansion coefficients Tn(x)T_n(x)1 (Karjanto, 2020).

2. Chebyshev Series and Interpolation

Any Tn(x)T_n(x)2 can be expressed as a (possibly infinite) Chebyshev series: Tn(x)T_n(x)3 with coefficients

Tn(x)T_n(x)4

Chebyshev interpolation at the so-called Chebyshev–Gauss–Lobatto nodes: Tn(x)T_n(x)5 gives near-minimax polynomial interpolation and mitigates the Runge phenomenon exhibited by equispaced interpolation.

The interpolant is: Tn(x)T_n(x)6 where coefficients Tn(x)T_n(x)7 are computed via a discrete cosine transform: Tn(x)T_n(x)8 Uniform error estimates for Tn(x)T_n(x)9 functions: Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots0 where Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots1 (Karjanto, 2020).

3. Convergence and Error Rates

Smooth Functions: For functions analytic in a Bernstein ellipse, Chebyshev coefficients decay exponentially, and the partial sum error satisfies Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots2 for some Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots3 (Tang et al., 2019).

Functions with Bounded Variation: When Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots4 is of bounded variation, Chebyshev coefficients Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots5 decay as Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots6, and the Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots7 error for degree Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots8 approximation satisfies Tn(x)=cos(narccosx),x[1,1], n=0,1,2,T_n(x) = \cos(n\,\arccos x), \qquad x \in [-1,1],\ n=0,1,2,\dots9 with explicit constants (Akansha, 2024).

Endpoint Singularities and Basis Choice: For T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).0 with T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).1, Chebyshev, difference, and quadratic-factor basis coefficients decay asymptotically as T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).2, T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).3, and T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).4 respectively with T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).5. Standard Chebyshev truncations incur boundary-layer errors, while bases encoding Dirichlet BC yield uniform error distribution (Zhang et al., 2021).

Taylor-like Bounds: For example, for the Chebyshev expansion of T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).6 on T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).7, explicit upper and lower polynomial approximants T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).8 for T0(x)=1,T1(x)=x,Tn+1(x)=2xTn(x)Tn1(x).T_0(x)=1,\qquad T_1(x)=x,\qquad T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x).9 can be derived via auxiliary inequalities involving Bessel functions and Chebyshev polynomials of both kinds (Wodecki, 2024).

4. Multivariate and Weighted Chebyshev Approximation

Bivariate Approximations: For ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,0 on ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,1, the expansion

ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,2

converges uniformly when ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,3, and coefficients are given by double orthogonality integrals. Fast algorithms use the 2D FFT on Chebyshev nodes. The uniform remainder decays at ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,4 with ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,5 the polynomial degrees, with sharper ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,6, ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,7 coefficient decay in pure directions (Scheiber, 2015).

Hyperbolic Cross and Numerical Differentiation: High-dimensional differentiation is stabilized by truncating Chebyshev expansions to hyperbolic crosses. For ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,8 in bivariate weighted Wiener classes ddx((1x2)dydx)+n2y=0,\frac{d}{dx}\left((1-x^2)\frac{dy}{dx}\right) + n^2 y = 0,9, specific choices of truncation parameter 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}0 minimize the total error, resulting in error bounds in weighted 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}1 norms of explicit algebraic form in terms of the noise level and smoothness parameters (Kyselov et al., 30 Jan 2026).

Adaptive Partitioning: Adaptive partition-of-unity frameworks recursively split domains, fitting low-degree tensor-product Chebyshev expansions locally and combining via smooth bump functions, yielding a global 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}2 approximation with spectral or near-spectral convergence, automatic anisotropy adaptation, and performance advantages especially in higher dimensions or for functions with localized sharp features (Aiton et al., 2018).

5. Applications and Algorithms

Spectral Methods for PDEs and BVPs: Chebyshev collocation methods solve high-order boundary value problems by reducing to first-order systems, expanding unknowns in Chebyshev series, and collocating at Chebyshev clustered nodes. The solution converges spectrally, with direct imposition of boundary conditions and efficient differentiation via sparse recurrence matrices (Bhowmik, 2014).

Distributed Signal and Graph Processing: Shifted and scaled Chebyshev polynomials approximate graph filters 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}3, avoiding spectral decompositions. With 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}4 the graph Laplacian, 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}5, so that its spectrum fits 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}6. The matrix polynomial

11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}7

can be efficiently and fully distributedly evaluated via the three-term recurrence. Error decays rapidly for smooth filters; cost scales as 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}8 for sparse graphs (Shuman et al., 2011).

Stable Deep Networks: Chebyshev coefficient truncation yields robust function approximation layers in deep networks (ChebNets). These constructions achieve spectral accuracy with depth 11Tm(x)Tn(x)dx1x2={π,m=n=0, π2,m=n1, 0,mn.\int_{-1}^1 T_m(x)\,T_n(x)\frac{dx}{\sqrt{1-x^2}} = \begin{cases} \pi, & m=n=0, \ \frac{\pi}{2}, & m=n\ge1, \ 0, & m\ne n. \end{cases}9, width Un(x)U_n(x)0, and conditioning Un(x)U_n(x)1, outperforming power-series-based RePU architectures for large Un(x)U_n(x)2 in both stability and accuracy (Tang et al., 2019).

Alias-free Differentiation: Least-squares constrained mock-Chebyshev operators use a subset of nodes mimicking Chebyshev-Lobatto points, combining interpolation and regression to control the operator norm and reduce the Runge phenomenon; derivative approximation (even of high order) is accurate up to Un(x)U_n(x)3 derivatives for Un(x)U_n(x)4 data points (Dell'Accio et al., 2022).

Rational and Hermite–Chebyshev Theories: Rational Chebyshev approximants, including (linear/nonlinear) Hermite–Chebyshev and Padé–Chebyshev constructions, extend polynomial approximation to quotient spaces, balancing uniform accuracy with specialized properties (e.g., simultaneous interpolation, endpoint constraints, or best rational approximation under shrinking domains). These approaches admit explicit determinantal formulas and connect closely to classical rational-approximation theory (Jawecki, 2024, Starovoitov et al., 21 Jul 2025).

6. Error Bounds, Filtering, and Computational Aspects

Tail Probability and Monomials: The Chebyshev expansion of Un(x)U_n(x)5 provides a truncation error expressible exactly as a tail sum of binomial coefficients, with a probabilistic interpretation: the error is twice the probability that a symmetric random walk deviates by more than Un(x)U_n(x)6 from Un(x)U_n(x)7 steps. Using Hoeffding bounds, Un(x)U_n(x)8, so error decays subexponentially in Un(x)U_n(x)9 (Saibaba, 2021).

Filtered Interpolation: Applying de la Vallée Poussin (VP) filters to Chebyshev interpolation controls the Lebesgue constant and attains uniform convergence in weighted Jacobi norms. The filtered interpolants maintain near-best approximation error with explicit necessary and sufficient conditions on Jacobi weights; increasing the filter strength mitigates the Gibbs phenomenon while preserving global convergence rates (Occorsio et al., 2020).

Efficient Polynomial Evaluation and Root-Finding: The Clenshaw algorithm provides Un(x)=sin((n+1)arccosx)sin(arccosx),U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},0 evaluation of Chebyshev expansions. Interval ball-arithmetic variants control error growth (quadratic rather than exponential in Un(x)=sin((n+1)arccosx)sin(arccosx),U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},1) when evaluating on intervals, enabling rigorous root isolation schemes with complexity Un(x)=sin((n+1)arccosx)sin(arccosx),U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},2 in the worst case and Un(x)=sin((n+1)arccosx)sin(arccosx),U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},3 practical performance for well-separated roots (Ledoux et al., 2019).

Weighted and Regularized Minimax Approximation: In estimation problems (e.g., for support size), weighted Chebyshev polynomial approximation (with or without regularization) optimally trades bias and variance, yielding efficient convex programs with Un(x)=sin((n+1)arccosx)sin(arccosx),U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},4 dimension and matching minimax rates for suitable choices of weight (I et al., 2019).

7. Extensions and Generalizations

Generalized Chebyshev-II and Sobolev Orthogonality: The Chebyshev polynomials of the second kind and their generalizations admit expansions in the Bernstein basis, possess orthogonality under Sobolev-type measures (including point masses at endpoints), and enable interpolation and approximation results that connect to Un(x)=sin((n+1)arccosx)sin(arccosx),U_n(x) = \frac{\sin((n+1)\arccos x)}{\sin(\arccos x)},5 spaces and weighted polynomial inequalities (AlQudah, 2015).

Uniform Approximation for D-finite and Complex-Valued Functions: Rigorous Chebyshev expansion methods for D-finite functions, utilizing block-Clenshaw algorithms and validated functional enclosures, provide uniform (near-minimax) approximations with explicit complexity and error bounds, covering solutions to linear ODEs with polynomial coefficients (Benoit et al., 2014).

Multiseries Hermite–Chebyshev Approximants: The theory of linear and nonlinear Hermite–Chebyshev rational approximations gives determinant-based existence and uniqueness criteria even in the case of multiple (possibly vector-valued) functions, reducing the problem to full-rank conditions on structured Hankel–Toeplitz matrices (Starovoitov et al., 21 Jul 2025).


Chebyshev polynomial approximations, encompassing both theoretical and algorithmic aspects, provide one of the most effective frameworks for the stable, rapidly convergent, and computationally efficient approximation of functions on bounded intervals. Their impact spans classical numerical analysis, numerical PDEs, signal processing on graphs, and modern machine learning architectures, with continuing extensions to multivariate domains, non-classical weights, generalized orthogonalities, and rational function approximations.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (19)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Chebyshev Polynomial Approximations.