Analytic expression for the NNGP kernel with tanh activation
Derive a closed-form analytic expression for the normalized covariance kernel κ(u) corresponding to the infinite-width Gaussian process (NNGP) limit of a fully connected neural network with hyperbolic tangent activation σ(x) = tanh(x), i.e., κ(u) = E[tanh(Z1) tanh(Z2)] / E[tanh(Z)^2] as a function of the correlation u ∈ [-1,1], where (Z1, Z2) are standard Gaussian random variables with correlation u. Establishing this expression would provide an explicit formula for the depth-1 limiting kernel K1(x,y) = κ(⟨x,y⟩) used to generate deeper kernels by composition.
References
The associated kernel is not known analytically (to the best of our knowledge) but \Cref{der_sigmoide} shows that the derivative at the origin is greater than one.
— Spectral complexity of deep neural networks
(2405.09541 - Lillo et al., 2024) in Section 4 (Numerical evidence), High-disorder case paragraph