Legacy Concept Lab
Reproducing Kernel Hilbert Spaces
Unifies SVMs, Gaussian processes, kernel regression, and NTK under one framework
#71RKHSTheory
key equation
f(x) = \langle f, k(x, \cdot) \rangle_{\mathcal{H}}Phase 10: Mathematical foundations & information geometryConcept 71 of 100
Why It Matters for Modern Models
- Unifies SVMs, Gaussian processes, kernel regression, and NTK under one framework
- Explains why "similarity functions" must be positive definite—they define inner products
- NTK shows neural networks are kernel machines in the infinite-width limit
What Tutorials Skip
What is still poorly explained in textbooks and papers:
- Kernels implicitly define an (often infinite-dimensional) feature space
- Positive definiteness = you can build a Hilbert space where the kernel is an inner product
- Attention can be viewed as a learned, data-dependent kernel
Interactive Visualization
Core Math (Optional Deep Dive)
If you want intuition first, start with the key equation and the visualization. Come back here for the full walkthrough.
Key Equation
Reproducing property: evaluation is an inner product:
Kernel trick: inner product in feature space without explicit computation:
Mercer decomposition (spectral):
Representer theorem: optimal function is a linear combination of kernel evaluations: