Variational Autoencoders & Variational Inference
Canonical Papers
Auto-Encoding Variational Bayes
Read paper →Core Mathematics
Latent variable model with intractable posterior. Introduce variational encoder and maximize ELBO:
Reparameterization trick for Gaussian encoder:
Key Equation
Interactive Visualization
Why It Matters for Modern Models
- Stable Diffusion is a latent diffusion model: an autoencoder maps images ↔ compressed latent space where diffusion operates
- VAEs underpin many multimodal encoders (audio, video latents) used as building blocks
Missing Intuition
What is still poorly explained in textbooks and papers:
- Intuitive grasp of why ELBO works as both reconstruction + regularization
- Visualizations of how the prior p(z) and posterior families affect sample quality/diversity