Legacy Concept Lab
World Models & Model-Based RL
Sample-efficient RL: learn from imagined experience, not just real data
#76World ModelsTheory
key equation
\hat{s}_{t+1} = f_\theta(s_t, a_t)Phase 10: Mathematical foundations & information geometryConcept 76 of 100
Why It Matters for Modern Models
- Sample-efficient RL: learn from imagined experience, not just real data
- DreamerV3 achieves superhuman Atari with 100× less data than model-free methods
- Foundation for planning-based AI: simulate futures before acting
What Tutorials Skip
What is still poorly explained in textbooks and papers:
- World models let agents "imagine" consequences without taking real actions
- Latent space prediction is easier than pixel prediction—compress, then predict
- Model error compounds over long horizons—need careful uncertainty handling
Interactive Visualization
Core Math (Optional Deep Dive)
If you want intuition first, start with the key equation and the visualization. Come back here for the full walkthrough.
Key Equation
A world model learns to predict future states:
Latent world model (DreamerV3):
- Encoder:
- Dynamics:
- Decoder:
Planning in imagination:
Train policy entirely in the learned model ("dreaming").