Legacy Concept Lab
Iterated Amplification
Concrete proposal for scalable oversight when AI exceeds human capability
#87IDAScaling & Alignment
key equation
A' = \arg\min_\pi \mathrm{KL}(\text{Amp}(H,A) \| \pi)Phase 12: Advanced alignment & safety researchConcept 87 of 100
Why It Matters for Modern Models
- Concrete proposal for scalable oversight when AI exceeds human capability
- Human decomposes task, assistants solve subtasks, distill back
- Foundational to modern AI safety research
What Tutorials Skip
What is still poorly explained in textbooks and papers:
- Like teaching: break hard problems into pieces students can help with
- Distillation compresses the amplified procedure into single model
- Each iteration enables supervision of harder tasks
Interactive Visualization
Core Math (Optional Deep Dive)
If you want intuition first, start with the key equation and the visualization. Come back here for the full walkthrough.
Key Equation
Amplify human with assistants , then distill:
Then iterate: .
Recursion: as improves, becomes more capable.