Legacy Concept Lab
Test-Time Compute & Inference Scaling
The paradigm behind o1: spend more compute at inference for harder problems
#74Test-TimeScaling & Alignment
key equation
\text{Quality} \sim \log(\text{inference compute})Phase 11: Frontier research & scalingConcept 74 of 100
Why It Matters for Modern Models
- The paradigm behind o1: spend more compute at inference for harder problems
- Enables adaptive compute: easy questions are fast, hard ones "think longer"
- May be more efficient than pure pretraining scaling for reasoning tasks
What Tutorials Skip
What is still poorly explained in textbooks and papers:
- Train-time and test-time compute are substitutes: you can trade one for the other
- Verifiers (reward models) let you search through many candidate solutions
- Tree search over reasoning steps explores the space of possible derivations
Interactive Visualization
Core Math (Optional Deep Dive)
If you want intuition first, start with the key equation and the visualization. Come back here for the full walkthrough.
Key Equation
Test-time scaling trades compute for quality at inference:
Best-of-N sampling: Generate N responses, select best via verifier:
Process Reward Models score intermediate steps:
Monte Carlo Tree Search for reasoning: