Running Featured 32 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 32 Who needs 1T parameters? Olympiad proofs with a 4B model
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output 11 days ago • 20