Running on CPU Upgrade Featured 2.87k The Smol Training Playbook 📚 2.87k The secrets to building world-class LLMs
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 Sep 18, 2024 • 272
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 593k • • 1.42k
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 178
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8, 2025 • 29