Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 2 days ago • 77
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale Paper • 2601.22146 • Published 1 day ago • 4
Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning Paper • 2601.20829 • Published 2 days ago • 5
VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning Paper • 2601.20055 • Published 3 days ago • 6
Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning Paper • 2601.20209 • Published 3 days ago • 20
EvolVE: Evolutionary Search for LLM-based Verilog Generation and Optimization Paper • 2601.18067 • Published 5 days ago • 4
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning Paper • 2411.07279 • Published Nov 11, 2024 • 4
Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family Paper • 2504.18225 • Published Apr 25, 2025 • 15
Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models Paper • 2512.06266 • Published Dec 6, 2025 • 5
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 9 days ago • 176
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 4 days ago • 38
PowerInfer-2: Fast Large Language Model Inference on a Smartphone Paper • 2406.06282 • Published Jun 10, 2024 • 39
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU Paper • 2312.12456 • Published Dec 16, 2023 • 45
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation Paper • 2409.02098 • Published Sep 3, 2024 • 3
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment Paper • 2510.07743 • Published Oct 9, 2025 • 10