DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 12 days ago • 41
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published Jun 19, 2025 • 130
The Ultra-Scale Playbook • 3.69k • The ultimate guide to training LLMs on large GPU clusters