From Perception to Action: An Interactive Benchmark for Vision Reasoning Paper • 2602.21015 • Published 1 day ago • 21
DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning Paper • 2602.16742 • Published 8 days ago • 7
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published 12 days ago • 43
view article Article Train AI models with Unsloth and Hugging Face Jobs for FREE +4 6 days ago • 72
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Paper • 2602.11858 • Published 14 days ago • 58
BitDance Collection BitDance: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model. • 11 items • Updated 4 days ago • 9
Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments Paper • 2602.11964 • Published 13 days ago • 12
Running MCP 177 Recommend Similar Papers 🌖 177 Get similar paper recommendations from a Hugging Face link
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published 14 days ago • 30
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published 13 days ago • 97