Rewiring Experts on the Fly:Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models Paper • 2510.14853 • Published Oct 16 • 4
How Far Are We from Genuinely Useful Deep Research Agents? Paper • 2512.01948 • Published 9 days ago • 52
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Aug 25 • 82
AndesVL Collection AndesVL is a suite of mobile-optimized Multimodal Large Language Models (MLLMs) with 0.6B to 4B parameters. • 8 items • Updated Oct 15 • 12
Qwen-Image-Pruning Collection Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers • 7 items • Updated 19 days ago • 5