One-step Latent-free Image Generation with Pixel Mean Flows Paper • 2601.22158 • Published 2 days ago • 6
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 3 days ago • 83
Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory Paper • 2601.16296 • Published 9 days ago • 27
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published 9 days ago • 13
Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published 10 days ago • 42
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper • 2601.09499 • Published 17 days ago • 9
Inference-time Physics Alignment of Video Generative Models with Latent World Models Paper • 2601.10553 • Published 16 days ago • 12
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 17 days ago • 32
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices Paper • 2601.08303 • Published 18 days ago • 16
DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving Paper • 2601.01528 • Published 27 days ago • 19
Orient Anything V2: Unifying Orientation and Rotation Understanding Paper • 2601.05573 • Published 22 days ago • 9
Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals Paper • 2601.05848 • Published 22 days ago • 16
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published 22 days ago • 23
Guiding a Diffusion Transformer with the Internal Dynamics of Itself Paper • 2512.24176 • Published Dec 30, 2025 • 8
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection Paper • 2512.23273 • Published Dec 29, 2025 • 14
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation Paper • 2512.23705 • Published Dec 29, 2025 • 45
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published Dec 26, 2025 • 60