Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published 3 days ago • 31
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning Paper • 2511.20549 • Published 11 days ago • 23
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout Paper • 2511.20649 • Published 11 days ago • 43
REASONEDIT: Towards Reasoning-Enhanced Image Editing Models Paper • 2511.22625 • Published 9 days ago • 45
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 9 days ago • 145
Canvas-to-Image: Compositional Image Generation with Multimodal Controls Paper • 2511.21691 • Published 10 days ago • 32
Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy Paper • 2511.21579 • Published 10 days ago • 22
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation Paper • 2511.19320 • Published 12 days ago • 39
In-Video Instructions: Visual Signals as Generative Control Paper • 2511.19401 • Published 12 days ago • 29
Computer-Use Agents as Judges for Generative User Interface Paper • 2511.15567 • Published 17 days ago • 51
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents Paper • 2511.13593 • Published 19 days ago • 24
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations Paper • 2511.13703 • Published 19 days ago • 20
SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking Paper • 2511.16618 • Published 16 days ago • 7
First Frame Is the Place to Go for Video Content Customization Paper • 2511.15700 • Published 17 days ago • 52