Article Training Design for Text-to-Image Models: Lessons from Ablations 8 days ago • 55
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis Paper • 2602.03139 • Published 8 days ago • 41
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss Paper • 2602.02493 • Published 9 days ago • 41
NOVA Collection NOVA: Autoregressive Video Generation without Vector Quantization • 6 items • Updated 7 days ago • 5
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands Paper • 2512.24965 • Published Dec 31, 2025 • 42
Running on Zero Featured Qwen3-TTS Demo 🎙 1.37k Generate speech from text with voice design, cloning, or speakers
Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 23 days ago • 37
Runtime error Featured Waypoint 1 Small 🎮 62 Explore and navigate through AI-generated worlds in real-time
Skywork-Unipic3 Collection Unified Multi-Image Composition with Sequence Modeling • 10 items • Updated 3 days ago • 12