ZhuoweiChen (Zhuowei

upvoted a paper 3 months ago

MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation

Paper • 2510.18692 • Published Oct 21, 2025 • 40

upvoted 3 papers 4 months ago

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

Paper • 2509.17627 • Published Sep 22, 2025 • 66

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 128

USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning

Paper • 2508.18966 • Published Aug 26, 2025 • 56

upvoted a paper 5 months ago

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Paper • 2507.21033 • Published Jul 28, 2025 • 21

upvoted an article 6 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 745

upvoted a paper 6 months ago

LongAnimation: Long Animation Generation with Dynamic Global-Local Memory

Paper • 2507.01945 • Published Jul 2, 2025 • 76

upvoted a paper 7 months ago

Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset

Paper • 2506.18851 • Published Jun 23, 2025 • 30

upvoted 2 papers 9 months ago

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published Apr 20, 2025 • 51

D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation

Paper • 2504.09454 • Published Apr 13, 2025 • 11

upvoted a paper 11 months ago

Phantom: Subject-consistent video generation via cross-modal alignment

Paper • 2502.11079 • Published Feb 16, 2025 • 59

upvoted a collection 12 months ago

LISA++

Collection • Reasoning Segmentation • 6 items • Updated Dec 30, 2024 • 5
