Collections
Collections including paper arxiv:2507.12956
-
ObjectClear: Complete Object Removal via Object-Effect Attention
Paper • 2505.22636 • Published • 2
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
Paper • 2507.12956 • Published • 24
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Paper • 2407.16982 • Published • 42
-
GaussianSpeech: Audio-Driven Gaussian Avatars
Paper • 2411.18675 • Published
Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis
Paper • 2502.04128 • Published • 27
MOSPA: Human Motion Generation Driven by Spatial Audio
Paper • 2507.11949 • Published • 24
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
Paper • 2507.12956 • Published • 24
-
One Shot, One Talk: Whole-body Talking Avatar from a Single Image
Paper • 2412.01106 • Published • 24
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Paper • 2412.04448 • Published • 10
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
Paper • 2412.14963 • Published • 6
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Paper • 2502.01061 • Published • 222
-
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper • 2312.13578 • Published • 29
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper • 2312.13150 • Published • 16
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper • 2312.03029 • Published • 26
Relightable Gaussian Codec Avatars
Paper • 2312.03704 • Published • 33
-
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models
Paper • 2506.19851 • Published • 60
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
Paper • 2507.12956 • Published • 24
AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation
Paper • 2506.03126 • Published • 22
-
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping
Paper • 2412.11279 • Published • 13
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control
Paper • 2501.02260 • Published • 5
GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor
Paper • 2501.09978 • Published • 6
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
Paper • 2502.13995 • Published • 9
-
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
Paper • 2410.10306 • Published • 56
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
Paper • 2411.05003 • Published • 71
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
Paper • 2411.04709 • Published • 26
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Paper • 2410.07171 • Published • 43