James Hunter Carter PRO

jameshuntercarter

https://www.jameshuntercarter.com

platformkit

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

meituan-longcat/LongCat-Image

liked a Space 2 days ago

MCP-1st-Birthday/voicekit

liked a model 2 days ago

alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union

upvoted a collection about 2 months ago

Edit-R1

5 items • Updated Oct 21 • 7

upvoted a paper about 2 months ago

DreamOmni2: Multimodal Instruction-based Editing and Generation

Paper • 2510.06679 • Published Oct 8 • 73

upvoted a paper 2 months ago

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

Paper • 2509.17627 • Published Sep 22 • 66

upvoted a collection 4 months ago

MGM-Omni

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech • 18 items • Updated Oct 11 • 10

upvoted a paper 4 months ago

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

Paper • 2402.06894 • Published Feb 10, 2024 • 1

upvoted a collection 8 months ago

VACE

VACE: All-in-One Video Creation and Editing • 7 items • Updated May 15 • 34

upvoted a paper 8 months ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 138

upvoted a collection 9 months ago

LipSync and Face Operations

22 items • Updated Aug 25 • 60

upvoted 5 papers 10 months ago

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

Paper • 2502.13995 • Published Feb 19 • 9

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

Paper • 2502.07531 • Published Feb 11 • 13

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 106

Stable Flow: Vital Layers for Training-Free Image Editing

Paper • 2411.14430 • Published Nov 21, 2024 • 22

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published Feb 5 • 30

upvoted 5 collections about 1 year ago

Zero-Shot Voice Cloning

TTS models that support zero-shot voice cloning • 8 items • Updated 5 days ago • 14

steiner-preview

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 33

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated Apr 18 • 241

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 13 • 78

gazelle v0.2

2 items • Updated Mar 19, 2024 • 15

upvoted a paper over 1 year ago

Audio Dialogues: Dialogues dataset for audio and music understanding

Paper • 2404.07616 • Published Apr 11, 2024 • 16