Urodoc Oncall's picture

92 48

Urodoc Oncall

UDCAI

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation

upvoted a paper 3 days ago

Qwen3-VL Technical Report

upvoted a paper 4 days ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

View all activity

Organizations

upvoted a paper 2 days ago

Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation

Paper • 2512.04678 • Published 3 days ago • 31

upvoted a paper 3 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 10 days ago • 106

upvoted 2 papers 4 days ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Paper • 2511.20549 • Published 11 days ago • 23

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published 11 days ago • 43

upvoted 2 papers 6 days ago

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published 9 days ago • 45

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 9 days ago • 145

upvoted 2 papers 9 days ago

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published 10 days ago • 32

Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy

Paper • 2511.21579 • Published 10 days ago • 22

upvoted 2 papers 11 days ago

Fara-7B: An Efficient Agentic Model for Computer Use

Paper • 2511.19663 • Published 12 days ago • 10

SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation

Paper • 2511.19320 • Published 12 days ago • 39

upvoted 4 papers 12 days ago

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published 12 days ago • 29

HunyuanVideo 1.5 Technical Report

Paper • 2511.18870 • Published 13 days ago • 22

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published 13 days ago • 155

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published 17 days ago • 51

upvoted 2 papers 13 days ago

O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents

Paper • 2511.13593 • Published 19 days ago • 24

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published 16 days ago • 105

upvoted a paper 14 days ago

Generalist Foundation Models Are Not Clinical Enough for Hospital Operations

Paper • 2511.13703 • Published 19 days ago • 20

upvoted 3 papers 16 days ago

SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking

Paper • 2511.16618 • Published 16 days ago • 7

First Frame Is the Place to Go for Video Content Customization

Paper • 2511.15700 • Published 17 days ago • 52

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published 17 days ago • 51