- MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models
  Paper • 2511.18373 • Published • 5
- Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
  Paper • 2511.13288 • Published • 17
- Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
  Paper • 2511.19418 • Published • 26
- SAM 3: Segment Anything with Concepts
  Paper • 2511.16719 • Published • 109
Innocent Emmanuel
EL102
AI & ML interests
None yet
Recent Activity
Updated a collection 12 days ago: My thing
Updated a collection 13 days ago: My thing
Updated a collection 14 days ago: My thing
Organizations
None yet