PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published 5 days ago • 29
Thinking with Programming Vision: Towards a Unified View for Thinking with Images Paper • 2512.03746 • Published 3 days ago • 15
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 4 days ago • 25
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 5 days ago • 166
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published 16 days ago • 91
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 23 days ago • 92
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published 25 days ago • 193
Cambrian-S: Towards Spatial Supersensing in Video Paper • 2511.04670 • Published about 1 month ago • 36
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published about 1 month ago • 208
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30 • 81
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark Paper • 2510.26802 • Published Oct 30 • 33
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published Oct 16 • 47