Agents
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper
• 2507.15846
• Published
• 133
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
Paper
• 2508.05748
• Published
• 141
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Paper
• 2508.15144
• Published
• 64
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
• 2508.16153
• Published
• 160
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Paper
• 2509.01396
• Published
• 58
Agentic Entropy-Balanced Policy Optimization
Paper
• 2510.14545
• Published
• 106
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper
• 2510.16872
• Published
• 109
DeepAgent: A General Reasoning Agent with Scalable Toolsets
Paper
• 2510.21618
• Published
• 101
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Paper
• 2510.23587
• Published
• 67
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
Paper
• 2510.25726
• Published
• 46
Scaling Latent Reasoning via Looped Language Models
Paper
• 2510.25741
• Published
• 228
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch
Paper
• 2512.02395
• Published
• 49
Paper
• 2512.16301
• Published
• 106
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization
Paper
• 2601.05432
• Published
• 166
Kimi K2.5: Visual Agentic Intelligence
Paper
• 2602.02276
• Published
• 243
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
Paper
• 2601.22060
• Published
• 156
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation
Paper
• 2602.01756
• Published
• 22
Fine-T2I: An Open, Large-Scale, and Diverse Dataset for High-Quality T2I Fine-Tuning
Paper
• 2602.09439
• Published
• 13
Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling
Paper
• 2602.09084
• Published
• 27