GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities Paper • 2507.12367 • Published Jul 16, 2025 • 7
How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published Jul 5, 2025 • 52
LineRetriever: Planning-Aware Observation Reduction for Web Agents Paper • 2507.00210 • Published Jun 30, 2025 • 6
Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation Paper • 2510.04373 • Published Oct 5, 2025
FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents Paper • 2510.03204 • Published Oct 3, 2025 • 7
Privileged Information Distillation for Language Models Paper • 2602.04942 • Published 3 days ago • 19
Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance Dec 9, 2025 • 82
Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models Nov 19, 2025 • 34
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 106
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 229