7 13 15

Massimo Caccia

optimass

https://optimass.github.io/

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

ServiceNow/WorkArena-Instances:Update instances.json

upvoted an article about 2 months ago

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

new activity 2 months ago

ServiceNow/WorkArena-Instances:Update instances.json

View all activity

Organizations

upvoted an article about 2 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Dec 9, 2025

•

upvoted an article 2 months ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Nov 19, 2025

•

upvoted a paper 3 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 106

upvoted 2 papers 5 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 229

upvoted an article 7 months ago

Article

How to Train Your LLM Web Agent: A Statistical Diagnosis

Jul 8, 2025

•

upvoted a paper 7 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5, 2025 • 52

upvoted an article 8 months ago

Article

GRPO for GUI Grounding Done Right

Jun 11, 2025

•

upvoted an article 9 months ago

Article

PipelineRL

Apr 25, 2025

•

upvoted 2 papers about 1 year ago

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 24

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21

upvoted a paper over 1 year ago

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Paper • 2406.11811 • Published Jun 17, 2024 • 16

upvoted a paper almost 2 years ago

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 51

Massimo Caccia

AI & ML interests

Recent Activity

Organizations

optimass's activity

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

How to Train Your LLM Web Agent: A Statistical Diagnosis

GRPO for GUI Grounding Done Right

PipelineRL