Fan Zhou's picture

Fan Zhou

koalazf99

·

https://koalazf99.github.io/

AI & ML interests

Deep Learning; Natural Language Processing; Foundation Models

Recent Activity

upvoted a paper about 1 month ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

upvoted a paper about 1 month ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

upvoted a paper 3 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

View all activity

Organizations

upvoted 2 papers about 1 month ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29 • 45

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22 • 19

upvoted a paper 3 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 78

authored a paper 5 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 47

New activity in OctoThinker/MegaMath-Web-Pro-Max 5 months ago

[bot] Conversion to Parquet

#3 opened 5 months ago by

parquet-converter

liked a dataset 5 months ago

OctoThinker/MegaMath-Web-Pro-Max

Viewer • Updated Jul 6 • 69.2M • 7.2k • 37

updated a collection 5 months ago

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling • 18 items • Updated Jun 26 • 2

upvoted a paper 5 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 47

updated a collection 5 months ago

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling • 18 items • Updated Jun 26 • 2

updated a collection 6 months ago

🧙 Guru

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective • 4 items • Updated Jun 20

authored a paper 6 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

upvoted a paper 6 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

liked 2 datasets 6 months ago

princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18 • 500 • 596k • 233

LLM360/guru-RL-92k

Viewer • Updated Aug 20 • 91.9k • 1.89k • 40

upvoted 2 papers 6 months ago

Thinking with Generated Images

Paper • 2505.22525 • Published May 28 • 15

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 104

New activity in LLM360/MegaMath 7 months ago

Megamath-code parquets do not contain text column

#6 opened 7 months ago by