- Towards General Agentic Intelligence via Environment Scaling
  Paper • 2509.13311 • Published • 71
- Establishing Best Practices for Building Rigorous Agentic Benchmarks
  Paper • 2507.02825 • Published • 1
- LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts
  Paper • 2510.19363 • Published • 61
- ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
  Paper • 2510.18941 • Published • 7
Shang Hong Sim
shanghong
AI & ML interests
Neural decoding, neuroengineering, signal processing
Recent Activity
upvoted an article 21 days ago
Welcome GPT OSS, the new open-source model family from OpenAI!
updated a collection about 1 month ago
gold_datasets
updated a collection about 1 month ago
to read