haoyu wang

haoyuw

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

upvoted a paper 1 day ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

upvoted a paper 2 months ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Papers 2

arxiv:2602.01511

arxiv:2510.07743

Models 3

haoyuw/Qwen2.5-1.5B-Math-Instruct-LIMO-Rewrite

Text Generation • 2B • Updated Mar 8, 2025 • 2

haoyuw/Qwen2.5-1.5B-Math-Instruct-LIMO

Text Generation • 2B • Updated Mar 8, 2025 • 4

haoyuw/Qwen2.5-1.5B-Instruct-LIMO

Text Generation • 2B • Updated Mar 8, 2025 • 1

Datasets 5

haoyuw/cn_math_2024

Viewer • Updated Jun 30, 2025 • 30 • 2

haoyuw/aime

Viewer • Updated May 22, 2025 • 30 • 2

haoyuw/minerva

Viewer • Updated May 7, 2025 • 272 • 3

haoyuw/olympiad_bench

Viewer • Updated May 7, 2025 • 675 • 1

haoyuw/minervamath_latex

Viewer • Updated Mar 24, 2025 • 272 • 5