-
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 101 -
Robot Learning from a Physical World Model
Paper • 2511.07416 • Published • 29 -
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Paper • 2511.06805 • Published • 12 -
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms
Paper • 2511.17592 • Published • 118
Harihara Valliappan
HarishValliappan
·
AI & ML interests
None yet
Recent Activity
updated
a collection
4 days ago
RL
updated
a collection
5 days ago
RL
upvoted
a
paper
5 days ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Organizations
None yet