- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
  Paper • 2503.19470 • Published • 19
- Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
  Paper • 2503.09516 • Published • 36
- A Survey on Large Language Model Benchmarks
  Paper • 2508.15361 • Published • 20
- Search-o1: Agentic Search-Enhanced Large Reasoning Models
  Paper • 2501.05366 • Published • 102
yangdechuan
AI & ML interests
None yet
Recent Activity
new activity
13 days ago
yangdechuan/mt5-small-finetuned-amazon-en-es:Adding `safetensors` variant of this model
updated
a collection
5 months ago
LLM
Organizations
None yet