MercedeSnape's Collections

RL agent

updated 2 days ago

  • Scaling Agent Learning via Experience Synthesis

    Paper • 2511.03773 • Published Nov 5, 2025 • 81

    Note: for online RL training, "distilled into an experience model"


  • ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

    Paper • 2511.21689 • Published Nov 26, 2025 • 111