Collections
Collections including paper arxiv:2503.10970
-
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Paper • 2503.10970 • Published • 18 -
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 300 -
Grounding Computer Use Agents on Human Demonstrations
Paper • 2511.07332 • Published • 104
-
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
Paper • 2304.09842 • Published • 2 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 30 -
Gorilla: Large Language Model Connected with Massive APIs
Paper • 2305.15334 • Published • 5 -
Reflexion: Language Agents with Verbal Reinforcement Learning
Paper • 2303.11366 • Published • 5
-
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Paper • 2412.14161 • Published • 51 -
Training Software Engineering Agents and Verifiers with SWE-Gym
Paper • 2412.21139 • Published • 24 -
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Paper • 2412.19723 • Published • 87 -
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
Paper • 2408.00764 • Published • 1
-
Crowdsourced Evaluation
🌍 1 • Evaluate model responses for clinical accuracy and relevance
-
mims-harvard/TxAgent-T1-Llama-3.1-8B
Text Generation • 8B • Updated • 1.05k • 30 -
mims-harvard/ToolRAG-T1-GTE-Qwen2-1.5B
2B • Updated • 1.28k • 11 -
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Paper • 2503.10970 • Published • 18