SEGAgentRL

non-profit

AI & ML interests

We target improved agent reinforcement learning in terms of stability (S), efficiency (E), and generalization (G).

Recent Activity

dwenlong updated a collection about 8 hours ago

dwenlong updated a collection about 19 hours ago

dwenlong updated a collection about 19 hours ago

View all activity

SEGAgentRL 's models 9

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 20 hours ago • 24

SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base

Reinforcement Learning • 3B • Updated about 20 hours ago • 22

SEGAgentRL/LLDS-R-GSPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 20 hours ago • 26

SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 20 hours ago • 32

SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 20 hours ago • 25

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base

Reinforcement Learning • 3B • Updated about 20 hours ago • 17

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base-MA

Reinforcement Learning • 3B • Updated about 20 hours ago • 28

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base

Reinforcement Learning • 8B • Updated about 20 hours ago • 56 • 2

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins

Reinforcement Learning • 8B • Updated about 20 hours ago • 81 • 1