v
ziqi7
AI & ML interests
None yet
Recent Activity
updated
a collection
about 1 month ago
RL&LLM Agent-强化学习
upvoted
a
paper
about 1 month ago
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes
Correct Reasoning in Base LLMs
liked
a model
2 months ago
Soul-AILab/SoulX-Podcast-1.7B
Organizations
None yet