PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE
Jie Cheng
jinachris
AI & ML interests
Reinforcement learning, LLM
Recent Activity
upvoted a paper 12 days ago
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters upvoted a paper 13 days ago
LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth liked
a model 22 days ago
stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S Organizations
None yet