Yuexi Shen
yuexishen
AI & ML interests
None yet
Recent Activity
Upvoted a paper 12 days ago: Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
Upvoted a paper 2 months ago: Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks
Updated a model 6 months ago: yuexishen/codellama-7b-humaneval-ppo-qlora
Organizations
None yet
Models: 11
yuexishen/codellama-7b-humaneval-ppo-qlora • Updated Jun 5
yuexishen/codellama-7b-instruct-humaneval-ppo-qlora • Updated Jun 5
yuexishen/codellama-7b-python-mbpp-grpo-qlora • Updated Jun 5
yuexishen/codellama-7b-python-mbpp-ppo-qlora • Updated Jun 5
yuexishen/codellama-7b-grpo-qlora • Updated Jun 3
yuexishen/deepseek-coder-7b-instruc-ppo-qlora • Updated Jun 2
yuexishen/deepseek-coder-7b-base-v1-ppo-qlora • Updated Jun 2
yuexishen/codellama-7b-mbpp-ppo-qlora • Updated Jun 1
yuexishen/codellama-7b-instruct-ppo-qlora • Updated Jun 1
yuexishen/Llama-3-8B-Instruct-Finance-RAG • Text Generation • 8B • Updated Jan 20 • 3
Datasets: 0
None public yet