Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhu's picture
5 33 1

zhu

xuekai
Charlie-LChen's profile picture lindsay-qu's profile picture XingtaiHF's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago
P1: Mastering Physics Olympiads with Reinforcement Learning
commented on a paper about 1 month ago
FlowRL: Matching Reward Distributions for LLM Reasoning
updated a model about 1 month ago
xuekai/FlowRL-DeepSeek-7B-code
View all activity

Organizations

TsinghuaC3I's profile picture

commented a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114 •
8
commented 2 papers 3 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114 •
8

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114 •
8
commented a paper 12 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 52 •
4
New activity in allenai/dolma over 1 year ago

JSON ERROR in loading files of v1_6-sample using load_dataset

2
#22 opened almost 2 years ago by
sakurapeng
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs