zhu

xuekai

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

commented on a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

updated a model about 1 month ago

xuekai/FlowRL-DeepSeek-7B-code

Organizations

commented on a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114

commented on 2 papers 3 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114

commented on a paper 12 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 52

New activity in allenai/dolma over 1 year ago

JSON error when loading files of v1_6-sample using load_dataset

#22 opened almost 2 years ago by