2 5

Nan

Sirius518

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

upvoted a paper 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

upvoted a paper 3 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

View all activity

Organizations

None yet

upvoted a paper 10 days ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published 10 days ago • 64

upvoted a paper 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 19

upvoted a paper 3 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83

upvoted a paper 4 months ago

Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels

Paper • 2509.16596 • Published Sep 20, 2025 • 14

New activity in Sirius518/NovelSum 7 months ago

Rename novelselect/weighted_select_10k_figure.json to novelselect/weighted_select_10k_embedding.json

#1 opened 7 months ago by

Umean

Update README.md

#2 opened 7 months ago by

Umean

updated a dataset 7 months ago

Sirius518/NovelSum

Preview • Updated Jun 17, 2025 • 1.72k • 2

published a dataset 7 months ago

Sirius518/NovelSum

Preview • Updated Jun 17, 2025 • 1.72k • 2

upvoted a paper 8 months ago

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44

Nan

AI & ML interests

Recent Activity

Organizations

Sirius518's activity

Rename novelselect/weighted_select_10k_figure.json to novelselect/weighted_select_10k_embedding.json

Update README.md