ZiYi Yang

AALF

https://github.com/yangzy39

yangzy39

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago

Tongyi-Zhiwen/QwenLong-L1.5-30B-A3B: Is it multi lingual as usual?

new activity about 1 month ago

Tongyi-Zhiwen/QwenLong-L1.5-30B-A3B: Replication

Organizations

authored a paper about 1 month ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 107

authored a paper 4 months ago

SPELL: Self-Play Reinforcement Learning for evolving Long-Context Language Models

Paper • 2509.23863 • Published Sep 28, 2025 • 2

authored 3 papers 5 months ago

ThinkSwitcher: When to Think Hard, When to Think Fast

Paper • 2505.14183 • Published May 20, 2025 • 1

Mutual-Taught for Co-adapting Policy and Reward Models

Paper • 2506.06292 • Published May 17, 2025

FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion

Paper • 2504.06562 • Published Apr 9, 2025

authored a paper 8 months ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

authored a paper 11 months ago

FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion

Paper • 2503.04222 • Published Mar 6, 2025 • 15

authored a paper about 1 year ago

Weighted-Reward Preference Optimization for Implicit Model Fusion

Paper • 2412.03187 • Published Dec 4, 2024 • 12

authored a paper almost 2 years ago

FuseChat: Knowledge Fusion of Chat Models

Paper • 2402.16107 • Published Feb 25, 2024 • 39