BoyceYi

DeadFishhh

·

Yiozolm

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Qwen/Qwen3.5-397B-A17B

liked a model 5 days ago

openbmb/MiniCPM-SALA

upvoted a paper 5 days ago

MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

Paper • 2602.11761 • Published 6 days ago • 6

upvoted a paper 7 days ago

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 10 days ago • 65

upvoted a collection 2 months ago

GLM-4.6

7 items • Updated Nov 5, 2025 • 52

upvoted a collection 10 months ago

Gemma 3 QAT

Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated Jul 10, 2025 • 217