arxiv:2503.09662
Zikai Zhou
Klayand
AI & ML interests
Knowledge Distillation, Generated Models
Recent Activity
upvoted a paper 2 days ago
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better
upvoted a paper 5 days ago
Mano: Restriking Manifold Optimization for LLM Training
upvoted a paper 5 days ago
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers
Organizations
None yet