Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
34
73
109
Li Dong
unilm
Follow
wxy1988's profile picture
zirui3's profile picture
shuyuej's profile picture
52 followers
ยท
21 following
AI & ML interests
Language Model Pre-Training
Recent Activity
liked
a model
about 6 hours ago
microsoft/VibeVoice-ASR
authored
a paper
1 day ago
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
authored
a paper
1 day ago
Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts
View all activity
Organizations
Articles
1
Article
16
Differential Transformer V2
Papers
81
arxiv:
2601.08808
arxiv:
2511.10643
arxiv:
2510.26658
arxiv:
2510.24514
Expand 81 papers
spaces
1
Runtime error
4
Promptist
๐
models
0
None public yet
datasets
0
None public yet