Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shizhe Diao's picture
38 33 29

Shizhe Diao

shizhediao2
lysandre's profile picture research4pan's profile picture 21world's profile picture
ยท
https://shizhediao.github.io/
  • shizhediao
  • shizhediao
  • shizhediao

AI & ML interests

LLM pre-training and reasoning

Recent Activity

posted an update 3 days ago
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration https://huggingface.co/papers/2511.21689
reacted to di-zhang-fdu's post with ๐Ÿ”ฅ 3 days ago
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration https://huggingface.co/papers/2511.21689
upvoted a paper 3 days ago
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
View all activity

Organizations

NVIDIA's profile picture temp_math_data's profile picture UGPhysics's profile picture Data Filtering Challenge for Training Edge Language Models's profile picture brorl's profile picture

Posts 1

view post
Post
102
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689)

Articles 1

Article
17

Can Your LLM Think Like a Professional? Introducing ProfBench

models 3

shizhediao2/ToolOrchestrator-8B

Updated Oct 15 โ€ข 1

shizhediao2/Llama-Nemotron-8B-v1-Prorl

Updated Aug 25

shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B

Updated May 14

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs