RULER Datasets
Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
new activity
about 3 hours ago
Nanbeige/Nanbeige4.1-3B:Add evaluation results for GPQA, HLE
liked
a model
about 3 hours ago
Nanbeige/Nanbeige4.1-3B
new activity
about 3 hours ago
MiniMaxAI/MiniMax-M2.5:Add evaluation results for GPQA, HLE