Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
demo-leaderboard-backend/leaderboard
qualifire
/
EvalArena
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
EvalArena
/
src
/
judge.py
Commit History
hotfix - tsq table
d0c066f
dror44
commited on
May 28
wip
d43ab95
dror44
commited on
May 20
wip
6b070cd
dror44
commited on
May 17
Feature - pq
4403e4e
dror44
commited on
May 6
wip
df184ed
dror44
commited on
May 5
Hotfixes and benchmarks
5a05fa9
dror44
commited on
Apr 28
fixed the error issues
b4df4b9
dror44
commited on
Apr 27
wip
1efbc3f
dror44
commited on
Apr 26
added qualifire to the mix
45a014d
dror44
commited on
Apr 26
remove reasoning tokens
0bcfec8
dror44
commited on
Apr 24
fix confidences
982b157
dror44
commited on
Apr 24
more work
3df66f9
dror44
commited on
Apr 24
wip
94407ab
dror44
commited on
Apr 24
refactoring
b286969
dror44
commited on
Apr 23
refactoring
af28f6f
dror44
commited on
Apr 23