Add Artificial Analysis evaluations for glm-4-6

#28
Files changed (1)
  1. README.md +51 -0
README.md CHANGED
@@ -5,6 +5,57 @@ language:
  library_name: transformers
  license: mit
  pipeline_tag: text-generation
+ model-index:
+ - name: GLM-4.6
+   results:
+   - task:
+       type: evaluation
+     dataset:
+       name: Artificial Analysis Benchmarks
+       type: artificial_analysis
+     metrics:
+     - name: Artificial Analysis Intelligence Index
+       type: artificial_analysis_intelligence_index
+       value: 44.7
+     - name: Artificial Analysis Coding Index
+       type: artificial_analysis_coding_index
+       value: 38.7
+     - name: Artificial Analysis Math Index
+       type: artificial_analysis_math_index
+       value: 44.3
+     - name: Mmlu Pro
+       type: mmlu_pro
+       value: 0.784
+     - name: Gpqa
+       type: gpqa
+       value: 0.632
+     - name: Hle
+       type: hle
+       value: 0.052
+     - name: Livecodebench
+       type: livecodebench
+       value: 0.561
+     - name: Scicode
+       type: scicode
+       value: 0.331
+     - name: Aime 25
+       type: aime_25
+       value: 0.443
+     - name: Ifbench
+       type: ifbench
+       value: 0.367
+     - name: Lcr
+       type: lcr
+       value: 0.263
+     - name: Terminalbench Hard
+       type: terminalbench_hard
+       value: 0.27
+     - name: Tau2
+       type: tau2
+       value: 0.769
+     source:
+       name: Artificial Analysis API
+       url: https://artificialanalysis.ai
  ---
  
  # GLM-4.6
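
The added metrics appear to mix two scales: the three `*_index` entries look like 0-100 scores, while the remaining entries look like fractions between 0 and 1. The diff does not state this, so treat it as an assumption. It is consistent with the Math Index (44.3) matching `aime_25` (0.443) once scaled, though that does not confirm it. A minimal Python sketch of putting the values on one scale for display follows. The helper names `to_percent` and `normalized` are hypothetical, and the values are copied from the diff above.

```python
# Metric values copied verbatim from the model-index block in this diff.
METRICS = {
    "artificial_analysis_intelligence_index": 44.7,
    "artificial_analysis_coding_index": 38.7,
    "artificial_analysis_math_index": 44.3,
    "mmlu_pro": 0.784,
    "gpqa": 0.632,
    "hle": 0.052,
    "livecodebench": 0.561,
    "scicode": 0.331,
    "aime_25": 0.443,
    "ifbench": 0.367,
    "lcr": 0.263,
    "terminalbench_hard": 0.27,
    "tau2": 0.769,
}


def to_percent(metric_type: str, value: float) -> float:
    """Return the value on a 0-100 scale.

    Assumption (not stated in the PR): metric types ending in "_index"
    are already on a 0-100 scale; every other metric is a 0-1 fraction.
    """
    if metric_type.endswith("_index"):
        return value
    return round(value * 100, 1)


def normalized(metrics: dict) -> dict:
    """Apply to_percent to every metric, preserving insertion order."""
    return {name: to_percent(name, value) for name, value in metrics.items()}


if __name__ == "__main__":
    for name, pct in normalized(METRICS).items():
        print(f"{name:<42} {pct:5.1f}")
```

If the scale assumption turns out to be wrong for any entry, only the `endswith("_index")` check in `to_percent` needs to change.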