Stateful Language Models, Supervised Finetuned from Qwen3
xyliu
xiaoyuanliu
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
StateLM
updated
a collection
1 day ago
StateLM
updated
a collection
1 day ago
StateLM
Organizations
None yet
HELMET-Eval
21 subsets of HELMET evaluation datasets
-
xiaoyuanliu/HELMET_icl_nlu_8296shot_balance__eval
Viewer • Updated • 500 • 2 -
xiaoyuanliu/HELMET_icl_banking77_5900shot_balance__eval
Viewer • Updated • 500 • 1 -
xiaoyuanliu/HELMET_icl_trec_fine_6400shot_balance__eval
Viewer • Updated • 500 • 1 -
xiaoyuanliu/HELMET_icl_trec_coarse_6600shot_balance__eval
Viewer • Updated • 500 • 1
StateLM
Stateful Language Models, Supervised Finetuned from Qwen3
HELMET-Eval
21 subsets of HELMET evaluation datasets
-
xiaoyuanliu/HELMET_icl_nlu_8296shot_balance__eval
Viewer • Updated • 500 • 2 -
xiaoyuanliu/HELMET_icl_banking77_5900shot_balance__eval
Viewer • Updated • 500 • 1 -
xiaoyuanliu/HELMET_icl_trec_fine_6400shot_balance__eval
Viewer • Updated • 500 • 1 -
xiaoyuanliu/HELMET_icl_trec_coarse_6600shot_balance__eval
Viewer • Updated • 500 • 1
models
88
xiaoyuanliu/StateLM-4B-SFT
Text Generation
•
4B
•
Updated
•
11
xiaoyuanliu/StateLM-14B-SFT
Text Generation
•
15B
•
Updated
•
11
xiaoyuanliu/StateLM-8B-SFT
Text Generation
•
8B
•
Updated
•
21
xiaoyuanliu/Qwen3-30B-A3B-SFT-V4_OPT
Text Generation
•
31B
•
Updated
•
6
xiaoyuanliu/Qwen2.5-1.5B-simplerl-ppo-verifier
Text Generation
•
2B
•
Updated
•
2
xiaoyuanliu/Qwen2.5-3B-simplerl-ppo-verifier
Text Generation
•
3B
•
Updated
•
2
xiaoyuanliu/Qwen2.5-7B-simplerl-ppo-verifier
Text Generation
•
8B
•
Updated
•
2
xiaoyuanliu/Qwen3-4B-SFT-V2.1-ml.16K-lr.1e-5-ep.3
Text Generation
•
4B
•
Updated
•
1
xiaoyuanliu/Qwen3-8B-SFT-V2.1-ml.16K-lr.1e-5-ep.3
Text Generation
•
8B
•
Updated
•
1
xiaoyuanliu/Qwen3-8B-SFT-V2.1-ml.16K-lr.1e-5-ep1
Updated
datasets
71
xiaoyuanliu/mmlu-redux
Viewer
•
Updated
•
3k
•
30
xiaoyuanliu/LongBench-v2-verified
Viewer
•
Updated
•
503
•
7
xiaoyuanliu/claude4-agentic-samples-V4-opt-swift-format-500
Viewer
•
Updated
•
500
•
33
xiaoyuanliu/claude4-agentic-samples-V4-opt-swift-format
Viewer
•
Updated
•
35.7k
•
19
xiaoyuanliu/V4-BAScan-Warmup360
Viewer
•
Updated
•
7.17k
•
5
xiaoyuanliu/longmemeval-s
Viewer
•
Updated
•
500
•
57
xiaoyuanliu/LongBench-v2-rlvr
Viewer
•
Updated
•
503
•
13
xiaoyuanliu/LongBench-v2-T100
Viewer
•
Updated
•
100
•
4
xiaoyuanliu/V4-BA-Warmup300
Viewer
•
Updated
•
3.72k
•
1
xiaoyuanliu/claude4-agentic-samples-V4-Balanced
Viewer
•
Updated
•
28.6k
•
7