arxiv:2405.01573
Anmol Agarwal
anmolagarwal999
AI & ML interests
None yet
models (307)
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-30 • 0.5B • 2
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-20 • 0.5B • 1
anmolagarwal999/lora_gkd_run_20251212172042__checkpoint-10 • 0.5B • 1
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_560 • Text Generation • 0.5B • 6
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_550 • Text Generation • 0.5B • 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_540 • Text Generation • 0.5B • 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_530 • Text Generation • 0.5B • 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_520 • Text Generation • 0.5B • 4
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_510 • Text Generation • 0.5B • 6
anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_504 • Text Generation • 0.5B • 2
datasets (9)
anmolagarwal999/validation_countdown_sft_deepseek_qwen_distilled_32b_dataset_v2 • 4.37k • 3
anmolagarwal999/train_countdown_sft_deepseek_qwen_distilled_32b_dataset_v2 • 4.37k • 4
anmolagarwal999/qwq_rl_train_dataset_countdown_v2 • 4.37k • 1
anmolagarwal999/math_dataset_train_based_on_qwen_distilled_r1_32b • 3.64k • 1
anmolagarwal999/math_dataset_test_based_on_gt_reasoning_trace • 500 • 2
anmolagarwal999/math_dataset_train_based_on_gt_reasoning_trace • 3.64k • 3
anmolagarwal999/qwq_rl_train_dataset_countdown • 4.37k • 1
anmolagarwal999/validation_countdown_sft_deepseek_qwen_distilled_32b_dataset • 440 • 2
anmolagarwal999/train_countdown_sft_deepseek_qwen_distilled_32b_dataset • 2.72k • 3