koutch/qwen_falcon_qwen3-instruct-4b_train_sft_2.json Text Generation • 4B • Updated about 1 hour ago • 61
koutch/qwenb_falcon_6.json_train_grpo_v1_2.json Text Generation • 8B • Updated about 2 hours ago • 61
koutch/qwenb_falcon_qwen3-8b_train_grpo_v1_2.json Text Generation • 8B • Updated about 3 hours ago • 63
koutch/qwen_falcon_qwen3-instruct-4b_train_sft_0.json Text Generation • 4B • Updated about 3 hours ago • 116
koutch/qwen_falcon_6.json_train_grpo_v1_2.json Text Generation • 4B • Updated about 13 hours ago • 39
koutch/qwen_falcon_qwen3-instruct-4b_train_grpo_v1_2.json Text Generation • 4B • Updated 2 days ago • 37