shuoxing/qwen2-5-7b-full-sft-control-tweet-1m-en-reproduce-bs128 Text Generation • 333k • Updated 10 days ago • 16
shuoxing/qwen2-5-7b-full-sft-mix-high-tweet-1m-en-reproduce-bs128 Text Generation • 333k • Updated 10 days ago • 13
shuoxing/qwen2-5-7b-full-sft-mix-mid-tweet-1m-en-reproduce-bs128 Text Generation • 333k • Updated 11 days ago • 13
shuoxing/qwen2-5-7b-full-sft-mix-low-tweet-1m-en-reproduce-bs128 Text Generation • 333k • Updated 11 days ago • 11
shuoxing/qwen3-4b-full-sft-control-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 11 days ago • 13
shuoxing/qwen3-4b-full-sft-mix-high-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 11 days ago • 16
shuoxing/qwen3-4b-full-sft-mix-mid-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 11 days ago • 14
shuoxing/qwen3-4b-full-sft-mix-low-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 11 days ago • 16
shuoxing/qwen2-5-0.5b-full-sft-control-tweet-1m-en-reproduce-bs64 Text Generation • 0.5B • Updated 11 days ago • 20
shuoxing/qwen2-5-0.5b-full-sft-mix-high-tweet-1m-en-reproduce-bs64 Text Generation • 0.5B • Updated 11 days ago • 14
shuoxing/qwen2-5-0.5b-full-sft-mix-mid-tweet-1m-en-reproduce-bs64 Text Generation • 0.5B • Updated 11 days ago • 15
shuoxing/qwen2-5-0.5b-full-sft-mix-low-tweet-1m-en-reproduce-bs64 Text Generation • 0.5B • Updated 11 days ago • 10
shuoxing/qwen2-5-0.5b-full-pretrain-control-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated 11 days ago • 27
shuoxing/qwen2-5-0.5b-full-pretrain-mix-high-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated 12 days ago • 23
shuoxing/qwen2-5-0.5b-full-pretrain-mix-mid-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated 12 days ago • 22
shuoxing/qwen2-5-0.5b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated 12 days ago • 23
shuoxing/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 13 days ago • 38
shuoxing/qwen3-4b-thinking-full-pretrain-mix-high-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 13 days ago • 37
shuoxing/qwen3-4b-thinking-full-pretrain-mix-mid-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 13 days ago • 42
shuoxing/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-reproduce-bs128 Text Generation • 196k • Updated 14 days ago • 36
shuoxing/qwen2-5-7b-full-pretrain-control-tweet-1m-en-reproduce-bs8 Text Generation • 333k • Updated 14 days ago • 60
shuoxing/qwen2-5-7b-full-pretrain-mix-high-tweet-1m-en-reproduce-bs8 Text Generation • 333k • Updated 14 days ago • 66
shuoxing/qwen2-5-7b-full-pretrain-mix-mid-tweet-1m-en-reproduce-bs8 Text Generation • 333k • Updated 14 days ago • 41
shuoxing/qwen2-5-7b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs8 Text Generation • 333k • Updated 14 days ago • 52
shuoxing/llama3-8b-full-sft-control-tweet-1m-en-reproduce-bs128 Text Generation • 266k • Updated 28 days ago • 34
shuoxing/llama3-8b-full-sft-mix-high-tweet-1m-en-reproduce-bs128 Text Generation • 266k • Updated 28 days ago • 31
shuoxing/llama3-8b-full-sft-mix-mid-tweet-1m-en-reproduce-bs128 Text Generation • 266k • Updated 28 days ago • 33
shuoxing/llama3-8b-full-sft-mix-low-tweet-1m-en-reproduce-bs128 Text Generation • 266k • Updated 28 days ago • 42
shuoxing/llama3-8b-full-sft-mix-high-tweet-1m-en-reproduce-bs16 Text Generation • 266k • Updated Dec 30, 2025 • 3