RedHatAI/granite-3.1-2b-instruct-FP8-dynamic
Text Generation
• 3B • Updated
• 5
RedHatAI/Llama-3.2-1B-quantized.w8a8
1B • Updated
• 20.8k
• 1
RedHatAI/DeepSeek-Coder-V2-Instruct-0724-quantized.w4a16
Text Generation
• 32B • Updated
• 12
• 1
RedHatAI/DeepSeek-V2.5-1210-quantized.w4a16
Text Generation
• 32B • Updated
• 41
RedHatAI/DeepSeek-V2.5-1210-FP8
Text Generation
• 236B • Updated
• 46.5k
• 4
RedHatAI/DeepSeek-Coder-V2-Instruct-0724-FP8
Text Generation
• 236B • Updated
• 4
• 1
RedHatAI/QwQ-32B-Preview-quantized.w8a8
Text Generation
• 33B • Updated
• 1
RedHatAI/QwQ-32B-Preview-FP8-dynamic
Text Generation
• 33B • Updated
RedHatAI/QwQ-32B-Preview-quantized.w4a16
6B • Updated
• 76
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8
Text Generation
• 71B • Updated
• 3
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Text Generation
• 11B • Updated
• 3
RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16
18B • Updated
• 3
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
• 8B • Updated
• 3
• 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
• 8B • Updated
• 2
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic
Text Generation
• 8B • Updated
• 64
• 2
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16
Text Generation
• 2B • Updated
• 5
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16
Text Generation
• 2B • Updated
• 1
• 3
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation
• 2B • Updated
• 2
RedHatAI/Qwen2.5-3B-quantized.w4a16
Text Generation
• 1.0B • Updated
• 646
RedHatAI/Qwen2.5-1.5B-quantized.w4a16
Text Generation
• 0.6B • Updated
• 1
RedHatAI/Qwen2.5-0.5B-quantized.w4a16
Text Generation
• 0.3B • Updated
• 4
RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8
Text Generation
• 15B • Updated
• 3
RedHatAI/granite-3.1-8b-instruct-GGUF
8B • Updated
• 5
RedHatAI/Sparse-Llama-3.1-8B-2of4
Text Generation
• 8B • Updated
• 31
• 62
RedHatAI/Qwen2.5-Math-7B-Instruct-FP8-dynamic
8B • Updated
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a8
Text Generation
• 0.6B • Updated
• 32
RedHatAI/Qwen2.5-72B-FP8-dynamic
Text Generation
• 73B • Updated
• 17
• 1
RedHatAI/Qwen2.5-72B-quantized.w8a8
Text Generation
• 73B • Updated
• 1
RedHatAI/Qwen2.5-14B-quantized.w8a8
Text Generation
• 15B • Updated
• 3
• 2
RedHatAI/Qwen2.5-14B-FP8-dynamic
Text Generation
• 15B • Updated
• 47
• 2