RedHatAI/Llama-3.1-8B-Instruct-FP8-block
Text Generation
•
8B
•
Updated
•
81
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block
Text Generation
•
236B
•
Updated
•
75
•
3
RedHatAI/Qwen3-30B-A3B-FP8-block
Text Generation
•
31B
•
Updated
•
10.3k
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block
Text Generation
•
109B
•
Updated
•
51
•
3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block
Text Generation
•
402B
•
Updated
•
2
•
1
RedHatAI/Llama-3.3-70B-Instruct-FP8-block
Text Generation
•
71B
•
Updated
•
372
RedHatAI/Qwen3-32B-FP8-block
Text Generation
•
33B
•
Updated
•
17
RedHatAI/Qwen3-14B-FP8-block
Text Generation
•
15B
•
Updated
•
45
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
Text Generation
•
71B
•
Updated
•
14.8k
•
14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation
•
71B
•
Updated
•
11
•
2
RedHatAI/Llama-3.2-1B-FP8
1B
•
Updated
•
28.9k
Image-Text-to-Text
•
12B
•
Updated
•
12
•
1
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic
Text Generation
•
236B
•
Updated
•
1.31k
•
4
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8
Image-Text-to-Text
•
8B
•
Updated
•
1.09k
•
8
RedHatAI/Apertus-70B-Instruct-2509-FP8-dynamic
Text Generation
•
71B
•
Updated
•
120
•
1
RedHatAI/phi-4-FP8-dynamic
Text Generation
•
15B
•
Updated
•
6.04k
RedHatAI/phi-4-quantized.w8a8
Text Generation
•
15B
•
Updated
•
522
•
2
Text Generation
•
15B
•
Updated
•
152
•
1
RedHatAI/phi-4-quantized.w4a16
Text Generation
•
3B
•
Updated
•
3.54k
•
4
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
•
94
•
2
RedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16
Text Generation
•
11B
•
Updated
•
127
•
1
RedHatAI/Qwen2.5-Coder-14B-Instruct-FP8-dynamic
Text Generation
•
15B
•
Updated
•
215
•
1
Text Generation
•
9B
•
Updated
•
76
•
1
RedHatAI/gemma-2-9b-it-FP8
Text Generation
•
9B
•
Updated
•
247
•
5
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
•
9.29k
•
19
RedHatAI/granite-3.1-8b-base-quantized.w4a16
Text Generation
•
1B
•
Updated
•
16
•
1
RedHatAI/Qwen2.5-7B-Instruct-quantized.w4a16
Text Generation
•
2B
•
Updated
•
39
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
•
231
•
2
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
•
1B
•
Updated
•
703
•
1
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16
Text Generation
•
11B
•
Updated
•
1.84k
•
3