ISTA-DASLab/Meta-Llama-3-8B-AQLM-PV-1Bit-1x16
Text Generation
•
2B
•
Updated
•
1
None defined yet.
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers