llama-cpp-python (Windows CUDA build)
Prebuilt wheel for:
- Windows x64
- Python 3.12 (cp312)
- CUDA enabled
- AVX512 disabled
- Supports NVIDIA 10 / 20 / 30 / 40 / 50 series GPUs
Install
Direct install:
Or download manually and install:
pip install llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl
Uninstall
pip uninstall llama-cpp-python
Requirements
- Windows 64-bit
- Python 3.12
- NVIDIA GPU
- CUDA Toolkit installed
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support