---
# Model card metadata following the Hugging Face specification:
# https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
# Documentation: https://huggingface.co/docs/hub/model-cards
license: mit
tags:
- llama-cpp-python
- cuda
- nvidia
- blackwell
- windows
- prebuilt-wheels
- python
- machine-learning
- large-language-models
- gpu-acceleration
---

# llama-cpp-python 0.3.9 Prebuilt Wheel with CUDA Support for Windows

This repository provides a prebuilt Python wheel for **llama-cpp-python** (version 0.3.9) with NVIDIA CUDA support, optimized for Windows 10/11 (x64) systems. The wheel enables GPU-accelerated inference for large language models (LLMs) using the `llama.cpp` library and eliminates the need to compile from source. It is built for Python 3.10 and supports NVIDIA GPUs up to and including the latest Blackwell architecture.

## Available Wheel

- `llama_cpp_python-0.3.9-cp310-cp310-win_amd64.whl` (Python 3.10, CUDA 12.8)

## Compatibility

The prebuilt wheel targets NVIDIA Blackwell GPUs and has also been tested and confirmed compatible with previous-generation NVIDIA GPUs. Verified cards include:

- NVIDIA RTX 5090 (Blackwell)
- NVIDIA RTX 3090 (Ampere)

## Installation

To install the wheel, run the following command in your Python 3.10 environment:

```bash
pip install llama_cpp_python-0.3.9-cp310-cp310-win_amd64.whl
```