Spaces:
Sleeping
Sleeping
Upload folder using huggingface_hub
Browse files
README.md
CHANGED
@@ -4,11 +4,13 @@ app_file: app.py
|
|
4 |
sdk: gradio
|
5 |
sdk_version: 4.44.1
|
6 |
---
|
7 |
-
# 🤗 AutoQuantNX
|
8 |
|
9 |
## Overview
|
10 |
AutoQuantNX is a powerful Gradio-based web application designed to simplify the process of optimizing and deploying Hugging Face models. It supports a wide range of tasks, including quantization, ONNX conversion, and seamless integration with the Hugging Face Hub. With AutoQuantNX, you can easily convert models to ONNX format, apply quantization techniques, and push the optimized models to your Hugging Face account—all through an intuitive user interface.
|
11 |
|
|
|
|
|
12 |
## Features
|
13 |
|
14 |
### Supported Tasks
|
|
|
4 |
sdk: gradio
|
5 |
sdk_version: 4.44.1
|
6 |
---
|
7 |
+
# 🤗 [AutoQuantNX](https://huggingface.co/spaces/smokxy/AutoQuantNX)
|
8 |
|
9 |
## Overview
|
10 |
AutoQuantNX is a powerful Gradio-based web application designed to simplify the process of optimizing and deploying Hugging Face models. It supports a wide range of tasks, including quantization, ONNX conversion, and seamless integration with the Hugging Face Hub. With AutoQuantNX, you can easily convert models to ONNX format, apply quantization techniques, and push the optimized models to your Hugging Face account—all through an intuitive user interface.
|
11 |
|
12 |
+
## ```In the deployed UI, only 16 Bit quantization works because of GPU requirement of BitsAndBytes and no GPU availability in free HF space.```
|
13 |
+
|
14 |
## Features
|
15 |
|
16 |
### Supported Tasks
|