Update README.md
Browse files
README.md
CHANGED
@@ -6,6 +6,8 @@ base_model_relation: quantized
|
|
6 |
tags: [green, llmware-rag, p1, ov]
|
7 |
---
|
8 |
|
|
|
|
|
9 |
# bling-tiny-llama-onnx
|
10 |
|
11 |
**bling-tiny-llama-onnx** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in ONNX int4 for AI PCs using Intel GPU, CPU and NPU.
|
|
|
6 |
tags: [green, llmware-rag, p1, ov]
|
7 |
---
|
8 |
|
9 |
+
cloned from # llmware/bling-tiny-llama-onnx
|
10 |
+
|
11 |
# bling-tiny-llama-onnx
|
12 |
|
13 |
**bling-tiny-llama-onnx** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in ONNX int4 for AI PCs using Intel GPU, CPU and NPU.
|