dewdev commited on
Commit
39a0cce
·
verified ·
1 Parent(s): 89fb4cf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -6,6 +6,8 @@ base_model_relation: quantized
6
  tags: [green, llmware-rag, p1, ov]
7
  ---
8
 
 
 
9
  # bling-tiny-llama-onnx
10
 
11
  **bling-tiny-llama-onnx** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in ONNX int4 for AI PCs using Intel GPU, CPU and NPU.
 
6
  tags: [green, llmware-rag, p1, ov]
7
  ---
8
 
9
+ cloned from # llmware/bling-tiny-llama-onnx
10
+
11
  # bling-tiny-llama-onnx
12
 
13
  **bling-tiny-llama-onnx** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in ONNX int4 for AI PCs using Intel GPU, CPU and NPU.