dewdev
/

bling-tiny-llama-onnx

Model card Files Files and versions

dewdev commited on Feb 4

Commit

39a0cce

·

verified ·

1 Parent(s): 89fb4cf

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -6,6 +6,8 @@ base_model_relation: quantized
 tags: [green, llmware-rag, p1, ov]
 ---
 # bling-tiny-llama-onnx
 **bling-tiny-llama-onnx** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in ONNX int4 for AI PCs using Intel GPU, CPU and NPU.

 tags: [green, llmware-rag, p1, ov]
 ---
+cloned from # llmware/bling-tiny-llama-onnx
 # bling-tiny-llama-onnx
 **bling-tiny-llama-onnx** is a very small, very fast fact-based question-answering model, designed for retrieval augmented generation (RAG) with complex business documents, quantized and packaged in ONNX int4 for AI PCs using Intel GPU, CPU and NPU.