Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
handraise-dev
/
gguf-inference
like
1
Text Generation
multilingual
nlp
code
License:
mit
Model card
Files
Files and versions
Community
Deploy
f96aa72
gguf-inference
Ctrl+K
Ctrl+K
2 contributors
History:
19 commits
syberWolf
update handler and add flash attention
f96aa72
about 1 year ago
.gitattributes
Safe
3.48 kB
llamacpp quants
about 1 year ago
.gitignore
Safe
10 Bytes
add handler
about 1 year ago
README.md
Safe
8.85 kB
llamacpp quants
about 1 year ago
handler.py
1.14 kB
update handler and add flash attention
about 1 year ago
requirements.txt
18 Bytes
update handler and add flash attention
about 1 year ago