Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ssaroya
/
gptq_model
like
0
arxiv:
2302.13971
arxiv:
2210.17323
Model card
Files
Files and versions
Community
Deploy
main
gptq_model
/
quant
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
ssaroya
Upload 7 files
401522d
about 2 years ago
__init__.py
312 Bytes
Upload 7 files
about 2 years ago
custom_autotune.py
8.78 kB
Upload 7 files
about 2 years ago
fused_attn.py
8.6 kB
Upload 7 files
about 2 years ago
fused_mlp.py
11.9 kB
Upload 7 files
about 2 years ago
placeholder.txt
Safe
4 Bytes
Create quant/placeholder.txt
about 2 years ago
quant_linear.py
18.3 kB
Upload 7 files
about 2 years ago
quantizer.py
4.26 kB
Upload 7 files
about 2 years ago
triton_norm.py
3.12 kB
Upload 7 files
about 2 years ago