refactor(model): remove explicit device_type parameter from amp decorators d65e5f6 PierrunoYT commited on 18 days ago
perf(model): enable 8-bit quantization and explicit CUDA device targeting 50f1efd PierrunoYT commited on 18 days ago