70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15 • 28