view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs 15 days ago • 25
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 103
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 26 days ago • 191
Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages Paper • 2503.20212 • Published Mar 26 • 6
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 126
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 4 days ago • 31
Llama Nemotron Collection Open, Production-ready Enterprise Models • 6 items • Updated 4 days ago • 52