Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 3 items • Updated Jul 9 • 9
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 149
view article Article Optimize and deploy models with Optimum-Intel and OpenVINO GenAI By AlexKoff88 and 6 others • Sep 20, 2024 • 23