Mungert/Qwen2.5-VL-3B-Instruct-GGUF Image-Text-to-Text • 3B • Updated about 12 hours ago • 5.57k • 21
bartowski/Teuken-7B-instruct-research-v0.4-GGUF Text Generation • 7B • Updated Nov 26, 2024 • 1.19k • 4
openGPT-X/Teuken-7B-instruct-commercial-v0.4 Text Generation • 7B • Updated Dec 11, 2024 • 1.68k • 74
SebastianBodza/Kartoffel_Orpheus-3B_german_natural-v0.1 Text-to-Speech • 3B • Updated May 17 • 112 • 13
Post 2932
This paper has been blowing up. They train an open-source multimodal LLM (InternVL3) that can compete with GPT-4o and Claude 3.5 Sonnet by:
> training text and vision in a single stage
> a novel V2PE positional encoding
> SFT & mixed preference optimization
> test-time scaling
Paper: InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models (2504.10479)
diarizers-community/speaker-segmentation-fine-tuned-callhome-deu 0.0B • Updated Apr 25, 2024 • 73 • 6