view article Article Blazingly fast whisper transcriptions with Inference Endpoints about 21 hours ago β’ 16
view article Article LeRobot Community Datasets: The βImageNetβ of Robotics β When and How? 3 days ago β’ 44
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training Paper β’ 2505.00358 β’ Published 13 days ago β’ 20
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models Paper β’ 2505.03821 β’ Published 11 days ago β’ 22
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper β’ 2505.04588 β’ Published 6 days ago β’ 55
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper β’ 2505.02567 β’ Published 8 days ago β’ 67
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning Paper β’ 2111.10952 β’ Published Nov 22, 2021 β’ 2
Finetuned Language Models Are Zero-Shot Learners Paper β’ 2109.01652 β’ Published Sep 3, 2021 β’ 3
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. β’ 13 items β’ Updated 12 days ago β’ 144
Towards Understanding Sycophancy in Language Models Paper β’ 2310.13548 β’ Published Oct 20, 2023 β’ 6
MAI-DS-R1 Collection MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team. β’ 2 items β’ Updated 13 days ago β’ 11