Article Blazingly fast whisper transcriptions with Inference Endpoints By mfuntowicz and 5 others • about 21 hours ago • 16
Article LeRobot Community Datasets: The "ImageNet" of Robotics - When and How? 3 days ago • 44
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training Paper • 2505.00358 • Published 13 days ago • 20
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models Paper • 2505.03821 • Published 11 days ago • 22
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published 6 days ago • 55
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published 8 days ago • 67