Article Blazingly fast whisper transcriptions with Inference Endpoints about 22 hours ago • 16
Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? 3 days ago • 44
Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs 15 days ago • 25
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 5 items • Updated 13 days ago • 110
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 154
π_0: A Vision-Language-Action Flow Model for General Robot Control Paper • 2410.24164 • Published Oct 31, 2024 • 13
Article LeRobot goes to driving school: World’s largest open-source self-driving dataset Mar 11 • 79