view article Article ScreenEnv: Deploy your full stack Desktop Agent By A-Mahla and 1 other • Jul 10 • 62
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 642
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • Jun 26 • 114
view article Article The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't) By fdaudens • Jun 24 • 10
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Paper • 2505.02707 • Published May 5 • 86
Slam Collection All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 7 items • Updated May 22 • 13
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated Feb 6 • 53
view article Article Evaluating Audio Reasoning with Big Bench Audio By mhillsmith and 1 other • Dec 20, 2024 • 23
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • Jan 20 • 48
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 281
Open Whisper-style Speech Models (OWSM) Collection Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 21 items • Updated Jun 3 • 6
view article Article TTS Arena: Benchmarking Text-to-Speech Models in the Wild By mrfakename and 6 others • Feb 27, 2024 • 71
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 233
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 407