Eustache Le Bihan's picture

Eustache Le Bihan

eustlb

·

AI & ML interests

Audio - ASR

Recent Activity

updated a model 13 days ago

eustlb/parakeet-ctc-1.1b

published a model 14 days ago

eustlb/parakeet-ctc-1.1b

new activity 16 days ago

mistralai/Voxtral-Mini-3B-2507:VoxtralForConditionalGeneration import error

View all activity

Organizations

upvoted a paper 27 days ago

Voxtral

Paper • 2507.13264 • Published 28 days ago • 25

upvoted an article about 1 month ago

Article

ScreenEnv: Deploy your full stack Desktop Agent

By

and 1 other •

Jul 10

• 62

upvoted a collection about 1 month ago

Ultravox v0.6

4 items • Updated Jul 5 • 4

upvoted 2 articles about 1 month ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By

and 1 other •

Jul 9

• 642

Article

Gemma 3n fully available in the open-source ecosystem!

By

and 7 others •

Jun 26

• 114

upvoted an article about 2 months ago

Article

The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't)

By

•

Jun 24

• 10

upvoted a paper 3 months ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5 • 86

upvoted an article 4 months ago

Article

Understanding Vector Quantization in VQ-VAE

By

•

Aug 28, 2024

• 37

upvoted 2 collections 6 months ago

Slam

All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 7 items • Updated May 22 • 13

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated Feb 6 • 53

upvoted an article 6 months ago

Article

Evaluating Audio Reasoning with Big Bench Audio

By

and 1 other •

Dec 20, 2024

• 23

upvoted an article 7 months ago

Article

Yay! Organizations can now publish blog Articles

By

and 3 others •

Jan 20

• 48

upvoted a collection 7 months ago

NeMo Audio Codecs

A series of Neural Audio Codecs • 8 items • Updated 7 days ago • 12

upvoted a collection 9 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 281

upvoted a collection 11 months ago

Open Whisper-style Speech Models (OWSM)

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 21 items • Updated Jun 3 • 6

upvoted an article 11 months ago

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

By

and 6 others •

Feb 27, 2024

• 71

upvoted a collection 11 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 233

upvoted an article about 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

By

and 2 others •

Jul 16, 2024

• 407