9 10 18

Yifan Peng

pyf98

https://pyf98.github.io

pyf98

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

Recent Activity

new activity 7 days ago

nvidia/Nemotron-H-8B-Reasoning-128K:Errors in HybridMambaAttentionDynamicCache

upvoted an article about 1 month ago

Gotchas in Tokenizer Behavior Every Developer Should Know

liked a model about 1 month ago

google/gemma-3-1b-pt

View all activity

Organizations

New activity in nvidia/Nemotron-H-8B-Reasoning-128K 7 days ago

Errors in HybridMambaAttentionDynamicCache

#1 opened 7 days ago by

pyf98

upvoted an article about 1 month ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

•

Apr 18

• 40

liked a model about 1 month ago

google/gemma-3-1b-pt

Text Generation • 1.0B • Updated Mar 21 • 46.3k • 144

upvoted a collection about 1 month ago

OLMo 2

Collection

Artifacts for the OLMo 2 release. • 35 items • Updated May 1 • 136

authored 5 papers 2 months ago

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

Paper • 2111.14706 • Published Nov 29, 2021

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Paper • 2406.09282 • Published Jun 13, 2024

OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models

Paper • 2502.10373 • Published Feb 14 • 1

Granary: Speech Recognition and Translation Dataset in 25 European Languages

Paper • 2505.13404 • Published May 19 • 1

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Paper • 2506.00338 • Published May 31 • 10

updated a collection 2 months ago

Open Whisper-style Speech Models (OWSM)

Collection

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 21 items • Updated Jun 3 • 6

commented a paper 2 months ago

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Paper • 2506.00338 • Published May 31 • 10 •

upvoted a paper 2 months ago

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Paper • 2506.00338 • Published May 31 • 10

updated a dataset 2 months ago

espnet/yodas_owsmv4

Updated Jun 4 • 89 • 10

updated 4 models 2 months ago

New activity in espnet/owsm_v4_medium_1B 2 months ago

where can i find the v4 paper

#1 opened 2 months ago by

StephennFernandes

liked a dataset 2 months ago

espnet/yodas_owsmv4

Updated Jun 4 • 89 • 10

updated a collection 2 months ago

Open Whisper-style Speech Models (OWSM)

Collection

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 21 items • Updated Jun 3 • 6

Yifan Peng

AI & ML interests

Recent Activity

Organizations

pyf98's activity

Errors in HybridMambaAttentionDynamicCache

Gotchas in Tokenizer Behavior Every Developer Should Know

where can i find the v4 paper