This collection contains datasets that can be used in OpenBench to measure how well a system generates diarized transcripts.
AI & ML interests
Foundation Models On Device
Speaker diarization datasets that accompany our Interspeech 2025 paper, SDBench.
Models, datasets, and evaluation results for DiffusionKit: https://github.com/argmaxinc/DiffusionKit
A collection of STT (Speech-to-Text) datasets compatible with OpenBench.
https://github.com/argmaxinc/WhisperKit & https://argmaxinc.com/#SDK