AbstractTTS group

community

zenyn

authored 2 papers 3 months ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published Oct 19, 2025 • 19

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published Oct 19, 2025 • 17

zenyn

authored a paper 4 months ago

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

Paper • 2506.05140 • Published Jun 5, 2025

zenyn

authored 2 papers 6 months ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3, 2025 • 18

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models

Paper • 2505.17496 • Published May 23, 2025 • 2

zenyn

authored 2 papers 7 months ago

Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision

Paper • 2401.00273 • Published Dec 30, 2023

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 3

windcrossroad

updated 2 datasets over 1 year ago

AbstractTTS/combined_dataset_map

Viewer • Updated Aug 20, 2024 • 124k • 57

AbstractTTS/IEMOCAP_map

Viewer • Updated Aug 20, 2024 • 4.82k • 58

zenyn

updated 7 datasets over 1 year ago

AbstractTTS/PODCAST

Viewer • Updated Aug 12, 2024 • 149k • 479 • 8

AbstractTTS/TESS

Viewer • Updated Aug 11, 2024 • 2.8k • 24

AbstractTTS/SAVEE

Viewer • Updated Aug 11, 2024 • 480 • 19 • 1

AbstractTTS/RAVDESS

Viewer • Updated Aug 11, 2024 • 1.44k • 8 • 1

AbstractTTS/ESD_english

Viewer • Updated Aug 11, 2024 • 17.5k • 27 • 1

AbstractTTS/CREMA-D

Viewer • Updated Aug 11, 2024 • 7.44k • 50

AbstractTTS/IEMOCAP

Viewer • Updated Aug 11, 2024 • 10k • 790 • 18