TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion Paper • 2303.09057 • Published Mar 16, 2023 • 2
RapFlow-TTS: Rapid and High-Fidelity Text-to-Speech with Improved Consistency Flow Matching Paper • 2506.16741 • Published Jun 20, 2025 • 1
Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary Model Paper • 2507.03302 • Published Jul 4, 2025 • 1
A Robust framework for sound event localization and detection on real recordings Paper • 2512.22156 • Published 17 days ago • 1
Rethinking Leveraging Pre-Trained Multi-Layer Representations for Speaker Verification Paper • 2512.22148 • Published 17 days ago • 1
WAY: Estimation of Vessel Destination in Worldwide AIS Trajectory Paper • 2512.13190 • Published 17 days ago • 6
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection Paper • 2303.15703 • Published Mar 28, 2023 • 3
Layer-aware TDNN: Speaker Recognition Using Multi-Layer Features from Pre-Trained Models Paper • 2409.07770 • Published Sep 12, 2024 • 3
WAY: Estimation of Vessel Destination in Worldwide AIS Trajectory Paper • 2512.13190 • Published 17 days ago • 6