---
title: README
emoji: ⚡
colorFrom: green
colorTo: gray
sdk: static
pinned: false
---
# Audio, Music, and AI Lab (AMAAI)
The Audio, Music, and AI Lab (AMAAI) at the Singapore University of Technology and Design works on multimodal AI, with a focus on audio and music AI.

[More info and publications here.](https://dorienherremans.com/biblio)

Popular software:
- SonicMaster: all-in-one music restoration and mastering: [code](https://github.com/AMAAI-Lab/SonicMaster) - [examples](https://amaai-lab.github.io/SonicMaster/)
- Jam 0.5: text-to-song: [code](https://huggingface.co/declare-lab/JAM-0.5) - [examples](https://declare-lab.github.io/jamify) - [dataset](https://huggingface.co/datasets/declare-lab/JAME), in collaboration with Declare lab
- SonicVerse: time-aware music captioning: [code](https://github.com/AMAAI-Lab/sonicverse) - [live demo](https://huggingface.co/spaces/amaai-lab/SonicVerse)
- Music2Emo: emotion detection from music: [code](https://github.com/AMAAI-Lab/Music2Emotion) - [live demo](https://huggingface.co/spaces/amaai-lab/music2emo/discussions/1)
- Mustango: text-to-music generation: [code](https://github.com/AMAAI-Lab/mustango) - [live demo](https://replicate.com/declare-lab/mustango)
- Video2Music: video-to-music generation: [code](https://github.com/AMAAI-Lab/Video2Music)
- Text2midi: text-to-MIDI generation: [code](https://github.com/AMAAI-Lab/text2midi)
- nnAudio: on-the-fly spectrogram extraction: [code](https://github.com/AMAAI-Lab/nnAudio)

Popular datasets:
- [JamendoMaxCaps](https://huggingface.co/datasets/amaai-lab/JamendoMaxCaps): text captions with instrumental music audio
- [MusicBench](https://huggingface.co/datasets/amaai-lab/MusicBench): text captions with music audio
- [MidiCaps](https://huggingface.co/datasets/amaai-lab/MidiCaps): text captions with MIDI music files (large-scale)