Hugging Face
Joya Chen
PRO
chenjoya
22 followers · 13 following
https://chenjoya.github.io/
chenjoya
AI & ML interests
Video LLM
Recent Activity
upvoted a paper 2 days ago
Reinforcement Learning in Vision: A Survey
liked a model 9 days ago
Qwen/Qwen-Image
updated a dataset 10 days ago
chenjoya/Live-WhisperX-526K
Organizations
chenjoya's activity
liked a model 9 days ago
Qwen/Qwen-Image • Text-to-Image • Updated 8 days ago • 69.7k • 1.58k
liked a dataset 22 days ago
keithito/lj_speech • Updated Aug 14, 2024 • 575 • 56
liked a model 22 days ago
bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated 16 days ago • 278k • 552
liked a model about 1 month ago
google/gemma-3n-E4B-it • Image-Text-to-Text • 8B • Updated about 1 month ago • 129k • 716
liked a model about 2 months ago
showlab/show-o2-7B • Any-to-Any • Updated Jun 22 • 689 • 13
liked 2 models 3 months ago
pyannote/speaker-diarization-3.1 • Automatic Speech Recognition • Updated May 10, 2024 • 20.4M • 1.04k
nvidia/diar_sortformer_4spk-v1 • Audio Classification • Updated Feb 3 • 23.4k • 75
liked 8 datasets 3 months ago
parler-tts/mls_eng • Viewer • Updated Apr 9, 2024 • 10.8M • 5.07k • 27
edinburghcstr/ami • Viewer • Updated Jan 16, 2023 • 110k • 3.56k • 60
mozilla-foundation/common_voice_17_0 • Viewer • Updated Jun 16, 2024 • 13M • 40.4k • 329
facebook/multilingual_librispeech • Viewer • Updated Aug 12, 2024 • 1.49M • 8.32k • 145
facebook/voxpopuli • Updated Oct 14, 2022 • 7.27k • 127
CSTR-Edinburgh/vctk • Updated Aug 14, 2024 • 385 • 46
MLCommons/peoples_speech • Viewer • Updated Nov 20, 2024 • 8.05M • 21.5k • 137
MERaLiON/Multitask-National-Speech-Corpus-v1 • Viewer • Updated Jan 21 • 15.2M • 13.3k • 15
liked 2 models 3 months ago
nvidia/parakeet-tdt-0.6b-v2 • Automatic Speech Recognition • Updated 3 days ago • 524k • 1.29k
zai-org/glm-4-voice-tokenizer • 0.4B • Updated Oct 25, 2024 • 59.8k • 10
liked a Space 3 months ago
FlexTok 🖼 • Running on Zero • 14
FlexTok flexible sequence length autoencoding demo
liked a model 3 months ago
moonshotai/Kimi-Audio-7B-Instruct • Text-to-Speech • 10B • Updated May 29 • 8.98k • 344
liked a Space 3 months ago
Seed1.5 VL 🚀 • Running • 148
Seed1.5-VL API Demo