11 26

Armen Mkrtumyan

armenmkrt

https://github.com/armenmkrt

AI & ML interests

None yet

Recent Activity

liked a model 10 months ago

kyutai/mimi

liked a Space about 1 year ago

srinivasbilla/llasa-3b-tts

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

liked a model 10 months ago

kyutai/mimi

Feature Extraction • 96.2M • Updated Jul 2, 2025 • 564k • • 291

liked a Space about 1 year ago

Llasa 3b Tts

🔥

313

Zero Shot voice cloning with llasa 3b (Unofficial Demo)

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 1.13M • • 13.1k

upvoted 2 papers about 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 441

SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces

Paper • 2501.09756 • Published Jan 16, 2025 • 20

liked a model about 1 year ago

unsloth/Llama-3.2-1B-Instruct

Text Generation • 1B • Updated May 9, 2025 • 229k • 90

upvoted 3 papers about 1 year ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6, 2025 • 44

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3, 2025 • 47

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Paper • 2501.01821 • Published Jan 3, 2025 • 20

liked a model about 1 year ago

tencent/HunyuanVideo

Text-to-Video • Updated Mar 6, 2025 • 1.11k • • 2.13k

liked a model over 1 year ago

Metric-AI/armenian-text-embeddings-1

Feature Extraction • 0.3B • Updated Feb 20, 2025 • 619 • 20

liked a Space over 1 year ago

Whisper

📉

2.72k

Transcribe audio files and YouTube videos into text

liked 2 models over 1 year ago

microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 447 • 1.71k

OuteAI/OuteTTS-0.1-350M

Text-to-Speech • Updated Apr 17, 2025 • 1.13k • 302

upvoted a paper over 1 year ago

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published Oct 30, 2024 • 55

liked a model over 1 year ago

amphion/MaskGCT

Text-to-Speech • Updated Apr 13, 2025 • 707 • 306

liked a dataset over 1 year ago

Marqo/marqo-GS-10M

Viewer • Updated Oct 23, 2024 • 9.81M • 608 • 53

liked a model over 1 year ago

SWivid/F5-TTS

Text-to-Speech • Updated Mar 21, 2025 • 822k • 1.15k

upvoted a paper over 1 year ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9, 2024 • 46

liked a model over 1 year ago

Freepik/flux.1-lite-8B-alpha

Text-to-Image • Updated Dec 30, 2024 • 387 • 428

Armen Mkrtumyan

AI & ML interests

Recent Activity

Organizations

armenmkrt's activity

Llasa 3b Tts

Whisper