- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • 787k • 12.6k
- deepseek-ai/DeepSeek-R1-Zero
  Text Generation • 685B • 1.54k • 933
- deepseek-ai/DeepSeek-R1-Distill-Llama-70B
  Text Generation • 71B • 178k • 713
- deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
  Text Generation • 33B • 772k • 1.42k
DeepSeek
company
Verified
- Chat with DeepSeek-VL2-small
  🌍 Generate responses using images and text input • 524
- deepseek-ai/deepseek-vl2-tiny
  Image-Text-to-Text • 3B • 63.9k • 211
- deepseek-ai/deepseek-vl2-small
  Image-Text-to-Text • 16B • 28.2k • 161
- deepseek-ai/deepseek-vl2
  Image-Text-to-Text • 27B • 3.53k • 353
DeepSeek-Prover-Series
- deepseek-ai/DeepSeek-Coder-V2-Instruct
  Text Generation • 236B • 10.2k • 648
- deepseek-ai/DeepSeek-Coder-V2-Base
  Text Generation • 236B • 1.33k • 75
- deepseek-ai/DeepSeek-Coder-V2-Lite-Base
  Text Generation • 16B • 3.98k • 87
- deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
  Text Generation • 16B • 710k • 458
models for paper expert-specialized fine-tuning
DeepSeek Coder series
- deepseek-ai/deepseek-coder-33b-instruct
  Text Generation • 33B • 15.5k • 537
- deepseek-ai/deepseek-coder-6.7b-instruct
  Text Generation • 7B • 53.4k • 445
- deepseek-ai/deepseek-coder-7b-instruct-v1.5
  Text Generation • 7B • 41.7k • 137
- deepseek-ai/deepseek-coder-1.3b-instruct
  Text Generation • 1B • 48.2k • 136
DeepSeek MoE series
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
DeepSeek Math series
- deepseek-ai/deepseek-math-7b-instruct
  Text Generation • 40.9k • 133
- deepseek-ai/deepseek-math-7b-rl
  Text Generation • 7B • 5.01k • 85
- deepseek-ai/deepseek-math-7b-base
  Text Generation • 4.59k • 73
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • 127
DeepSeek-VL model series
DeepSeek LLM series