Hugging Face
Nguyễn Minh Phúc
DatPySci
2 followers · 1 following
AI & ML interests
Reinforcement learning, NLP
DatPySci's activity
published a model 2 days ago
DatPySci/RLVR-SGDM-Gap • Updated 2 days ago

updated a model 7 days ago
DatPySci/RLVR-CoTs • Updated 7 days ago

published a model 8 days ago
DatPySci/RLVR-CoTs • Updated 7 days ago

published a model 9 days ago
DatPySci/RLM • Updated 9 days ago

updated a model 11 days ago
DatPySci/PreRLVR-Controlled • Updated 11 days ago

published a model 27 days ago
DatPySci/PreRLVR-Controlled • Updated 11 days ago

updated a model about 2 months ago
DatPySci/RLDI • 2B • Updated Dec 18, 2025 • 3

updated a dataset 5 months ago
DatPySci/Qwen2.5-Math-1.5B-deepscaler • Viewer • Updated Sep 16, 2025 • 161k • 19

published a dataset 5 months ago
DatPySci/Qwen2.5-Math-1.5B-deepscaler • Viewer • Updated Sep 16, 2025 • 161k • 19

updated a dataset 5 months ago
DatPySci/Qwen2.5-Math-7B-deepscaler • Viewer • Updated Sep 16, 2025 • 161k • 3 • 1

published a dataset 5 months ago
DatPySci/Qwen2.5-Math-7B-deepscaler • Viewer • Updated Sep 16, 2025 • 161k • 3 • 1

updated a dataset 5 months ago
DatPySci/Llama-3.2-3B-deepscaler • Viewer • Updated Sep 16, 2025 • 161k • 4

published a dataset 5 months ago
DatPySci/Llama-3.2-3B-deepscaler • Viewer • Updated Sep 16, 2025 • 161k • 4

published a model 6 months ago
DatPySci/RLDI • 2B • Updated Dec 18, 2025 • 3

updated a model 10 months ago
DatPySci/Qwen-2.5-7B-Simple-RL • Updated May 3, 2025 • 1

published 2 models 10 months ago
DatPySci/Qwen-2.5-7B-Simple-RL • Updated May 3, 2025 • 1
DatPySci/Llama-3.2-3B-sft-mixture • Text Generation • 3B • Updated Feb 10, 2025 • 1

updated 2 models 10 months ago
DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated Apr 28, 2025 • 2
DatPySci/DeepSeek-Qwen-1.5B-GRPO • 2B • Updated Apr 22, 2025 • 1

published a model 10 months ago
DatPySci/DeepSeek-Qwen-1.5B-GRPO • 2B • Updated Apr 22, 2025 • 1