Nguyễn Minh Phúc

DatPySci

AI & ML interests

Reinforcement learning, NLP

Recent Activity

published a model 2 days ago

DatPySci/RLVR-SGDM-Gap

updated a model 7 days ago

DatPySci/RLVR-CoTs

published a model 8 days ago

DatPySci/RLVR-CoTs

Organizations

published a model 2 days ago

DatPySci/RLVR-SGDM-Gap

Updated 2 days ago

updated a model 7 days ago

DatPySci/RLVR-CoTs

Updated 7 days ago

published a model 8 days ago

DatPySci/RLVR-CoTs

Updated 7 days ago

published a model 9 days ago

DatPySci/RLM

Updated 9 days ago

updated a model 11 days ago

DatPySci/PreRLVR-Controlled

Updated 11 days ago

published a model 27 days ago

DatPySci/PreRLVR-Controlled

Updated 11 days ago

updated a model about 2 months ago

DatPySci/RLDI

2B • Updated Dec 18, 2025 • 3

updated a dataset 5 months ago

DatPySci/Qwen2.5-Math-1.5B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 19

published a dataset 5 months ago

DatPySci/Qwen2.5-Math-1.5B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 19

updated a dataset 5 months ago

DatPySci/Qwen2.5-Math-7B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 3 • 1

published a dataset 5 months ago

DatPySci/Qwen2.5-Math-7B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 3 • 1

updated a dataset 5 months ago

DatPySci/Llama-3.2-3B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 4

published a dataset 5 months ago

DatPySci/Llama-3.2-3B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 4

published a model 6 months ago

DatPySci/RLDI

2B • Updated Dec 18, 2025 • 3

updated a model 10 months ago

DatPySci/Qwen-2.5-7B-Simple-RL

Updated May 3, 2025 • 1

published 2 models 10 months ago

DatPySci/Qwen-2.5-7B-Simple-RL

Updated May 3, 2025 • 1

DatPySci/Llama-3.2-3B-sft-mixture

Text Generation • 3B • Updated Feb 10, 2025 • 1

updated 2 models 10 months ago

DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

2B • Updated Apr 28, 2025 • 2

DatPySci/DeepSeek-Qwen-1.5B-GRPO

2B • Updated Apr 22, 2025 • 1

published a model 10 months ago

DatPySci/DeepSeek-Qwen-1.5B-GRPO

2B • Updated Apr 22, 2025 • 1
