Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
dengjia
qwertsdcv
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
qwertsdcv/baseline_grpo_clip0.2_0.28_final2_optm
updated
a model
2 days ago
qwertsdcv/grad_clip0.28_ppl_240_new0.1_v3
published
a model
2 days ago
qwertsdcv/grad_clip0.28_ppl_240_new0.1_v3
View all activity
Organizations
None yet
qwertsdcv
's models
15
Sort: Recently updated
qwertsdcv/baseline_grpo_clip0.2_0.28_final2_optm
Updated
2 days ago
qwertsdcv/grad_clip0.28_ppl_240_new0.1_v3
Updated
2 days ago
qwertsdcv/baseline_dapo_positive_only
Updated
3 days ago
qwertsdcv/grad_cilp0.28_math_ppl_200_v3
Updated
3 days ago
qwertsdcv/baseline_grpo_clip0.2_0.28_final_with_optm
Updated
3 days ago
qwertsdcv/baseline_grpo_clip0.2_0.28_final_w_optm
Updated
4 days ago
qwertsdcv/base_ppl_with_optm
Updated
4 days ago
qwertsdcv/baseline_math
Updated
4 days ago
qwertsdcv/baseline_grpo_clip0.2_0.28_final_ppl_with_optm
Updated
4 days ago
qwertsdcv/baseline_grpo_clip0.2_0.28_final
Updated
4 days ago
qwertsdcv/baseline_grpo_clip0.2_0.28
Updated
4 days ago
qwertsdcv/stage5
Text Generation
•
33B
•
Updated
May 27
•
9
qwertsdcv/stage4
Text Generation
•
33B
•
Updated
May 27
•
9
qwertsdcv/32b_stage3_epo1_stage2.1
Updated
May 24
qwertsdcv/32b_stage1_epo3_hardest_1k_w100_s3_wp
Updated
May 23