Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Zidi Xiong
polaris-73
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
published
a model
8 days ago
polaris-73/ds8b_grpo_math_gsm8k_reinforce-global_step_100
updated
a model
8 days ago
polaris-73/ds8b_grpo_math_gsm8k_rloo-global_step_700
published
a model
8 days ago
polaris-73/ds8b_grpo_math_gsm8k_rloo-global_step_700
View all activity
Organizations
polaris-73
's models
74
Sort:Â Recently updated
polaris-73/ds1p5b_grpo_math_gsm8k_ppo-global_step_400
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k_ppo-global_step_200
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_870
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_800
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_600
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_400
2B
•
Updated
Jul 14
•
2
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_200
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_870
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_800
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_600
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_400
2B
•
Updated
Jul 14
•
3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_200
2B
•
Updated
Jul 14
•
3
polaris-73/ds7b_grpo_math_faithful_step200
8B
•
Updated
Jul 2
•
8
•
1
polaris-73/control_unlearn_llama3
8B
•
Updated
Mar 13
Previous
1
2
3
Next