Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
106
35
23
TY.Zheng
aaabiao
Follow
kangz's profile picture
21world's profile picture
Meka-1018's profile picture
20 followers
·
9 following
https://scholar.google.com/citations?user=Vq-VZnUAAAAJ&hl=zh-CN
Zheng0428
AI & ML interests
None yet
Recent Activity
updated
a dataset
10 days ago
aaabiao/dapo_filter
published
a dataset
10 days ago
aaabiao/dapo_filter
upvoted
a
paper
10 days ago
Agentic Reinforced Policy Optimization
View all activity
Organizations
Papers
25
arxiv:
2507.07017
arxiv:
2507.00432
arxiv:
2504.05535
arxiv:
2502.14739
Expand 25 papers
models
62
Sort: Recently updated
aaabiao/qwen3_14b_think_32B_math_reject_sampling_150step_0706
15B
•
Updated
Jul 5
•
3
aaabiao/qwen3_14b_no_think_32B_math_reject_sampling_150step_0706
15B
•
Updated
Jul 5
•
3
aaabiao/qwen3_14b_think_32B_math_reject_sampling_150step
15B
•
Updated
Jul 2
•
3
aaabiao/qwen3_14b_no_think_32B_math_reject_sampling_150step
15B
•
Updated
Jul 2
•
3
aaabiao/qwen3_14b_distill_no_think_32b_5e5_150step_fix2
15B
•
Updated
Jun 29
•
5
aaabiao/verl-8B-100step-v1
8B
•
Updated
Jun 29
•
25
aaabiao/verl-4B-100step-v1
4B
•
Updated
Jun 28
•
4
aaabiao/qwen3_14b_distill_no_think_32b_5e5_150step_fix
15B
•
Updated
Jun 28
•
7
aaabiao/verl-14B-60step-v1
15B
•
Updated
Jun 27
•
4
aaabiao/verl-14B-120step-v1
15B
•
Updated
Jun 27
•
4
View 62 models
datasets
11
Sort: Recently updated
aaabiao/dapo_filter
Preview
•
Updated
10 days ago
•
10
aaabiao/data_bon8
Viewer
•
Updated
24 days ago
•
261k
•
50
aaabiao/Transfer_Dataset
Viewer
•
Updated
Jul 7
•
39.9k
•
22
aaabiao/OpenThoughts2-1M-fiilter
Viewer
•
Updated
Jun 5
•
497k
•
5
aaabiao/neo-stage1
Viewer
•
Updated
May 4
•
1.55M
•
2
aaabiao/neo-stage2
Viewer
•
Updated
May 4
•
231k
•
4
aaabiao/RL-dataset
Preview
•
Updated
Apr 16
•
26
aaabiao/RL-datasets
Updated
Apr 9
•
2
aaabiao/Code_Data
Preview
•
Updated
Jan 10
•
979
aaabiao/DAG
Updated
Dec 29, 2024
•
43
View 11 datasets