SATQuest Dataset Collections
Adam Yanxiao Zhao
sdpkjc
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted
a
collection
3 days ago
SATQuest
updated
a collection
3 days ago
SATQuest
updated
a collection
3 days ago
SATQuest
Organizations
Collections
1
Papers
2
models
98

sdpkjc/Qwen2.5-1.5B-Instruct-FT-DPO
Text Generation
•
Updated
•
17

sdpkjc/SmolLM2-FT-DPO
Text Generation
•
Updated
•
3

sdpkjc/SmolLM2-FT-MyDataset
Text Generation
•
Updated
•
2

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated

sdpkjc/Ant-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated

sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated

sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
datasets
14
sdpkjc/SATQuest-RFT-3k
Viewer
•
Updated
•
3k
•
277
sdpkjc/SATQuest-RFT-1k
Viewer
•
Updated
•
1k
•
178
sdpkjc/SATQuest-Tiny
Viewer
•
Updated
•
10
•
109
sdpkjc/SATQuest
Viewer
•
Updated
•
140
•
450
sdpkjc/SATQuest-G
Viewer
•
Updated
•
963
•
52
sdpkjc/NumBase-N01-S2g-B2g
Viewer
•
Updated
•
983k
•
17
sdpkjc/NumBase-N01-S2g-B28
Viewer
•
Updated
•
459k
•
67
sdpkjc/NumBase-N01-S2g-B24
Viewer
•
Updated
•
197k
•
20
sdpkjc/NumBase-N01-S28-B2g
Viewer
•
Updated
•
3.81k
•
19
sdpkjc/NumBase-N01-S28-B28
Viewer
•
Updated
•
1.78k
•
43