-
PTPReasoning/Qwen2.5-7B-Base-SFT-Clean-V2
Text Generation • 8B • Updated • 8 -
PTPReasoning/Qwen2.5-7B-Base-SFT-Baseline-V2
Text Generation • 8B • Updated • 8 -
PTPReasoning/Qwen2.5-7B-Base-RL-Clean-V2
Text Generation • 8B • Updated • 7 -
PTPReasoning/Qwen2.5-7B-Base-RL-Baseline
Text Generation • 8B • Updated • 7
ProgramTrace
non-profit
AI & ML interests
None defined yet.
Recent Activity
View all activity
-
PTPReasoning/Qwen2.5-7B-Base-SFT-Clean-V2
Text Generation • 8B • Updated • 8 -
PTPReasoning/Qwen2.5-7B-Base-SFT-Baseline-V2
Text Generation • 8B • Updated • 8 -
PTPReasoning/Qwen2.5-7B-Base-RL-Clean-V2
Text Generation • 8B • Updated • 7 -
PTPReasoning/Qwen2.5-7B-Base-RL-Baseline
Text Generation • 8B • Updated • 7
models
8
PTPReasoning/Llama-3.1-8B-RL-Clean-V2
8B
•
Updated
PTPReasoning/Llama-3.1-8B-RL-Baseline-V2
8B
•
Updated
PTPReasoning/Llama-3.1-8B-SFT-Baseline
Text Generation
•
8B
•
Updated
PTPReasoning/Llama-3.1-8B-SFT-Clean-V2
Text Generation
•
8B
•
Updated
PTPReasoning/Qwen2.5-7B-Base-RL-Clean-V2
Text Generation
•
8B
•
Updated
•
7
PTPReasoning/Qwen2.5-7B-Base-RL-Baseline
Text Generation
•
8B
•
Updated
•
7
PTPReasoning/Qwen2.5-7B-Base-SFT-Clean-V2
Text Generation
•
8B
•
Updated
•
8
PTPReasoning/Qwen2.5-7B-Base-SFT-Baseline-V2
Text Generation
•
8B
•
Updated
•
8
datasets
12
PTPReasoning/finqa
Viewer
•
Updated
•
1.15k
•
43
PTPReasoning/hotpot_qa
Viewer
•
Updated
•
500
•
35
PTPReasoning/PubMedQA
Viewer
•
Updated
•
1.5k
•
6
PTPReasoning/MedCalc-Bench-v1.0
Viewer
•
Updated
•
22.5k
•
30
•
1
PTPReasoning/PTP-RL-ITL-Final-Clean-V2
Viewer
•
Updated
•
19k
•
2
PTPReasoning/PTP-SFT-ITL-Final-Baseline-V2
Viewer
•
Updated
•
4.12k
•
1
PTPReasoning/PTP-SFT-ITL-Final-Clean-V2
Viewer
•
Updated
•
4.21k
•
3
PTPReasoning/PTP-RL-MedCalc-Bench
Viewer
•
Updated
•
9.34k
•
3
PTPReasoning/PTP-RL-DAPO-EN
Viewer
•
Updated
•
14.1k
•
1
PTPReasoning/mmlu_pro_biology
Viewer
•
Updated
•
717
•
5