Models trained using data with different filtering strategies (difficulty, quality filtering)
AI & ML interests
None defined yet.
models
12

ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-all-RL
8B
•
Updated
•
2

ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-quality-difficulty-RL
Updated
•
6

ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-difficulty-RL
8B
•
Updated
•
3

ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-Huatuo-SFT-quality-RL
8B
•
Updated
•
3

ReasoningEval/DeepSeek-R1-Distill-Qwen-7B-RL
8B
•
Updated
•
4

ReasoningEval/Qwen2.5-7B-Huatuo-RL
8B
•
Updated
•
7

ReasoningEval/Qwen2.5-7B-Huatuo-quality-SFT-RL
8B
•
Updated
•
4

ReasoningEval/Qwen2.5-7B-Huatuo-quality-difficulty-SFT-RL
8B
•
Updated
•
3

ReasoningEval/Qwen2.5-7B-Huatuo-difficulty-SFT-RL
8B
•
Updated
•
3

ReasoningEval/Qwen2.5-7B-Huatuo-all-SFT-RL
8B
•
Updated
•
3
datasets
0
None public yet