AI & ML interests
None defined yet.
Recent Activity
View all activity
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
-
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
Text Generation • 2B • Updated • 70 • 1 -
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-2k
Text Generation • 2B • Updated • 3 -
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-4k
Text Generation • 2B • Updated • 7 -
Shiyu-Lab/QwQ-32B-thinkprune-4k
Text Generation • 33B • Updated • 2
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
-
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
Text Generation • 2B • Updated • 70 • 1 -
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-2k
Text Generation • 2B • Updated • 3 -
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-4k
Text Generation • 2B • Updated • 7 -
Shiyu-Lab/QwQ-32B-thinkprune-4k
Text Generation • 33B • Updated • 2
models 30
Shiyu-Lab/HarnessLLM_SFT_Llama3_3B
4B • Updated
Shiyu-Lab/Inputoutput_SFT_Llama3_3B
4B • Updated
Shiyu-Lab/Inputoutput_SFT_Qwen3_4B
4B • Updated
• 217
Shiyu-Lab/HarnessLLM_SFT_Qwen3_4B
4B • Updated
• 786
Shiyu-Lab/Inputoutput_RL_Llama3_3B
4B • Updated
• 1
Shiyu-Lab/HarnessLLM_RL_Llama3_3B
4B • Updated
Shiyu-Lab/Inputoutput_RL_Qwen3_4B
4B • Updated
Shiyu-Lab/HarnessLLM_RL_Qwen3_4B
4B • Updated
Shiyu-Lab/QwQ-32B-thinkprune-iter2k
Text Generation • 33B • Updated
• 3
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-3k
Text Generation • 2B • Updated
• 5
datasets 14
Shiyu-Lab/WebArena_video_demo
Viewer
• Updated
• 3.69k • 16
Shiyu-Lab/OSWorld_video_demo
Preview
• Updated
• 15
Shiyu-Lab/Testcase_eval_data
Viewer
• Updated
• 215 • 48
Shiyu-Lab/Testcase_RL_Data
Viewer
• Updated
• 12k • 42
Shiyu-Lab/Inputoutput_SFT_Data
Viewer
• Updated
• 15.6k • 12
Shiyu-Lab/HarnessLLM_SFT_Data
Viewer
• Updated
• 15.6k • 10
Shiyu-Lab/Testcase_MBPPHard
Viewer
• Updated
• 141 • 16
Shiyu-Lab/Testcase_CF_Seen
Viewer
• Updated
• 100 • 10
Shiyu-Lab/Testcase_CF_Unseen
Viewer
• Updated
• 84 • 13
Shiyu-Lab/Testcase_LCB_Unseen
Viewer
• Updated
• 93 • 17