zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink_20k-GRPO_step80 8B • Updated Jul 29 • 8
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step160 8B • Updated Jul 29 • 8
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step240 8B • Updated Jul 29 • 8
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step304 8B • Updated Jul 29 • 8
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step256 8B • Updated Jul 29 • 8
zhangchenxu/RB-Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp6_hybrid_nothink-GRPO_step144 8B • Updated Jul 28 • 6
zhangchenxu/Qwen2.5-VL-7B-Instruct-SFT-visualsphinx_10k_random-LR2.0e-5-EPOCHS3-LF Image-to-Text • 8B • Updated Jul 27 • 8
zhangchenxu/Qwen2.5-VL-7B-Instruct-SFT-visualsphinx_10k_reject-LR2.0e-5-EPOCHS3-LF Image-to-Text • 8B • Updated Jul 27 • 7
zhangchenxu/Qwen2.5-VL-7B-Instruct-vlr_syn_filtered_10k_exp12_nothink-GRPO-01_step256 8B • Updated May 14 • 4