infinitylogesh/book_dataset_no_mem_token_gte_largev1_5_M512_C1024_1B Viewer • Updated about 14 hours ago • 606k
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt50 2B • Updated 4 days ago • 8
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt100 2B • Updated 4 days ago • 10
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_rollout_16_fullfinetuning_merged 2B • Updated 4 days ago • 8
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Single-Stage-Rollout-16-Full-Finetuning 2B • Updated 6 days ago • 8
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-2 Text Generation • 2B • Updated 9 days ago • 29
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-1 Text Generation • 2B • Updated 12 days ago • 11.6k