RefalMachine/ruadapt_qwen2.5_3B_ext_u48_full_lr5e4_bs256 Text Generation • 3B • Updated Oct 15, 2024 • 19
RefalMachine/ruadapt_qwen2.5_3B_ext_u48_full_lr3e4_bs256 Text Generation • 3B • Updated Oct 14, 2024 • 17
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_full_lr5e4_bs256 3B • Updated Oct 13, 2024 • 7
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr5e4_2k_bs256 3B • Updated Oct 11, 2024 • 7
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr3e4_2k_bs256 3B • Updated Oct 11, 2024 • 7
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr2e4_2k_bs256 3B • Updated Oct 11, 2024 • 7
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_full_lr3e4_bs256 3B • Updated Oct 10, 2024 • 5
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_full_lr2e4_bs256 3B • Updated Oct 10, 2024 • 7