ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 Reinforcement Learning • 8B • Updated Mar 26 • 9.6k • 221
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28 • 2.18k • 185
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16 Reinforcement Learning • 8B • Updated Mar 25 • 4.64k • 86
tensorblock/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16-GGUF Reinforcement Learning • 8B • Updated Jul 9 • 194 • 1
takedakoji00/Llama-3.1-8B-Instruct-custom-qg-full_20250219-7th_random_pad_is_eos_test Reinforcement Learning • Updated Feb 28 • 2
takedakoji00/Llama-3.1-8B-Instruct-custom-qg-full_20250219-7th_random_pad_is_eos_ppo_2nd Reinforcement Learning • Updated Feb 28 • 2
takedakoji00/Llama-3.1-8B-Instruct-custom-qg-full_20250219-7th_random_pad_is_eos_offline_nav Reinforcement Learning • 5B • Updated Mar 1 • 3
takedakoji00/Llama-3.1-8B-Instruct-custom-qg-full_20250219-7th_random_pad_is_eos_offline_nav_2nd Reinforcement Learning • 5B • Updated Mar 1 • 2
takedakoji00/Llama-3.1-8B-Instruct-custom-qg-full_20250219-7th_random_pad_is_eos_ppo_3rd Reinforcement Learning • Updated Mar 2 • 3
mradermacher/Tifa-DeepsexV2-7b-MGRPO-safetensors-GGUF Reinforcement Learning • 8B • Updated Mar 2 • 182 • 1
mradermacher/Tifa-DeepsexV2-7b-MGRPO-safetensors-i1-GGUF Reinforcement Learning • 8B • Updated Mar 2 • 426