mradermacher/Tifa-DeepsexV2-7b-MGRPO-safetensors-GGUF Reinforcement Learning • Updated Mar 2 • 257 • 1
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos Reinforcement Learning • Updated 15 days ago • 36 • 4
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos Reinforcement Learning • Updated 15 days ago • 61 • 3
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • Updated 16 days ago • 21 • 3