OpenCodeReasoning • Collection • Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 7 items • Updated 4 days ago • 16
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos-GGUF • Reinforcement Learning • Updated 4 days ago • 246 • 2
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF • Reinforcement Learning • Updated 8 days ago • 103 • 2