Reasoning Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team Paper • 2506.14234 • Published Jun 17 • 41
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team Paper • 2506.14234 • Published Jun 17 • 41
Useful stuff Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks Paper • 2005.11401 • Published May 22, 2020 • 14
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks Paper • 2005.11401 • Published May 22, 2020 • 14
Translation haoranxu/ALMA-13B-R Text Generation • 13B • Updated Jan 19, 2024 • 12.2k • • 82 CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26 facebook/nllb-200-distilled-600M Translation • Updated Feb 14, 2024 • 218k • 814
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26
To read ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published Dec 19, 2024 • 16 Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published Dec 19, 2024 • 16
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52
LocalModels TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF 47B • Updated Dec 14, 2023 • 28.9k • 650 Octopus v2: On-device language model for super agent Paper • 2404.01744 • Published Apr 2, 2024 • 58
Octopus v2: On-device language model for super agent Paper • 2404.01744 • Published Apr 2, 2024 • 58
Reasoning Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team Paper • 2506.14234 • Published Jun 17 • 41
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team Paper • 2506.14234 • Published Jun 17 • 41
To read ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published Dec 19, 2024 • 16 Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • 2412.14711 • Published Dec 19, 2024 • 16
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 52
Useful stuff Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks Paper • 2005.11401 • Published May 22, 2020 • 14
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks Paper • 2005.11401 • Published May 22, 2020 • 14
LocalModels TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF 47B • Updated Dec 14, 2023 • 28.9k • 650 Octopus v2: On-device language model for super agent Paper • 2404.01744 • Published Apr 2, 2024 • 58
Octopus v2: On-device language model for super agent Paper • 2404.01744 • Published Apr 2, 2024 • 58
Translation haoranxu/ALMA-13B-R Text Generation • 13B • Updated Jan 19, 2024 • 12.2k • • 82 CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26 facebook/nllb-200-distilled-600M Translation • Updated Feb 14, 2024 • 218k • 814
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26