Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models Paper • 2305.03025 • Published May 4, 2023
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published May 1, 2025
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19, 2025
First Try Matters: Revisiting the Role of Reflection in Reasoning Models Paper • 2510.08308 • Published Oct 9, 2025
Preference Optimization for Reasoning with Pseudo Feedback Paper • 2411.16345 • Published Nov 25, 2024
Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks Paper • 2410.01428 • Published Oct 2, 2024
How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library Paper • 2404.00699 • Published Mar 31, 2024
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning? Paper • 2404.12728 • Published Apr 19, 2024
Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training Paper • 2404.14604 • Published Apr 22, 2024
Improving In-context Learning via Bidirectional Alignment Paper • 2312.17055 • Published Dec 28, 2023
Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing Paper • 2402.00658 • Published Feb 1, 2024
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up? Paper • 2311.16989 • Published Nov 28, 2023
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning Paper • 2309.04766 • Published Sep 9, 2023
Retrieving Multimodal Information for Augmented Generation: A Survey Paper • 2303.10868 • Published Mar 20, 2023
LogicLLM: Exploring Self-supervised Logic-enhanced Training for Large Language Models Paper • 2305.13718 • Published May 23, 2023
REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training Paper • 2105.04201 • Published May 10, 2021