PandaLLMCommunity

mzf666

authored 5 papers 2 months ago

Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models

Paper • 2305.03025 • Published May 4, 2023

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 36

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 134

Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6 • 30

First Try Matters: Revisiting the Role of Reflection in Reasoning Models

Paper • 2510.08308 • Published Oct 9 • 24

chitanda

authored a paper 11 months ago

Preference Optimization for Reasoning with Pseudo Feedback

Paper • 2411.16345 • Published Nov 25, 2024 • 1

chitanda

authored a paper about 1 year ago

Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks

Paper • 2410.01428 • Published Oct 2, 2024 • 1

chitanda

authored 3 papers over 1 year ago

How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library

Paper • 2404.00699 • Published Mar 31, 2024

Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?

Paper • 2404.12728 • Published Apr 19, 2024

Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training

Paper • 2404.14604 • Published Apr 22, 2024

chitanda

authored 2 papers almost 2 years ago

Improving In-context Learning via Bidirectional Alignment

Paper • 2312.17055 • Published Dec 28, 2023

Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing

Paper • 2402.00658 • Published Feb 1, 2024

chitanda

authored 2 papers about 2 years ago

ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?

Paper • 2311.16989 • Published Nov 28, 2023

Unanswerable Visual Question Answering

Paper • 2310.10942 • Published Oct 17, 2023

qcw

updated 2 models about 2 years ago

PandaLLMCommunity/panda-index-large-zh

PandaLLMCommunity/panda-index-large-en

chitanda

authored 4 papers about 2 years ago

SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning

Paper • 2309.04766 • Published Sep 9, 2023

Retrieving Multimodal Information for Augmented Generation: A Survey

Paper • 2303.10868 • Published Mar 20, 2023

LogicLLM: Exploring Self-supervised Logic-enhanced Training for Large Language Models

Paper • 2305.13718 • Published May 23, 2023

REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training

Paper • 2105.04201 • Published May 10, 2021

Team members 5
