The official datasets and model checkpoints of ARPO
KABI
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
liked
a model
1 day ago
dongguanting/RAG-Critic-3B
upvoted
a
paper
1 day ago
VeriGUI: Verifiable Long-Chain GUI Dataset
upvoted
a
paper
1 day ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens