MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge Paper • 2507.21183 • Published Jul 27
Contextual Integrity in LLMs via Reasoning and Reinforcement Learning Paper • 2506.04245 • Published May 29