TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 65
SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation Paper • 2410.16665 • Published Oct 22, 2024
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback Paper • 2410.19133 • Published Oct 24, 2024 • 11
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models Paper • 2402.16786 • Published Feb 26, 2024
QASem Parsing: Text-to-text Modeling of QA-based Semantics Paper • 2205.11413 • Published May 23, 2022
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback Paper • 2406.09279 • Published Jun 13, 2024 • 3
The Art of Saying No: Contextual Noncompliance in Language Models Paper • 2407.12043 • Published Jul 2, 2024 • 4
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published Jun 7, 2024 • 31
RewardBench: Evaluating Reward Models for Language Modeling Paper • 2403.13787 • Published Mar 20, 2024 • 23
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement Paper • 2310.08559 • Published Oct 12, 2023 • 1
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations Paper • 2212.10409 • Published Dec 20, 2022
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design Paper • 2304.00815 • Published Apr 3, 2023
Draw Me a Flower: Processing and Grounding Abstraction in Natural Language Paper • 2106.14321 • Published Jun 27, 2021 • 1
Asking It All: Generating Contextualized Questions for any Semantic Role Paper • 2109.04832 • Published Sep 10, 2021
The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing Paper • 2106.08037 • Published Jun 15, 2021
QADiscourse -- Discourse Relations as QA Pairs: Representation, Crowdsourcing and Baselines Paper • 2010.02815 • Published Oct 6, 2020
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties Paper • 2309.00779 • Published Sep 2, 2023 • 1
PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning Paper • 2305.19472 • Published May 31, 2023 • 1
Revisiting Sentence Union Generation as a Testbed for Text Consolidation Paper • 2305.15605 • Published May 24, 2023