Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation Paper • 2310.18794 • Published Oct 28, 2023
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models Paper • 2403.02246 • Published Mar 4, 2024 • 1
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning Paper • 2505.08054 • Published May 12 • 2
Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective Paper • 2506.19028 • Published Jun 23 • 1
Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective Paper • 2506.19028 • Published Jun 23 • 1
Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective Paper • 2506.19028 • Published Jun 23 • 1 • 1
OAgents: An Empirical Study of Building Effective Agents Paper • 2506.15741 • Published Jun 17 • 35
ConsumerBench: Benchmarking Generative AI Applications on End-User Devices Paper • 2506.17538 • Published Jun 21 • 7
Steering Conceptual Bias via Transformer Latent-Subspace Activation Paper • 2506.18887 • Published Jun 23 • 6
FaithfulSAE: Towards Capturing Faithful Features with Sparse Autoencoders without External Dataset Dependencies Paper • 2506.17673 • Published Jun 21 • 6
SoK: Evaluating Jailbreak Guardrails for Large Language Models Paper • 2506.10597 • Published Jun 12 • 3
SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions Paper • 2506.00643 • Published May 31 • 5 • 2
SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions Paper • 2506.00643 • Published May 31 • 5
Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation Paper • 2406.03703 • Published Jun 6, 2024 • 2