Weijie Xu

xwjzds

https://weijiexu.com/

AI & ML interests

LLM Evaluation @Amazon

Recent Activity

authored a paper about 2 months ago

Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation

authored a paper about 2 months ago

PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

authored a paper about 2 months ago

FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning

authored 4 papers about 2 months ago

Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation

Paper • 2310.18794 • Published Oct 28, 2023

PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

Paper • 2403.02246 • Published Mar 4, 2024 • 1

FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning

Paper • 2505.08054 • Published May 12 • 2

Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective

Paper • 2506.19028 • Published Jun 23 • 1

New activity in weijiejailbreak/bias_eval_advice_format about 2 months ago

Update README.md

#2 opened about 2 months ago by

xwjzds

New activity in weijiejailbreak/bias_eval_suggestion_format about 2 months ago

Update README.md

#1 opened about 2 months ago by

xwjzds

upvoted a paper about 2 months ago

Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective

Paper • 2506.19028 • Published Jun 23 • 1

commented on a paper about 2 months ago

Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective

Paper • 2506.19028 • Published Jun 23 • 1

upvoted 5 papers about 2 months ago

FaithfulSAE: Towards Capturing Faithful Features with Sparse Autoencoders without External Dataset Dependencies

Paper • 2506.17673 • Published Jun 21 • 6

SoK: Evaluating Jailbreak Guardrails for Large Language Models

Paper • 2506.10597 • Published Jun 12 • 3

commented on a paper 2 months ago

SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions

Paper • 2506.00643 • Published May 31 • 5

upvoted a paper 2 months ago

SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions

Paper • 2506.00643 • Published May 31 • 5

liked a dataset 3 months ago

AmazonScience/FalseReject

Viewer • Updated May 14 • 15.8k • 388 • 13

liked a dataset 4 months ago

weijiejailbreak/r1-1776-jailbreak

Viewer • Updated Mar 17 • 36 • 95 • 5

upvoted a paper 12 months ago

Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation

Paper • 2406.03703 • Published Jun 6, 2024 • 2

upvoted a collection over 1 year ago

text2text diffusion

Collection

2 items • Updated Feb 17, 2024 • 1

New activity in xwjzds/extractive_qa_question_answering_hr over 1 year ago

Librarian Bot: Add language metadata for dataset

#1 opened over 1 year ago by

librarian-bot
