Weijie Xu

xwjzds

https://weijiexu.com/

AI & ML interests

LLM Evaluation @Amazon

Recent Activity

authored 4 papers 1 day ago

Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation

Paper • 2310.18794 • Published Oct 28, 2023

PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

Paper • 2403.02246 • Published Mar 4, 2024 • 1

FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning

Paper • 2505.08054 • Published May 12 • 1

Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective

Paper • 2506.19028 • Published 3 days ago • 1

New activity in weijiejailbreak/bias_eval_advice_format 1 day ago

Update README.md

#2 opened 1 day ago by

xwjzds

New activity in weijiejailbreak/bias_eval_suggestion_format 1 day ago

Update README.md

#1 opened 1 day ago by

xwjzds

upvoted a paper 1 day ago

Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective

Paper • 2506.19028 • Published 3 days ago • 1

commented on a paper 1 day ago

Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective

Paper • 2506.19028 • Published 3 days ago • 1

upvoted 5 papers 1 day ago

FaithfulSAE: Towards Capturing Faithful Features with Sparse Autoencoders without External Dataset Dependencies

Paper • 2506.17673 • Published 5 days ago • 6

SoK: Evaluating Jailbreak Guardrails for Large Language Models

Paper • 2506.10597 • Published 14 days ago • 3

commented on a paper 24 days ago

SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions

Paper • 2506.00643 • Published 26 days ago • 5

upvoted a paper 24 days ago

SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions

Paper • 2506.00643 • Published 26 days ago • 5

liked a dataset about 1 month ago

AmazonScience/FalseReject

Viewer • Updated May 14 • 15.8k • 187 • 4

liked a dataset 3 months ago

weijiejailbreak/r1-1776-jailbreak

Viewer • Updated Mar 17 • 36 • 124 • 3

upvoted a paper 10 months ago

Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation

Paper • 2406.03703 • Published Jun 6, 2024 • 2

upvoted a collection about 1 year ago

text2text diffusion

Collection

2 items • Updated Feb 17, 2024 • 1

New activity in xwjzds/extractive_qa_question_answering_hr over 1 year ago

Librarian Bot: Add language metadata for dataset

#1 opened over 1 year ago by

librarian-bot
