Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xiaoqijian's picture
4 2

xiaoqijian

mx1024
·

AI & ML interests

None yet

Recent Activity

authored a paper 28 days ago
Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance
authored a paper 28 days ago
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design
upvoted a paper 28 days ago
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design
View all activity

Organizations

OpenReasoning's profile picture

upvoted a paper 28 days ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published about 1 month ago • 19
upvoted a collection 2 months ago

Qwen3

Collection
72 items • Updated 20 days ago • 824
upvoted a collection 4 months ago

TinyR1

Collection
2 items • Updated Apr 21 • 3
upvoted an article 4 months ago
view article
Article

Open R1: Update #3

By open-r1 and 9 others •
Mar 11
• 294
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs