Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xiaoqijian's picture
4 2

xiaoqijian

mx1024
ยท

AI & ML interests

None yet

Recent Activity

authored a paper 25 days ago
Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance
authored a paper 25 days ago
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design
upvoted a paper 25 days ago
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design
View all activity

Organizations

OpenReasoning's profile picture

Papers 2

arxiv:2506.04734
arxiv:2502.12459

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs