Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
James X. Zhao's picture
1 7 3

James X. Zhao

JamesXZ
jimmyneu's profile picture 21world's profile picture jersonalvr's profile picture
·
  • XuZhao0

AI & ML interests

None yet

Organizations

National University of Singapore's profile picture

upvoted 2 papers 3 months ago

How Does Response Length Affect Long-Form Factuality

Paper • 2505.23295 • Published May 29, 2025 • 1

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 174
upvoted 2 papers 4 months ago

SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge

Paper • 2509.07968 • Published Sep 9, 2025 • 14

Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet

Paper • 2509.06861 • Published Sep 8, 2025 • 8
upvoted a paper 6 months ago

Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning

Paper • 2502.11962 • Published Feb 17, 2025 • 38
upvoted a paper 7 months ago

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Paper • 2506.02096 • Published Jun 2, 2025 • 52
upvoted a paper 8 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 120
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs