DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models Paper • 2504.15716 • Published 5 days ago • 1
The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Paper • 2504.15521 • Published 5 days ago • 58