Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models Paper • 2411.06272 • Published 8 days ago • 3 • 2