From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models Paper • 2508.13491 • Published 7 days ago • 58
Me LLaMA: Foundation Large Language Models for Medical Applications Paper • 2402.12749 • Published Feb 20, 2024 • 2
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent Paper • 2412.18174 • Published Dec 24, 2024
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published Feb 9 • 42
FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information Paper • 2505.20650 • Published May 27 • 16
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation Paper • 2506.14028 • Published Jun 16 • 92
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent Paper • 2412.18174 • Published Dec 24, 2024
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published Feb 9 • 42
RKEFino1: A Regulation Knowledge-Enhanced Large Language Model Paper • 2506.05700 • Published Jun 6 • 4
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation Paper • 2506.14028 • Published Jun 16 • 92
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications Paper • 2503.20990 • Published Mar 26 • 19
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications Paper • 2503.20990 • Published Mar 26 • 19
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications Paper • 2503.20990 • Published Mar 26 • 19
FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Paper • 2502.11433 • Published Feb 17 • 37
FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Paper • 2502.11433 • Published Feb 17 • 37
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent Paper • 2412.18174 • Published Dec 24, 2024
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published Feb 9 • 42
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published Feb 12 • 59
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models Paper • 2410.14059 • Published Oct 17, 2024 • 62