J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning • Paper • 2505.10320 • Published 24 days ago • 22
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification • Paper • 2010.12421 • Published Oct 23, 2020 • 1