When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration Paper • 2506.05579 • Published 7 days ago • 3
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval Paper • 2407.12883 • Published Jul 16, 2024 • 10
Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs Article • By StringChaos and 6 others • Apr 16, 2024 • 15