-
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Paper • 2502.08946 • Published • 195 -
PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts
Paper • 2508.09848 • Published • 33 -
ttchungc/PRELUDE
Viewer • Updated • 1.16k • 71 • 3 -
ShunchiZhang/PhysiCo
Viewer • Updated • 600 • 79 • 5
Mo
BishopGorov
AI & ML interests
None yet
Recent Activity
updated
a collection
about 24 hours ago
AGI_assessments
updated
a collection
about 24 hours ago
AGI_assessments
updated
a collection
about 24 hours ago
AGI_assessments
Organizations
None yet