Jeff
JiayuJeff
ยท
AI & ML interests
None yet
Recent Activity
commented on
a paper
1 day ago
CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in
Dynamic Environments for LLM Tool-Use Agents
authored
a paper
1 day ago
Mathematical Proof as a Litmus Test: Revealing Failure Modes of Advanced
Large Reasoning Models
Organizations
None yet