Wenyue Hua's picture

3 11 5

Wenyue Hua

wenyueH

·

https://wenyueh.github.io/

AI & ML interests

LLM-based agent, LLM reasoning

Recent Activity

liked a dataset 10 days ago

SWE-bench/SWE-bench_Lite

liked a dataset 11 days ago

princeton-nlp/SWE-bench_Verified

upvoted a paper about 1 month ago

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

View all activity

Organizations

None yet

wenyueH's activity

commented a paper 4 months ago

InductionBench: LLMs Fail in the Simplest Complexity Class

Paper • 2502.15823 • Published Feb 20 • 7 •

commented a paper 6 months ago

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Paper • 2412.08972 • Published Dec 12, 2024 • 10 •

commented a paper 7 months ago

Game-theoretic LLM: Agent Workflow for Negotiation Games

Paper • 2411.05990 • Published Nov 8, 2024 • 8 •