Wenyue Hua's picture

3 11 5

Wenyue Hua

wenyueH

·

https://wenyueh.github.io/

AI & ML interests

LLM-based agent, LLM reasoning

Recent Activity

liked a dataset 10 days ago

SWE-bench/SWE-bench_Lite

liked a dataset 11 days ago

princeton-nlp/SWE-bench_Verified

upvoted a paper about 1 month ago

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

View all activity

Organizations

None yet

wenyueH's activity

upvoted a paper about 1 month ago

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31 • 54

upvoted 2 papers about 2 months ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published Apr 17 • 24

OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published Apr 21 • 33

upvoted a paper 4 months ago

InductionBench: LLMs Fail in the Simplest Complexity Class

Paper • 2502.15823 • Published Feb 20 • 7

upvoted 2 papers 6 months ago

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Paper • 2412.08972 • Published Dec 12, 2024 • 10

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published Dec 11, 2024 • 54

upvoted a paper 7 months ago

Game-theoretic LLM: Agent Workflow for Negotiation Games

Paper • 2411.05990 • Published Nov 8, 2024 • 8

upvoted 2 papers over 1 year ago

How to Index Item IDs for Recommendation Foundation Models

Paper • 2305.06569 • Published May 11, 2023 • 1

The Impact of Reasoning Step Length on Large Language Models

Paper • 2401.04925 • Published Jan 10, 2024 • 18