hangyu guo

Rosiness

AI & ML interests

Natural Language Processing

Recent Activity

upvoted 2 papers 4 days ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 4 days ago • 23

WorldCache: Content-Aware Caching for Accelerated Video World Models

Paper • 2603.22286 • Published 4 days ago • 4

upvoted a paper 5 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published 10 days ago • 104

upvoted 2 papers 9 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 10 days ago • 131

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 15 days ago • 32

authored a paper 11 days ago

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Paper • 2603.13391 • Published 17 days ago • 19

upvoted a paper 17 days ago

$OneMillion-Bench: How Far are Language Agents from Human Experts?

Paper • 2603.07980 • Published 19 days ago • 27

upvoted a paper 18 days ago

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Paper • 2603.03447 • Published 24 days ago • 37

upvoted a paper 21 days ago

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Paper • 2602.23166 • Published 29 days ago • 44

upvoted 2 papers 24 days ago

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Paper • 2603.02578 • Published 25 days ago • 25

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published 26 days ago • 61

upvoted a paper 25 days ago

Spectral Condition for μP under Width-Depth Scaling

Paper • 2603.00541 • Published 28 days ago • 15

upvoted 7 papers about 1 month ago

K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

Paper • 2602.19128 • Published Feb 22 • 7

DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Paper • 2602.11089 • Published Feb 11 • 18

CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion

Paper • 2602.10999 • Published Feb 11 • 10

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published Feb 10 • 36

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published Feb 11 • 30

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 201

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 72

upvoted a paper about 2 months ago

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 117
