wongyukim

kimwongyuda

AI & ML interests

None yet

Organizations

None yet

upvoted 5 papers about 8 hours ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published 2 days ago • 5

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published 8 days ago • 12

AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Paper • 2508.09889 • Published 2 days ago • 25

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published 4 days ago • 35

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Paper • 2508.09983 • Published 2 days ago • 48

upvoted 4 papers 1 day ago

upvoted 6 papers 2 days ago

Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

Paper • 2508.07101 • Published 6 days ago • 12

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published 4 days ago • 26

Reinforcement Learning in Vision: A Survey

Paper • 2508.08189 • Published 4 days ago • 25

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published 8 days ago • 34

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published 4 days ago • 96

ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability

Paper • 2508.07050 • Published 6 days ago • 107

upvoted a paper 3 days ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published 7 days ago • 135

upvoted 4 papers 5 days ago

Marco-Voice Technical Report

Paper • 2508.02038 • Published 11 days ago • 15

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Paper • 2508.02120 • Published 11 days ago • 16

Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?

Paper • 2508.03644 • Published 10 days ago • 24

Are Today's LLMs Ready to Explain Well-Being Concepts?

Paper • 2508.03990 • Published 10 days ago • 23
