lee dong ryeol's picture

lee dong ryeol

drlee1

·

DONGRYEOLLEE1

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago

google/embeddinggemma-300m

liked a model about 1 month ago

Qwen/Qwen3-Next-80B-A3B-Instruct

upvoted a paper about 1 month ago

Memp: Exploring Agent Procedural Memory

View all activity

Organizations

None yet

upvoted 4 papers about 1 month ago

Memp: Exploring Agent Procedural Memory

Paper • 2508.06433 • Published Aug 8 • 34

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published Aug 30 • 68

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 112

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 213

upvoted an article 2 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

By

•

Aug 9

• 38

upvoted 4 papers 2 months ago

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24 • 85

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 236

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 302

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 257

upvoted a paper 4 months ago

Large Language Models for Data Synthesis

Paper • 2505.14752 • Published May 20 • 49

upvoted 3 papers 5 months ago

CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images

Paper • 2310.16825 • Published Oct 25, 2023 • 36

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

Paper • 2505.00358 • Published May 1 • 26

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 88

upvoted 5 papers 6 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

Efficient Pretraining Length Scaling

Paper • 2504.14992 • Published Apr 21 • 20

Antidistillation Sampling

Paper • 2504.13146 • Published Apr 17 • 60

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10 • 29

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16 • 75

upvoted an article 7 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 465

upvoted a paper 8 months ago

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published Dec 16, 2024 • 36