Le Yu

vanillaOVO

https://yule-buaa.github.io/

yule-BUAA

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window

upvoted a paper 3 months ago

Agentic Reinforced Policy Optimization

upvoted a paper 3 months ago

Group Sequence Policy Optimization

Organizations

None yet

authored a paper 3 months ago

RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback

Paper • 2507.15024 • Published Jul 20 • 14

authored 9 papers 5 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 305

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 184

Towards Better Dynamic Graph Learning: New Architecture and Unified Library

Paper • 2303.13047 • Published Mar 23, 2023

Heterogeneous Graph Representation Learning with Relation Awareness

Paper • 2105.11122 • Published May 24, 2021

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17, 2024 • 17

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 71

Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement

Paper • 2408.03092 • Published Aug 6, 2024 • 1

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34