Yu li

Yukkkop

427 596

AI & ML interests

None yet

Recent Activity

liked a model about 14 hours ago

Zyphra/ZUNA1.1

liked a model 3 days ago

tencent/Rosetta-inference

upvoted a paper 3 days ago

ShortOPD: Recovering Pruned LLMs with Short-to-Long On-Policy Distillation

View all activity

Organizations

None yet

upvoted 4 papers 3 days ago

upvoted 2 papers 5 days ago

Generative World Renderer at the Speed of Play

Paper • 2607.18703 • Published 13 days ago • 81

Beyond Euclidean Clipping: Overcoming Exploration Collapse in LLM RL via Riemannian Isometric Policy Optimization

Paper • 2607.10169 • Published 23 days ago • 14

upvoted an article 5 days ago

Article

LFM2.5-Encoders for Fast Long-Context Inference on CPU

LiquidAI

•

6 days ago

• 60

upvoted a paper 10 days ago

KAT-Coder-V2.5 Technical Report

Paper • 2607.05471 • Published 28 days ago • 12

upvoted 5 papers 14 days ago

What LLM Forecasters Know but Don't Say: Probing Internal Representations for Calibration and Faithfulness

Paper • 2607.08046 • Published 25 days ago • 14

Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models

Paper • 2605.26895 • Published May 26 • 23

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published May 25 • 139

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published May 13 • 61

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 36

upvoted 7 papers 20 days ago

Morphing into Hybrid Attention Models

Paper • 2606.30562 • Published Jun 29 • 52

Efficient Reasoning via Thought-Training and Thought-Free Inference

Paper • 2511.03408 • Published Nov 5, 2025 • 1

Why Fine-Tuning Encourages Hallucinations and How to Fix It

Paper • 2604.15574 • Published Apr 16 • 26

ESPO: Early-Stopping Proximal Policy Optimization

Paper • 2605.29860 • Published May 28 • 21

Unsupervised Process Reward Models

Paper • 2605.10158 • Published May 11 • 28

Trust Region Policy Distillation

Paper • 2607.04751 • Published 28 days ago • 35

Jet-Long: Efficient Long-Context Extension with Dynamic Bifocal RoPE

Paper • 2607.07740 • Published 26 days ago • 24

Yu li

AI & ML interests

Recent Activity

Organizations

Yukkkop's activity

LFM2.5-Encoders for Fast Long-Context Inference on CPU