MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML Paper • 2509.06806 • Published Sep 2025 • 63
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published Aug 14 • 59
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19 • 131
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10 • 33
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1 • 79
DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization? Paper • 2504.08120 • Published Apr 10 • 3
Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging Paper • 2503.20641 • Published Mar 26 • 9
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper • 2502.16894 • Published Feb 24 • 32
Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning Paper • 2502.17407 • Published Feb 24 • 26
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 69
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published Feb 20 • 104
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering Paper • 2502.13962 • Published Feb 19 • 29
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities? Paper • 2502.12215 • Published Feb 17 • 16