21 60 51

Linzheng Chai

Challenging666

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

upvoted a paper 1 day ago

A Survey on Latent Reasoning

upvoted a paper 1 day ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

View all activity

Organizations

upvoted 3 papers 1 day ago

upvoted a collection 15 days ago

Multilingual-Multimodal-Code

Collection

4 items • Updated 15 days ago • 1

upvoted a paper 16 days ago

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published 23 days ago • 36

upvoted a paper 22 days ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published 25 days ago • 61

upvoted a paper 23 days ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published 29 days ago • 32

upvoted a paper about 2 months ago

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published May 12 • 27

upvoted a paper 3 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

upvoted 2 papers 4 months ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20 • 49

FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis

Paper • 2503.13265 • Published Mar 17 • 15

upvoted an article 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 181

upvoted 3 papers 4 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 67

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published Feb 23 • 37

upvoted a paper 5 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 105

upvoted a paper 6 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 119

upvoted 3 papers 7 months ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published Dec 19, 2024 • 74

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published Dec 12, 2024 • 21

Linzheng Chai

AI & ML interests

Recent Activity

Organizations

Challenging666's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge