He Yifan

jackt34

AI & ML interests

Alignment-focused model research.

Recent Activity

upvoted a paper 5 days ago

In-Context World Modeling for Robotic Control

liked a model 11 days ago

upvoted a paper 12 days ago

Looped World Models

View all activity

Organizations

None yet

upvoted a paper 5 days ago

In-Context World Modeling for Robotic Control

Paper • 2606.26025 • Published 7 days ago • 61

upvoted a paper 12 days ago

Looped World Models

Paper • 2606.18208 • Published 16 days ago • 473

upvoted a paper 28 days ago

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published May 28 • 250

upvoted 2 papers 30 days ago

HL-OutPaint: Coarse-to-Fine Video Outpainting for High-Resolution Long-Range Videos

Paper • 2605.17543 • Published May 19 • 14

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published May 28 • 146

upvoted 6 papers about 1 month ago

Unified Panoramic Geometry Estimation via Multi-View Foundation Models

Paper • 2605.26368 • Published May 25 • 4

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Paper • 2605.17757 • Published May 18 • 66

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published May 14 • 147

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 196

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published May 14 • 116

upvoted 2 papers about 2 months ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 106

AcademiClaw: When Students Set Challenges for AI Agents

Paper • 2605.02661 • Published May 4 • 17

upvoted 2 papers 3 months ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 639

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 329