Dai

Yinpei

https://yinpeidai.github.io/

AI & ML interests

None yet

Recent Activity

published a model about 1 month ago

Yinpei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Yinpei's activity

upvoted a paper 9 days ago

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation

Paper • 2506.09350 • Published 11 days ago • 47

upvoted a paper 16 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 19 days ago • 158

upvoted a collection about 2 months ago

Qwen3

Collection

72 items • Updated 6 days ago • 788

upvoted 3 articles 3 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

and 3 others • Feb 4 • 165

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

Jul 9, 2024 • 55

Article

SmolVLM2: Bringing Video Understanding to Every Device

and 6 others • Feb 20 • 268

upvoted a paper 4 months ago

Phantom: Subject-consistent video generation via cross-modal alignment

Paper • 2502.11079 • Published Feb 16 • 60

upvoted an article 5 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others • Feb 4 • 1.26k

upvoted a collection 7 months ago

Cosmos-Tokenizer

Collection

A suite of image and video tokenizers • 13 items • Updated 4 days ago • 40

upvoted 4 papers 8 months ago

Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use

Paper • 2410.24218 • Published Oct 31, 2024 • 6

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 139

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 39

The Imperative of Conversation Analysis in the Era of LLMs: A Survey of Tasks, Techniques, and Trends

Paper • 2409.14195 • Published Sep 21, 2024 • 13

upvoted 2 papers 9 months ago

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

Paper • 2111.14592 • Published Nov 29, 2021 • 1

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Paper • 2409.14674 • Published Sep 23, 2024 • 44

upvoted 2 papers almost 2 years ago

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 171

Segment Anything Meets Point Tracking

Paper • 2307.01197 • Published Jul 3, 2023 • 35
