Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 4 days ago • 66
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens Paper • 2502.18890 • Published 12 days ago • 23
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 7 days ago • 65
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published 11 days ago • 29
Chain of Draft: Thinking Faster by Writing Less Paper • 2502.18600 • Published 12 days ago • 44
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters Space • 2.15k
MoBA: Mixture of Block Attention for Long-Context LLMs Paper • 2502.13189 • Published 20 days ago • 14
LightThinker: Thinking Step-by-Step Compression Paper • 2502.15589 • Published 17 days ago • 26
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 28 days ago • 142
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 108
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published Jan 27 • 26
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22 • 101
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 84
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 341