37 37 247

Kaizhao Liang PRO

kz919

https://kyleliang919.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

rStar2-Agent: Agentic Reasoning Technical Report

updated a model 10 days ago

kz919/simpletuner-lora

updated a model 10 days ago

kz919/simpletuner-lora

View all activity

Organizations

upvoted a paper 9 days ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published 11 days ago • 98

upvoted a changelog 28 days ago

Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30

• 175

upvoted an article about 1 month ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

and 2 others •

Aug 14, 2024

• 69

upvoted a paper about 1 month ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165

upvoted an article about 2 months ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

and 3 others •

Jul 18

• 47

upvoted a paper 2 months ago

Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24 • 41

upvoted a paper 4 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 126

upvoted an article 6 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.29k

upvoted 3 papers 7 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 51

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 150

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 30

upvoted 2 articles 7 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 879

Article

Welcome to Inference Providers on the Hub 🔥

and 6 others •

Jan 28

• 488

upvoted a paper 8 months ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 11

upvoted 2 papers 9 months ago

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 81

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 25

upvoted a paper 10 months ago

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 21

upvoted 2 papers about 1 year ago

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4, 2024 • 98

Memory-Efficient LLM Training with Online Subspace Descent

Paper • 2408.12857 • Published Aug 23, 2024 • 15

upvoted an article about 1 year ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

and 2 others •

Apr 15, 2024

• 186

Kaizhao Liang PRO

AI & ML interests

Recent Activity

Organizations

kz919's activity

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

A failed experiment: Infini-Attention, and why we should keep trying?

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Open-source DeepResearch – Freeing our search agents

Open-R1: a fully open reproduction of DeepSeek-R1

Welcome to Inference Providers on the Hub 🔥

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community