22 38 2

Tianyi Zhou

zhoutianyi

https://tianyizhou.github.io/

AI & ML interests

ML, NLP, RL, Multi-modality

Recent Activity

commented on a paper about 2 hours ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

commented on a paper about 21 hours ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

authored a paper 1 day ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

View all activity

Organizations

commented a paper about 2 hours ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published 2 days ago • 20 •

commented a paper about 21 hours ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published 2 days ago • 20 •

authored a paper 1 day ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published 2 days ago • 20

upvoted a paper 1 day ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published 2 days ago • 20

commented a paper 1 day ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published 2 days ago • 20 •

commented a paper 15 days ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 16 days ago • 28 •

upvoted a paper 15 days ago

FaSTA^*: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 17 days ago • 40

authored 4 papers 15 days ago

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models

Paper • 2505.21765 • Published May 27

Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

Paper • 2506.10395 • Published about 1 month ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 16 days ago • 28

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 17 days ago • 40

commented a paper 16 days ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 17 days ago • 40 •

upvoted a paper 16 days ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 16 days ago • 28

commented 2 papers 16 days ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 17 days ago • 40 •

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 16 days ago • 28 •

upvoted a paper 24 days ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published 25 days ago • 11

authored a paper 24 days ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published 25 days ago • 11

commented a paper 25 days ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published 25 days ago • 11 •

authored a paper 25 days ago

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10 • 49

upvoted a paper 25 days ago

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10 • 49

Tianyi Zhou

AI & ML interests

Recent Activity

Organizations

zhoutianyi's activity