- Cache-to-Cache: Direct Semantic Communication Between Large Language Models • Paper 2510.03215 • Published 29 days ago • 93 upvotes
- QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs • Paper 2510.11696 • Published 19 days ago • 169 upvotes
- R2R Collection • Collection for the paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing" • 5 items • Updated 17 days ago • 2 upvotes
- PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models • Paper 2506.16054 • Published Jun 19 • 60 upvotes
- MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression • Paper 2406.14909 • Published Jun 21, 2024 • 16 upvotes
- A Survey on Efficient Inference for Large Language Models • Paper 2404.14294 • Published Apr 22, 2024 • 3 upvotes
- Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching • Paper 2412.17153 • Published Dec 22, 2024 • 39 upvotes
- R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing • Paper 2505.21600 • Published May 27 • 70 upvotes