Yuxian Gu

t1101675

AI & ML interests

Efficient methods for language models

Recent Activity

upvoted a paper about 23 hours ago
Rectified Sparse Attention
updated a dataset 9 days ago
Efficient-Large-Model/RULER-data
published a dataset 9 days ago
Efficient-Large-Model/RULER-data

Organizations

Conversational AI (CoAI) group from Tsinghua University, Efficient-Large-Model, MiniLLM, Data Selection, VILA / Molmo

t1101675's activity

New activity in MiniLLM/SFT-gpt2-120M 2 months ago
New activity in MiniLLM/SFT-gpt2-760M 2 months ago
New activity in Data-Selection/PDS-470M 2 months ago
New activity in Data-Selection/PDS-160M 2 months ago
New activity in Data-Selection/PDS-1B 2 months ago

Add link to code repository

#2 opened 2 months ago by nielsr
New activity in Data-Selection/PDS-1.7B 2 months ago
New activity in Data-Selection/BSL-1.7B 2 months ago

Add link to code

#2 opened 2 months ago by nielsr
New activity in MiniLLM/MiniPLM-Mamba-130M 2 months ago
New activity in MiniLLM/MiniPLM-Qwen-1.2B 2 months ago

Add link to code

#1 opened 2 months ago by nielsr
New activity in MiniLLM/Ref-Pretrain-Qwen-104M 2 months ago

Add link to code

#1 opened 2 months ago by nielsr
New activity in MiniLLM/Pretrain-Qwen-1.2B 2 months ago

Add link to code

#1 opened 2 months ago by nielsr
New activity in MiniLLM/Pretrain-Qwen-500M 2 months ago

No changes needed

#1 opened 2 months ago by nielsr
New activity in MiniLLM/Pretrain-Qwen-200M 2 months ago

Add link to code

#1 opened 2 months ago by nielsr
New activity in MiniLLM/VanillaKD-Pretrain-Qwen-200M 2 months ago
New activity in MiniLLM/VanillaKD-Pretrain-Qwen-500M 2 months ago

Add link to code

#1 opened 2 months ago by nielsr