1 4 10

Yaoqi Chen

yoki123

https://komorebi660.github.io

Komorebi660

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training

upvoted a paper 6 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

updated a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-wiki-bsz16-lr1e-06

View all activity

Organizations

upvoted a paper 13 days ago

Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training

Paper • 2510.08008 • Published 14 days ago • 5

upvoted a paper 6 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28

updated a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-wiki-bsz16-lr1e-06

3B • Updated Apr 26

published a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-wiki-bsz16-lr1e-06

3B • Updated Apr 26

updated a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-email-bsz16-lr1e-06

3B • Updated Apr 26

published a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-email-bsz16-lr1e-06

3B • Updated Apr 26

updated a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-code-bsz16-lr1e-06

3B • Updated Apr 25

published a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-code-bsz16-lr1e-06

3B • Updated Apr 25

updated a model 6 months ago

yoki123/EleutherAI-pythia-2.8b-deduped-full-ft-email-bsz16-lr1e-06

3B • Updated Apr 25

published a model 6 months ago

yoki123/EleutherAI-pythia-2.8b-deduped-full-ft-email-bsz16-lr1e-06

3B • Updated Apr 25

updated a model 6 months ago

yoki123/EleutherAI-pythia-2.8b-deduped-full-ft-math-bsz16-lr1e-06

3B • Updated Apr 25

published a model 6 months ago

yoki123/EleutherAI-pythia-2.8b-deduped-full-ft-math-bsz16-lr1e-06

3B • Updated Apr 25

updated a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-math-bsz16-lr1e-06

3B • Updated Apr 25 • 1

published a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-math-bsz16-lr1e-06

3B • Updated Apr 25 • 1

updated a model 6 months ago

yoki123/EleutherAI-pythia-2.8b-deduped-full-ft-code-bsz16-lr1e-06

3B • Updated Apr 25 • 1

updated a dataset 6 months ago

yoki123/ResultStorage

Updated Apr 25 • 12

published a model 6 months ago

yoki123/EleutherAI-pythia-2.8b-deduped-full-ft-code-bsz16-lr1e-06

3B • Updated Apr 25 • 1

updated a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-clinical-bsz16-lr1e-06

3B • Updated Apr 25

updated a dataset 6 months ago

yoki123/ResultStorage

Updated Apr 25 • 12

updated a model 6 months ago

yoki123/Qwen-Qwen2.5-3B-full-ft-clinical-bsz16-lr1e-06

3B • Updated Apr 25

Yaoqi Chen

AI & ML interests

Recent Activity

Organizations

yoki123's activity