AI & ML interests

Training efficient language models (MiniLLM, MiniPLM)

Recent Activity

MiniLLM's activity

t1101675 
in MiniLLM/MiniPLM-Mamba-130M about 2 months ago

Improve MiniPLM-Mamba-130M model card

#1 opened about 2 months ago by
nielsr
t1101675 
in MiniLLM/MiniPLM-Qwen-1.2B about 2 months ago

Add link to code

#1 opened about 2 months ago by
nielsr
t1101675 
in MiniLLM/Ref-Pretrain-Qwen-104M about 2 months ago

Add link to code

#1 opened about 2 months ago by
nielsr
t1101675 
in MiniLLM/Pretrain-Qwen-1.2B about 2 months ago

Add link to code

#1 opened about 2 months ago by
nielsr
t1101675 
in MiniLLM/Pretrain-Qwen-500M about 2 months ago

No changes needed

#1 opened about 2 months ago by
nielsr
t1101675 
in MiniLLM/Pretrain-Qwen-200M about 2 months ago

Add link to code

#1 opened about 2 months ago by
nielsr

Add link to code

#1 opened about 2 months ago by
nielsr

No changes

#1 opened about 2 months ago by
nielsr

Add dataset card

#1 opened about 2 months ago by
nielsr