AI & ML interests

Training efficient language models (MiniLLM, MiniPLM)

Recent Activity

MiniLLM's activity

t1101675 in MiniLLM/MiniPLM-Mamba-130M about 1 month ago

Improve MiniPLM-Mamba-130M model card

#1 opened about 1 month ago by nielsr

t1101675 in MiniLLM/MiniPLM-Qwen-1.2B about 1 month ago

Add link to code

#1 opened about 1 month ago by nielsr

Add link to code

#1 opened about 1 month ago by nielsr

t1101675 in MiniLLM/Pretrain-Qwen-1.2B about 1 month ago

Add link to code

#1 opened about 1 month ago by nielsr

t1101675 in MiniLLM/Pretrain-Qwen-500M about 1 month ago

No changes needed

#1 opened about 1 month ago by nielsr

t1101675 in MiniLLM/Pretrain-Qwen-200M about 1 month ago

Add link to code

#1 opened about 1 month ago by nielsr

Add link to code

#1 opened about 1 month ago by nielsr

No changes

#1 opened about 1 month ago by nielsr

Add dataset card

#1 opened about 1 month ago by nielsr