MiniLLM
community
https://github.com/microsoft/LMOps/tree/main/minillm
t1101675
AI & ML interests
Training efficient language models (MiniLLM, MiniPLM)
Team members
1
MiniLLM's activity
t1101675 updated a model 16 days ago
MiniLLM/MiniLLM-gpt2-340M
Text Generation • Updated 16 days ago • 52 • 2
t1101675 in MiniLLM/MiniLLM-gpt2-340M about 1 month ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by SFconvertbot
t1101675 in MiniLLM/SFT-gpt2-120M about 1 month ago
Adding `safetensors` variant of this model
#1 opened about 1 month ago by SFconvertbot
t1101675 in MiniLLM/SFT-gpt2-760M about 1 month ago
Adding `safetensors` variant of this model
#1 opened about 1 month ago by SFconvertbot
t1101675 in MiniLLM/MiniPLM-Qwen-500M about 1 month ago
Improve model card: add paper abstract and link to paper
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/MiniPLM-llama3.1-212M about 1 month ago
Add library name and link to code repository
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/MiniPLM-Mamba-130M about 1 month ago
Improve MiniPLM-Mamba-130M model card
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/MiniPLM-Qwen-1.2B about 1 month ago
Add link to code
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/Ref-Pretrain-Qwen-104M about 1 month ago
Add link to code
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/Pretrain-Qwen-1.2B about 1 month ago
Add link to code
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/Pretrain-Qwen-500M about 1 month ago
No changes needed
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/Pretrain-Qwen-200M about 1 month ago
Add link to code
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/VanillaKD-Pretrain-Qwen-200M about 1 month ago
Add link to code and base model tag
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/VanillaKD-Pretrain-Qwen-500M about 1 month ago
Add link to code
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/VanillaKD-Pretrain-Qwen-1.2B about 1 month ago
No changes
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/pile-diff_samp-qwen_1.8B-qwen_104M-r0.5 about 1 month ago
Add dataset card
#1 opened about 1 month ago by nielsr
t1101675 in MiniLLM/SFT-OPT-1.3B 4 months ago
Difference between SFT and init models • 2
#1 opened 4 months ago by HyeongSoo
t1101675 authored a paper 5 months ago
NVILA: Efficient Frontier Visual Language Models
Paper • 2412.04468 • Published Dec 5, 2024 • 60
t1101675 updated a dataset 5 months ago
MiniLLM/pile-tokenized
Updated Nov 14, 2024 • 42 • 1
t1101675 in MiniLLM/init-gpt2-120M 5 months ago
Adding `safetensors` variant of this model
#1 opened 5 months ago by SFconvertbot