Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
euclaise
's Collections
MQA
SuperMC
Small-ish SoTA (<5B), (quasi-)base
Interesting smol pretraining expirements
Small-ish SoTA (<5B), (quasi-)base
updated
Aug 10, 2024
Upvote
1
nvidia/Minitron-4B-Base
Text Generation
•
Updated
29 days ago
•
458
•
133
h2oai/h2o-danube3-4b-base
Text Generation
•
Updated
Jul 15, 2024
•
390
•
21
stabilityai/stablelm-3b-4e1t
Text Generation
•
Updated
Mar 7, 2024
•
17.5k
•
•
310
Qwen/Qwen2-1.5B
Text Generation
•
Updated
Jun 6, 2024
•
94.7k
•
•
89
internlm/internlm2_5-1_8b-chat
Text Generation
•
Updated
2 days ago
•
4.69k
•
25
Qwen/Qwen1.5-4B
Text Generation
•
Updated
Apr 5, 2024
•
5.54k
•
36
tensoropera/Fox-1-1.6B
Text Generation
•
Updated
Nov 21, 2024
•
446
•
31
TRI-ML/DCLM-1B
Updated
Jul 25, 2024
•
67
•
13
Upvote
1
Share collection
View history
Collection guide
Browse collections