Doge family of small language models!

Doge Face
community
AI & ML interests
A Family of Dynamic UltraFast Small Language Models Ready for Embodied Artificial General Intelligence!
Organization Card

SmallDoge
Welcome to SmallDoge, where we develop compact, high-performance small language models (SLMs). Our focus is on building ultra-fast SLMs with dynamic algorithms. We are committed to transparency and collaboration: all of our training details and code are openly available in the SmallDoge GitHub repository.
Our Mission: To democratize access to advanced AI by developing efficient, open-source small language models that empower a wide range of applications and research.
Join our community on Discord!
Explore Our Projects
We offer a suite of resources and models:
- Small-Doges: A versatile series of SLMs, including pre-trained base models, supervised fine-tuned models, and models enhanced with reinforcement learning.
- Doge-CheckPoints: A collection of model checkpoints designed for seamless continued training on new datasets, ensuring smoother adaptation and minimizing training instability.
- Small-Datasets: Curated, multi-stage, high-quality datasets specifically engineered to effectively train small language models, boosting their capabilities and helpfulness.
- Doge-Downstream-Applications: A selection of SLMs optimized for various downstream tasks and real-world applications.
Collections: 4
Models: 81
- SmallDoge/Qwen2.5-14b-llmlingua-50 • Text Generation • 8
- SmallDoge/Qwen2.5-14b-budget2048 • Text Generation • 4
- SmallDoge/Qwen2.5-math-7b-budget2048 • Text Generation • 5
- SmallDoge/Llama3.1-8b-110k • Text Generation • 4
- SmallDoge/Qwen2.5-math-14b-llmlingua-90 • Text Generation • 8
- SmallDoge/Qwen2.5-math-7b-llmlingua-90 • Text Generation • 6
- SmallDoge/Qwen2.5-math-7b-llmlingua-50 • Text Generation • 4
- SmallDoge/Doge2-175M-checkpoint • Text Generation • 7
- SmallDoge/Doge2-tokenizer • 1
- SmallDoge/Qwen2.5-math-7b-chain-of-draft25k • Text Generation • 11
Datasets: 33
- SmallDoge/smallcorpus • 259M • 759 • 3
- SmallDoge/Math_Benchmark_Difficulty • 1.07k • 39
- SmallDoge/SmallThoughts • 102k • 389 • 45
- SmallDoge/Doge2-tokenizer-samples • 2M • 107
- SmallDoge/DMA-Pretrain • 17M • 283
- SmallDoge/CoD-25K • 25k • 89
- SmallDoge/SmallTalks • 4.48M • 2.91k • 9
- SmallDoge/MiniCorpus • 3.4M • 162
- SmallDoge/OpenThoughts-920K • 927k • 90 • 1
- SmallDoge/OpenR1-Math-DPO • 88.5k • 19