Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Bojan Jakimovski
Shekswess
AI & ML interests
AWS Ambassador | Machine Learning Lead | College Professor | GenAI | MLOps
Recent Activity
updated
a collection
2 days ago
Tiny Think
updated
a collection
2 days ago
Tiny Think DPO Checkpoints
updated
a collection
2 days ago
Tiny Think DPO Checkpoints
Organizations
Tiny Think DPO Checkpoints
Collection dedicated to all the DPO checkpoints from the Tiny Think Experiments
-
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 123 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 69 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 69 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation • 0.1B • Updated • 72
Tiny Language Model Datasets
Collection of Synthetic Datasets that can be used in pretraining of any the Tiny Language Model
Stable Diffusion XL Neuron Models
Collection of Stable Diffusion XL Models that can run on AWS Silicon Chips (specifically AWS Inferentia 2)
Tiny Reasoning Language Model
Collection dedicated to the development of the Tiny Reasoning Language Model (trlm)
Tiny Think SFT Checkpoints
Collection dedicated to all the SFT checkpoints from the Tiny Think Experiments
-
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-e3-bs8
Text Generation • 0.1B • Updated • 287 -
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e3-bs8
Text Generation • 0.1B • Updated • 193 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr2e-5-e2-bs8
Text Generation • 0.1B • Updated • 120 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr5e-5-e2-bs8
Text Generation • 0.1B • Updated • 117
SynthGenAI Datasets
Collection of Synthetic Datasets created by using SynthGenAI
Medical Instruct Models
Collection of all the medical instruct fine-tuned LLMs with 7B parameters
Tiny Think
Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Tiny Reasoning Language Model
Collection dedicated to the development of the Tiny Reasoning Language Model (trlm)
Tiny Think DPO Checkpoints
Collection dedicated to all the DPO checkpoints from the Tiny Think Experiments
-
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 123 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 69 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 69 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation • 0.1B • Updated • 72
Tiny Think SFT Checkpoints
Collection dedicated to all the SFT checkpoints from the Tiny Think Experiments
-
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-e3-bs8
Text Generation • 0.1B • Updated • 287 -
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e3-bs8
Text Generation • 0.1B • Updated • 193 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr2e-5-e2-bs8
Text Generation • 0.1B • Updated • 120 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr5e-5-e2-bs8
Text Generation • 0.1B • Updated • 117
Tiny Language Model Datasets
Collection of Synthetic Datasets that can be used in pretraining of any the Tiny Language Model
SynthGenAI Datasets
Collection of Synthetic Datasets created by using SynthGenAI
Stable Diffusion XL Neuron Models
Collection of Stable Diffusion XL Models that can run on AWS Silicon Chips (specifically AWS Inferentia 2)
Medical Instruct Models
Collection of all the medical instruct fine-tuned LLMs with 7B parameters