Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
31
3
Stas Bekman
stas
Follow
Imtiaz's profile picture
seshubon's profile picture
FengJ's profile picture
116 followers
·
4 following
https://stasosphere.com/machine-learning/
StasBekman
stas00
AI & ML interests
Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at Contextual.AI Training LLM/RAG/Generative AI/Machine Learning/Scalability
Recent Activity
updated
a model
3 days ago
stas/ml-engineering-book
updated
a model
19 days ago
stas/ml-engineering-book
posted
an
update
7 months ago
Do you want ArcticTraining at @SnowflakeDB to add an ability to post-train DeepSeek V3/R1 models with DPO using just a few GPU nodes? Please vote here and tell others about it: https://github.com/snowflakedb/ArcticTraining/discussions/58 ArcticTraining is an open-source, easy to use post-training framework for NVIDIA GPUs built on top of DeepSpeed.
View all activity
Organizations
stas
's models
9
Sort: Recently updated
stas/ml-engineering-book
Updated
3 days ago
•
19
stas/tiny-random-llama-2
Text Generation
•
0.0B
•
Updated
Nov 14, 2023
•
4.7k
•
41
stas/tiny-m2m_100
Updated
Apr 29, 2022
•
2.51k
stas/tr8b-104B-debug3
Updated
Nov 29, 2021
stas/pegasus-cnn_dailymail-tiny-random
Updated
Jul 1, 2021
•
352
stas/mt5-tiny-random
Updated
Jun 23, 2021
•
63.9k
•
2
stas/tiny-wmt19-en-de
Updated
May 3, 2021
•
355
•
1
stas/tiny-wmt19-en-ru
Updated
May 3, 2021
•
2.12k
stas/t5-very-small-random
Updated
Apr 21, 2021
•
8
•
1