
Stas Bekman

stas
https://stasosphere.com/machine-learning/
  • StasBekman
  • stas00

AI & ML interests

Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at Contextual.AI. Training LLM/RAG/Generative AI/Machine Learning/Scalability.

Organizations

  • BigScience Workshop
  • Social Post Explorers

authored a paper 12 months ago

Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training

Paper • 2406.18820 • Published Jun 27, 2024
authored 2 papers over 1 year ago

The Case for Co-Designing Model Architectures with Hardware

Paper • 2401.14489 • Published Jan 25, 2024 • 3

Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 14
authored a paper almost 2 years ago

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 46
authored 2 papers about 2 years ago

What Language Model to Train if You Have One Million GPU Hours?

Paper • 2210.15424 • Published Oct 27, 2022 • 2

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 32