Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Elie Bakouch's picture
83 144 167

Elie Bakouch PRO

eliebak
yjernite's profile picture Allanatrix's profile picture Shangzhilou's profile picture
·
  • eliebakouch
  • eliebak
  • eliebak
  • eliebak.hf.co

AI & ML interests

Training LLM's @ 🤗

Recent Activity

upvoted a paper about 19 hours ago
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
new activity 2 days ago
community-spotlight/README:Nominate a community champion
liked a dataset 2 days ago
HuggingFaceFW/finepdfs
View all activity

Organizations

Hugging Face's profile picture HuggingFaceBR4's profile picture Hugging Face H4's profile picture Blog-explorers's profile picture Hugging Face Smol Models Research's profile picture huggingPartyParis's profile picture Nanotron Research's profile picture MLX Community's profile picture Hugging Face SMOL's profile picture FineData's profile picture HuggingFaceFW-Dev's profile picture StarCoder2 Data's profile picture Hugging Face Discord Community's profile picture LLHF's profile picture llmc's profile picture SLLHF's profile picture Argilla Warehouse's profile picture nltpt's profile picture smol-explorers's profile picture Open Science's profile picture Hugging Face Science's profile picture open/ acc's profile picture Open R1's profile picture smol-ablations's profile picture SmolEvalData's profile picture Scratch to Scale's profile picture

authored a paper 3 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 46
authored a paper 5 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 199
authored 2 papers 7 months ago

INTELLECT-1 Technical Report

Paper • 2412.01152 • Published Dec 2, 2024 • 3

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 242
authored a paper over 1 year ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28, 2024 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs