view article Article Introducing smolagents: simple agents that write actions in code. +1 Dec 31, 2024 • 1.16k
view article Article Superposition in Transformers: A Novel Way of Building Mixture of Experts Jan 4 • 13
Scaling Test-Time Compute with Open Models Collection Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 27
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages Paper • 2410.01036 • Published Oct 1, 2024 • 15
view article Article Llama can now see and run on your device - welcome Llama 3.2 +5 Sep 25, 2024 • 191
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated 6 days ago • 242
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 Sep 18, 2024 • 272
view article Article Memory-efficient Diffusion Transformers with Quanto and Diffusers Jul 30, 2024 • 68
TextGrad: Automatic "Differentiation" via Text Paper • 2406.07496 • Published Jun 11, 2024 • 31