view article Article I trained a Language Model to schedule events with GRPO! By anakin87 • Apr 29 • 79
view article Article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies By prithivMLmods • Feb 17 • 22
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • Jan 27 • 21
⛔️🔦 Provenance, Watermarking & Deepfake Detection Collection Technical tools for more control over non-consensual synthetic content • 14 items • Updated Apr 1, 2024 • 43
Synthetic Data Generator Collection A collection of tools and datasets related to no-code the Synthetic Data Generation. • 21 items • Updated Feb 10 • 10
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 16 items • Updated 15 days ago • 161
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 86
view article Article Let’s make a generation of amazing image generation models By burtenshaw and 4 others • Nov 26, 2024 • 33
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw and 9 others • Oct 16, 2024 • 20
view article Article The 5 Most Under-Rated Tools on Hugging Face By derek-thomas • Aug 22, 2024 • 89
view article Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30, 2024 • 38
view article Article How we leveraged distilabel to create an Argilla 2.0 Chatbot By plaguss and 4 others • Jul 16, 2024 • 33
view article Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • Jun 3, 2024 • 27
view article Article 🧑⚖️ "Replacing Judges with Juries" using distilabel By alvarobartt • May 3, 2024 • 17