view article Article Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas By MaxNomic and 4 others β’ Jan 23 β’ 30
view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others β’ Dec 23, 2024 β’ 19
view article Article Open Preference Dataset for Text-to-Image Generation by the π€ Community By davidberenstein1957 and 6 others β’ Dec 9, 2024 β’ 56
view article Article Letβs make a generation of amazing image generation models By burtenshaw and 4 others β’ Nov 26, 2024 β’ 33
view article Article Share your open ML datasets on Hugging Face Hub! By davanstrien and 3 others β’ Nov 12, 2024 β’ 28
view article Article Scaling AI-based Data Processing with Hugging Face + Dask By scj13 and 3 others β’ Oct 9, 2024 β’ 30
view article Article Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation By davanstrien β’ Jun 20, 2024 β’ 12
view article Article Data Is Better Together: A Look Back and Forward By sdiazlor and 2 others β’ Jun 20, 2024 β’ 20
view article Article Synthetic dataset generation techniques: generating custom sentence similarity data By davanstrien β’ May 23, 2024 β’ 16
view article Article Synthetic dataset generation techniques: Self-Instruct By davanstrien β’ May 15, 2024 β’ 14
view article Article Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien β’ May 7, 2024 β’ 8
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models By loubnabnl and 2 others β’ Mar 20, 2024 β’ 82
view article Article Extracting Insights from Model Cards Using Open Large Language Models By davanstrien β’ Nov 27, 2023
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model By VictorSanh and 10 others β’ Aug 22, 2023 β’ 31
view article Article Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub By davanstrien β’ Aug 2, 2023 β’ 1
view article Article The Hugging Face Hub for Galleries, Libraries, Archives and Museums By davanstrien β’ Jun 12, 2023 β’ 1
view article Article Introducing BERTopic Integration with Hugging Face Hub By davanstrien and 1 other β’ May 31, 2023 β’ 8