Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 325
Mistral 7B + UltraChat + Arithmo checkpoints Collection A collection of Mistral 7B fine-tunes on UltraChat and Arithmo to boost the math capabilities of chat models. See https://x.com/_lewtun/status/1715652 • 5 items • Updated Oct 22, 2023 • 2
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 505
Custom Components ✨ Collection Awesome gradio custom components to get you started build your own! • 7 items • Updated Nov 20, 2023 • 35
The Pile: An 800GB Dataset of Diverse Text for Language Modeling Paper • 2101.00027 • Published Dec 31, 2020 • 6
Does Putting a Linguist in the Loop Improve NLU Data Collection? Paper • 2104.07179 • Published Apr 15, 2021 • 1
XNLI: Evaluating Cross-lingual Sentence Representations Paper • 1809.05053 • Published Sep 13, 2018 • 1
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference Paper • 1704.05426 • Published Apr 18, 2017 • 1
HuggingFace's Transformers: State-of-the-art Natural Language Processing Paper • 1910.03771 • Published Oct 9, 2019 • 16
Datasets: A Community Library for Natural Language Processing Paper • 2109.02846 • Published Sep 7, 2021 • 10
ALBERT release Collection The ALBERT release was done in two steps, over 4 checkpoints of different sizes each time. The first version is noted as "v1", the second as "v2". • 8 items • Updated Jul 31 • 5