Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others • May 24, 2023 • 152
Jamba 1.6 Collection The AI21 Jamba family of models are hybrid SSM-Transformer foundation models, outperforming open model competitors on quality and speed. • 2 items • Updated Mar 6 • 15
Portuguese LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with the best evaluations on the PT-LLM leaderboard. • 17 items • Updated 24 minutes ago • 33
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 867
Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.26k
Tucano: Advancing Neural Text Generation for Portuguese Paper • 2411.07854 • Published Nov 12, 2024 • 6
Article Total noob’s intro to Hugging Face Transformers By 2legit2overfit • Mar 22, 2024 • 81
Large Language Models in Biomedical and Health Informatics: A Bibliometric Review Paper • 2403.16303 • Published Mar 24, 2024 • 1
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated 12 days ago • 10
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31, 2024 • 539
TeenyTinyLlama Collection TeenyTinyLlama is a pair of compact language models based on the Llama 2 architecture trained on a Brazilian Portuguese corpus. • 38 items • Updated 10 days ago • 3
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12, 2024 • 149