Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 708
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14, 2024 • 126
StarCoder 2 and The Stack v2: The Next Generation Paper • 2402.19173 • Published Feb 29, 2024 • 137
Masked Audio Generation using a Single Non-Autoregressive Transformer Paper • 2401.04577 • Published Jan 9, 2024 • 43
MAGNeT Collection Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4, 2024 • 40
QuIP: 2-Bit Quantization of Large Language Models With Guarantees Paper • 2307.13304 • Published Jul 25, 2023 • 2
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models Paper • 2312.09767 • Published Dec 15, 2023 • 25
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 80
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation Paper • 2312.02145 • Published Dec 4, 2023 • 5
Notus 7B v1 Collection Notus 7B v1 models (DPO fine-tune of Zephyr SFT) and datasets used. More information at https://github.com/argilla-io/notus • 11 items • Updated Dec 11, 2024 • 18
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 233
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 140
Positional Description Matters for Transformers Arithmetic Paper • 2311.14737 • Published Nov 22, 2023 • 2
Thinking Fast and Slow in Large Language Models Paper • 2212.05206 • Published Dec 10, 2022 • 1
Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 17