LoneStriker/Meta-Llama-3.1-70B-Instruct-6.0bpw-h6-exl2 Text Generation • Updated Jul 24, 2024 • 20 • 6
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated Jan 17 • 27