Remnant Collection Remnant is a series of finetuned LLMs focused on SFW and NSFW roleplaying and conversation. • 3 items • Updated 5 days ago • 3
Gemstone Models Collection Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80. • 59 items • Updated Feb 26 • 8
NanoBEIR 🍺 Collection A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 14
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 178
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 37
L3-8B-Helium3 Collection The culmination of my first LLM project. Hybrid storytelling and RP model, with a focus on niche fetish content. (This will be a recurring theme.) • 3 items • Updated Sep 16, 2024 • 1
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated 4 days ago • 27
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • 2405.09818 • Published May 16, 2024 • 131
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 7 days ago • 191
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 9 days ago • 566
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 757