Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 190
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 56
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 12 days ago • 145
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • 2405.09818 • Published May 16, 2024 • 131
What matters when building vision-language models? Paper • 2405.02246 • Published May 3, 2024 • 103
Zephyr ORPO Collection Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12, 2024 • 17
Vision Language Models Papers 🖼️💬 Collection Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 36
Open LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated 4 days ago • 562
DistilBERT release Collection Original DistilBERT model, checkpoints obtained from using teacher-student learning from the original BERT checkpoints. • 6 items • Updated Apr 17, 2024 • 19
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 725
🐶 IDEFICS 🐶 Collection Collection assembling all the models and spaces related to IDEFICS • 6 items • Updated Apr 15, 2024 • 7
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15, 2024 • 175
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14, 2024 • 127
Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs Paper • 2403.12596 • Published Mar 19, 2024 • 10