Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model Paper • 2501.05122 • Published 9 days ago • 18
Centurio Collection Artifacts of the paper "Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model" • 5 items • Updated 8 days ago • 4
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides Paper • 2501.03936 • Published 11 days ago • 18
view article Article **Fine-tune SmolLM's on custom synthetic data** By prithivMLmods • 13 days ago • 16
Facilitating large language model Russian adaptation with Learned Embedding Propagation Paper • 2412.21140 • Published 19 days ago • 15
Sentence Encoders Collection Collection of models and dataset for sentence encoder task • 4 items • Updated Nov 25, 2024 • 7
rusBeIR-datasets Collection Collection of datasets used in rusBeIR • 37 items • Updated 21 days ago • 4
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 128
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 128
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 8 items • Updated 6 days ago • 22
Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 2 items • Updated 26 days ago • 7
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 12 days ago • 33
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 31