view article Article Large-scale Near-deduplication Behind BigCode By chenghao β’ May 16, 2023 β’ 31
view article Article Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub By nvidia and 10 others β’ 9 days ago β’ 23
view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others β’ Feb 21 β’ 172
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community By Leyo and 2 others β’ Apr 15, 2024 β’ 182
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen β’ Mar 26 β’ 143
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit β’ 28 items β’ Updated 5 days ago β’ 83
Ο_0: A Vision-Language-Action Flow Model for General Robot Control Paper β’ 2410.24164 β’ Published Oct 31, 2024 β’ 20
view article Article SmolVLM Grows Smaller β Introducing the 250M & 500M Models! By andito and 2 others β’ Jan 23 β’ 181
view article Article Visual Document Retrieval Goes Multilingual By marco and 1 other β’ Jan 10 β’ 74
view article Article Better RAG 2: Single-shot is not good enough By hrishioa β’ Mar 14, 2024 β’ 16
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 870
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper β’ 2412.07626 β’ Published Dec 10, 2024 β’ 23
Cosmos-Tokenizer Collection A suite of image and video tokenizers β’ 13 items β’ Updated 4 days ago β’ 40
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*β‘ By xhluca β’ Jul 9, 2024 β’ 56