Article: Should We Still Pretrain Encoders with Masked Language Modeling? By Nicolas-BZRD and 3 others • 3 days ago • 9
Post 2446: so many multimodal releases these days
> ERNIE-4.5-VL: new vision language MoE models by Baidu https://huggingface.co/models?search=ernie-4.5-vl
> new visual document retrievers by NVIDIA (sota on ViDoRe!) nvidia/llama-nemoretriever-colembed-3b-v1 nvidia/llama-nemoretriever-colembed-1b-v1
> Ovis-3b: new image-text in image-text out models by Alibaba ⤵️ https://huggingface.co/spaces/AIDC-AI/Ovis-U1-
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • 24B • Updated May 9 • 198k • 1.29k
Article: Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c • Apr 25 • 284
Article: Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • 24 days ago • 105
Article: Microsoft and Hugging Face expand collaboration By jeffboudier and 2 others • May 19 • 22
Collection: Granite Time Series Models. A collection of time series models trained by IBM, licensed under Apache 2.0. • 7 items • Updated 19 days ago • 28
Article: The New and Fresh analytics in Inference Endpoints By erikkaum and 4 others • Mar 21 • 21
Article: Blazingly fast whisper transcriptions with Inference Endpoints By mfuntowicz and 5 others • May 13 • 70
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • 71B • Updated Feb 24 • 259k • 702