view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 1 day ago โข 21
view post Post 3436 2025.1 - DeepSeek entered the scene, backed by High Flyer Quant2026.1 - IQuest enters the game, backed by Uniquant Quant ๐ and launching IQuest-Coder on huggingfacehttps://huggingface.co/collections/IQuestLab/iquest-coderโจ 40B models: Instruct / Thinking / Loopโจ Loop = MoE-level performance with only ~5% extra training costโจ Native 128K context See translation 1 reply ยท ๐ 6 6 + Reply
IQuestLab/IQuest-Coder-V1-40B-Instruct Text Generation โข 40B โข Updated 4 days ago โข 3.61k โข 234
view post Post 3267 I have update my https://huggingface.co/collections/MohamedRashad/arabic-speech-datasetswith new datasets, making the full audio data more than 3000 hours of good arabic speech.Feel Free to use it in your new innovations, And happy new year! See translation โค๏ธ 10 10 + Reply
view post Post 5525 Thank you @clem (Co-Founder & CEO of Hugging Face) for sharing my dataset on X / Twitter! ronantakizawa/github-top-developers#github #dataset See translation 4 replies ยท ๐ 11 11 โค๏ธ 3 3 ๐ 2 2 ๐ 1 1 + Reply
dh-unibe/transkribus-exports-3025-raw-xml Viewer โข Updated about 14 hours ago โข 482k โข 800 โข 1
view post Post 5346 NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! ๐ฅHas 1M context window & best in class performance for SWE-Bench, reasoning & chat. Run the MoE model locally with 24GB RAM.GGUF: unsloth/Nemotron-3-Nano-30B-A3B-GGUF๐ Step-by-step Guide: https://docs.unsloth.ai/models/nemotron-3 See translation 1 reply ยท ๐ฅ 12 12 โค๏ธ 7 7 ๐ค 4 4 ๐ 1 1 + Reply