Mariusz Kurman PRO

mkurman

AI & ML interests

AI Tech Lead | MD

Recent Activity

reacted to AdinaY's post with 🔥 about 6 hours ago

Qwen team did it again!! They just released Qwen3-Coder-30B-A3B-Instruct on the hub 🔥
https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct
✨ Apache 2.0
✨ 30B total / 3.3B active (128 experts, 8 top-k)
✨ Native 256K context, extendable to 1M via YaRN
✨ Built for Agentic Coding

new activity 1 day ago

MegaScience/MegaScience: License

updated a collection 2 days ago

Medical QA Datasets

Posts 18

Post

🚀 Big news! NeuroBLAST, my new architecture, has officially arrived on HF! After three intense months of training my 1.9-billion-parameter SLM on my trusty RTX 3090 Ti, I’m happy to announce the results. It’s not perfect yet, but I’ve spent countless hours optimizing costs and crafting layer connections that mimic the brain's centers. I’ve also introduced a new memory-like layer. I’ll cover the whole journey in depth in my upcoming blog post. Stay tuned! 🔥

meditsolutions/NeuroBLAST-1.9B-Instruct-Early-Preview

Post

I feel like it's going to take me forever

meditsolutions/medit-one-140M-9B-tokens-checkpoint
