yano
Promising model

decent RP model!

Good model!

v2 - thoughts


Optimum v1.27 marks the final major release in the v1 series. As we close this chapter, we're laying the groundwork for a more modular and community-driven future:
- Optimum v2: A lightweight core package for porting Transformers, Diffusers, or Sentence-Transformers to specialized AI hardware/software/accelerators.
- Optimum-ONNX: A dedicated package where the ONNX/ONNX Runtime ecosystem lives and evolves, faster-moving and decoupled from the Optimum core.
🎯 Why this matters:
- A clearer governance path for ONNX, fostering stronger community collaboration and an improved developer experience.
- Faster innovation in a more modular, open-source environment.
💡 What this means:
- More transparency, broader participation, and faster development driven by the community and key actors in the ONNX ecosystem (PyTorch, Microsoft, Joshua Lochner, ...)
- A cleaner, more maintainable Optimum core, focused on extending HF libraries to specialized AI hardware/software/accelerator tooling and used by our partners (Intel Corporation, Amazon Web Services (AWS), AMD, NVIDIA, FuriosaAI, ...)
🛠️ Major updates I worked on in this release:
✅ Added support for Transformers v4.53 and SmolLM3 in ONNX/ONNX Runtime.
✅ Fixed batched inference/generation for all supported decoder model architectures (LLMs).
✨ Big shoutout to @echarlaix for leading the refactoring work that cleanly separated the ONNX exporter logic and enabled the creation of Optimum-ONNX.
Release Notes: https://lnkd.in/gXtE_qji
Optimum: https://lnkd.in/ecAezNT6
Optimum-ONNX: https://lnkd.in/gzjyAjSi
#Optimum #ONNX #OpenSource #HuggingFace #Transformers #Diffusers
Nice!

RTX A6000, lists 48 GB video memory...
Soooo jealous....
Don't be, it's Ampere. I have 2 of them; I'd much rather have 2 RTX 6000 Pros nowadays. Much, much faster.
Jealouss.........
Working?

If only every model's description page was as detailed as yours.

671B to 2.7T?!? I can barely run 235B models! (and that's with Q3 or Q2).
"Ultra" is the keyword here. We are going to release Pro and Mini models.
The downside is it isn't very well supported by many apps
Gotcha. I'll keep fingers crossed then
Yes! It has gone from 671B to 2.7T in slightly more than 2 weeks.
Regardless, as long as it performs well and does the job, I suppose.
But I wonder whether trimming the models, or pushing harder on size-vs-performance optimization, shouldn't be a bigger priority. Though I'm new to this scene, so I could just be ignorant of how this is all done.
decent model
