Run GPT-OSS-120B with just Single A100 (80GB)

#80
by ghostplant - opened

A solution for single A100 (80G) to serve whatever 20B and 120B version: Tutel Instruction to Run GptOSS 120B.

How to start an API service locally

Sign up or log in to comment