Added working vLLM offline-serve code.

#107
by hrithiksagar-tih - opened

In this commit, I have attached working inference code for the gpt-oss-20b model via vLLM. The original code in the cookbook (https://cookbook.openai.com/articles/gpt-oss/run-vllm) was not working; with a few modifications, it now runs.
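For reference, here is a minimal sketch of the offline (no-server) inference path this kind of fix targets. It assumes the standard vLLM offline API (`LLM`, `SamplingParams`, and `LLM.chat`) and the `openai/gpt-oss-20b` checkpoint; the model name and sampling settings are illustrative, not necessarily the exact code in this PR:

```python
# Minimal vLLM offline inference sketch for gpt-oss-20b.
# Assumes a vLLM build with gpt-oss support; settings below are
# illustrative and may differ from the code attached in this PR.
from vllm import LLM, SamplingParams

def main():
    # Loads the model locally (downloads from the HF Hub on first run).
    llm = LLM(model="openai/gpt-oss-20b")

    sampling = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=256)

    # Offline chat-style generation, without starting an API server.
    messages = [
        {"role": "user", "content": "Explain what vLLM offline inference is."},
    ]
    outputs = llm.chat(messages, sampling)

    for output in outputs:
        print(output.outputs[0].text)

if __name__ == "__main__":
    main()
```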

