Conversation

hrithiksagar-tih
In this commit, I have attached a fix for the gpt-oss-20b model inference code via vLLM. The original code in the cookbook (https://cookbook.openai.com/articles/gpt-oss/run-vllm) was not working; with a few modifications, it runs on H100s.

I also have working code for the 120B model.
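For context, a minimal sketch of serving gpt-oss-20b with vLLM along the lines of the cookbook article. This is not the exact code from this commit; the model id, flags, and port are assumptions based on the cookbook and vLLM defaults:

```shell
# Assumed sketch (not this PR's code): serve gpt-oss-20b with vLLM on a single H100.
vllm serve openai/gpt-oss-20b \
  --tensor-parallel-size 1

# Query the OpenAI-compatible endpoint (default port 8000):
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "openai/gpt-oss-20b",
        "messages": [{"role": "user", "content": "Hello!"}]
      }'
```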
