Why does vLLM add this to support deterministic mode when the HF code doesn't? Will it cause a performance drop?