@benchislett
Contributor

Overview

Include the generated_text section from each request at the top of the output file. This makes it much easier to check response quality at a glance and to match each response, by line number, to the i-th prompt.
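For reviewers who want to script the line-number matching, here is a minimal sketch. It assumes the first lines of the output file hold one generated text per line, in prompt order; the exact file layout is not shown in this PR, and `pair_prompts_with_outputs` is a hypothetical helper, not part of the tool.

```python
from typing import List, Tuple


def pair_prompts_with_outputs(
    prompts: List[str], output_lines: List[str]
) -> List[Tuple[int, str, str]]:
    """Pair the i-th prompt with the i-th line of the output file.

    Assumption: the leading len(prompts) lines of the output file each
    contain one generated text. Anything after those lines is ignored.
    """
    head = output_lines[: len(prompts)]
    return [
        (line_no, prompt, generated.rstrip("\n"))
        for line_no, (prompt, generated) in enumerate(zip(prompts, head), start=1)
    ]
```

Line numbers are 1-based here so they line up with what a text editor shows.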

Other

  • Allow small output token lengths as long as they are at least 1. The stricter lower bound was formerly a sanity check for ShareGPT data but isn't really needed anymore. Now we can run with --output-token-distribution same 1 to test prefill-only workloads.
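The relaxed check can be pictured roughly as follows. This is only a sketch: it reads the rule as "at least one output token", which is what `--output-token-distribution same 1` requires, and `validate_output_len` is a hypothetical name rather than the function changed in this PR.

```python
def validate_output_len(num_tokens: int) -> int:
    """Accept any output length of at least one token.

    Assumption: a single output token is the smallest useful request,
    which is what a prefill-only workload asks for.
    """
    if num_tokens < 1:
        raise ValueError(f"output length must be >= 1, got {num_tokens}")
    return num_tokens
```

With this reading, a length of 1 passes and a length of 0 is rejected.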

@benchislett benchislett self-assigned this Aug 27, 2025
Signed-off-by: Benjamin Chislett <[email protected]>
@benchislett benchislett merged commit 695da06 into main Aug 27, 2025
5 checks passed
@benchislett benchislett deleted the qol-output-file branch August 27, 2025 20:55