Benchmark toolkit support #66
Labels
feature
Categorizes issue or PR as related to a new feature.
needs-priority
Indicates a PR lacks a `priority/foo` label and requires one.
needs-triage
Indicates an issue or PR lacks a `triage/foo` label and requires one.
What would you like to be added:
It would be great to support benchmarking LLM throughput and latency across different backends.
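To make the request concrete, here is a rough sketch of what such a measurement could look like. It is only an illustration, not a proposed design: `benchmark_backend`, `fake_backend`, and the reported metric names are all hypothetical, and a real backend call would replace the stand-in function.

```python
import statistics
import time
from typing import Callable, Dict, List


def benchmark_backend(
    generate: Callable[[str], List[str]],
    prompts: List[str],
) -> Dict[str, float]:
    """Send prompts one at a time and summarize latency and token throughput."""
    if not prompts:
        raise ValueError("at least one prompt is required")

    latencies: List[float] = []
    total_tokens = 0
    start = time.perf_counter()
    for prompt in prompts:
        t0 = time.perf_counter()
        tokens = generate(prompt)  # hypothetical backend call
        latencies.append(time.perf_counter() - t0)
        total_tokens += len(tokens)
    elapsed = time.perf_counter() - start

    return {
        "requests": float(len(prompts)),
        "mean_latency_s": statistics.mean(latencies),
        "max_latency_s": max(latencies),
        "tokens_per_s": total_tokens / elapsed if elapsed > 0 else 0.0,
    }


def fake_backend(prompt: str) -> List[str]:
    """Stand-in for a real backend: sleeps briefly, returns whitespace tokens."""
    time.sleep(0.001)
    return prompt.split()


if __name__ == "__main__":
    print(benchmark_backend(fake_backend, ["hello world", "how are you today"]))
```

Passing the backend in as a plain callable would let the same loop be reused for each backend being compared. Concurrency, warm-up runs, and percentile latencies are left out of this sketch.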
Why is this needed:
To give users concrete performance evidence.
Completion requirements:
This enhancement requires the following artifacts:
The artifacts should be linked in subsequent comments.