Commit 30ff579

perf: Fine-tune max tokens for lower latency conversation
1 parent dc65da6 commit 30ff579

1 file changed, +1 -1 lines changed


main.py

+1 -1
@@ -1020,7 +1020,7 @@ async def _error_response() -> Tuple[CallModel, ActionModel]:
         tool_calls = {}
         try:
             async for delta in completion_stream(
-                max_tokens=400,
+                max_tokens=350,
                 messages=messages,
                 tools=tools,
             ):
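
The latency effect of this change can be sketched with simple arithmetic: for a streamed completion, the longest possible generation time grows roughly linearly with `max_tokens`. The snippet below is only an illustration under stated assumptions. The helper `worst_case_generation_seconds` and the decode rate `ASSUMED_TOKENS_PER_SECOND` are hypothetical and do not come from `main.py`; the real rate depends on the model and deployment and should be measured.

```python
def worst_case_generation_seconds(
    max_tokens: int,
    tokens_per_second: float,
    time_to_first_token: float = 0.0,
) -> float:
    """Upper bound on how long a streamed completion can take.

    Hypothetical helper: it assumes a constant decode rate, which is a
    simplification of real model behaviour.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return time_to_first_token + max_tokens / tokens_per_second


# Assumed decode rate, for illustration only (not a measured value).
ASSUMED_TOKENS_PER_SECOND = 50.0

before = worst_case_generation_seconds(400, ASSUMED_TOKENS_PER_SECOND)
after = worst_case_generation_seconds(350, ASSUMED_TOKENS_PER_SECOND)

print(f"cap=400: up to {before:.1f}s, cap=350: up to {after:.1f}s")
```

Note that the cap only shortens responses that would otherwise have run past 350 tokens; shorter replies are unaffected, and replies that hit the cap are cut off at that point.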
