
Conversation

@derekmeegan (Contributor)

why

LiteLLM's synchronous completion() method was blocking the event loop in async handlers, preventing concurrent execution of multiple LLM calls. This caused performance degradation when multiple operations needed to run in parallel.
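The blocking behavior described above can be reproduced with a minimal, self-contained sketch. Here `time.sleep` stands in for the synchronous `litellm.completion()` call (an assumption for illustration, not the real client), and `asyncio.sleep` stands in for `litellm.acompletion()`: three blocking calls run back to back, while three awaited calls overlap.

```python
import asyncio
import time

async def blocking_handler():
    # A synchronous call like litellm.completion() holds the event loop;
    # time.sleep(0.1) simulates that blocking behavior.
    time.sleep(0.1)

async def nonblocking_handler():
    # An awaited call like litellm.acompletion() yields control back to
    # the event loop while it waits.
    await asyncio.sleep(0.1)

async def demo(handler):
    # Launch three "LLM calls" concurrently and time the batch.
    start = time.perf_counter()
    await asyncio.gather(handler(), handler(), handler())
    return time.perf_counter() - start

# Blocking calls serialize (~0.3 s total); awaited calls overlap (~0.1 s).
serial = asyncio.run(demo(blocking_handler))
concurrent = asyncio.run(demo(nonblocking_handler))
```

This is exactly the degradation the PR targets: with the sync client, `asyncio.gather` offers no speedup because each call monopolizes the loop.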

what changed

  • Converted LLMClient.create_response() from sync to async method using litellm.acompletion()
  • Updated inference.observe() and inference.extract() functions to be async
  • Modified all handlers (ObserveHandler, ExtractHandler) to await async inference calls
  • Updated mock LLM client's create_response() method to be async for test compatibility
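The new call chain in the bullets above can be sketched as follows. Class and function names (`MockLLMClient.create_response`, `observe`, `extract`) follow the PR description, but the bodies are illustrative stand-ins, not the actual Stagehand implementation:

```python
import asyncio

class MockLLMClient:
    """Test double mirroring the async LLMClient interface."""

    async def create_response(self, messages):
        # Stands in for `await litellm.acompletion(model=..., messages=...)`.
        await asyncio.sleep(0)
        return {"content": f"echo: {messages[-1]['content']}"}

# inference.observe() and inference.extract() are now coroutines,
# so handlers must await them.
async def observe(client):
    return await client.create_response([{"role": "user", "content": "observe"}])

async def extract(client):
    return await client.create_response([{"role": "user", "content": "extract"}])

async def main():
    client = MockLLMClient()
    # With every layer async, independent inference calls can run concurrently.
    return await asyncio.gather(observe(client), extract(client))

results = asyncio.run(main())
```

Because the mock's `create_response()` is itself a coroutine, existing tests can await it exactly as they would the real client.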

test plan

  • Run CI and verify that all checks pass and behavior is unchanged

@derekmeegan derekmeegan merged commit 3bcdd05 into main Sep 25, 2025
13 checks passed
@derekmeegan derekmeegan deleted the derek/make_litellm_async branch September 25, 2025 23:23
@github-actions github-actions bot mentioned this pull request Sep 25, 2025
