
Conversation

@AlexsanderHamir
Collaborator

Title

[Perf] Alexsander fixes round 3 - Oct 25th

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement - see details)
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🐛 Bug Fix
🧹 Refactoring

Changes

  • Reduced memory usage.
  • Swapped the slow stdlib json serialization for a faster option (orjson).

Replaced json.dumps with orjson.dumps in the HTTP handler to reduce
serialization latency for all LLM provider API calls, and replaced
json.dumps/json.loads with orjson in the streaming hot paths. This saves
~350ms per 1000-chunk streaming response (see the sketch after this list).
- Moved orjson from optional to required dependencies in pyproject.toml.
  This fixes a ModuleNotFoundError when importing litellm core modules,
  since orjson is used in llm_http_handler.py, which is imported by core litellm.
- Optimized jsonify_object() (24 call sites across the codebase).
- Optimized get_request_status() for metadata parsing.
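
For illustration, a minimal sketch of the swap described above, assuming a hypothetical post_to_provider call site (the real change lives in llm_http_handler.py and the streaming paths). orjson.dumps returns UTF-8 bytes, so the body can be handed to the HTTP client directly without a separate encode step:

```python
from typing import Any, Dict

import httpx
import orjson


def serialize_request_body(payload: Dict[str, Any]) -> bytes:
    # orjson.dumps returns UTF-8 bytes directly, skipping the
    # str -> bytes encode step that json.dumps(...).encode() needs.
    return orjson.dumps(payload)


def parse_stream_chunk(raw_chunk: bytes) -> Dict[str, Any]:
    # orjson.loads accepts bytes or str, so streaming chunks can be
    # parsed without decoding first.
    return orjson.loads(raw_chunk)


def post_to_provider(client: httpx.Client, url: str, payload: Dict[str, Any]) -> httpx.Response:
    # Hypothetical call site: pass the pre-serialized bytes to httpx via
    # `content=` instead of re-serializing with the stdlib json module.
    return client.post(
        url,
        content=serialize_request_body(payload),
        headers={"Content-Type": "application/json"},
    )
```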

These serialization changes reduce CPU usage and improve database write latency
(see the sketches after this list):
- serialize_object() now uses orjson for 3-5x faster dict serialization; it is
  used in 15+ files across the proxy and integrations.
- get_prompt_caching_cache_key() skips an encode/decode cycle.
- Cache key compatibility, circular reference detection, and the default=str
  fallback behavior are all preserved.
- Added @lru_cache(1024) to get_cooldown_cache_key() and changed all 4 call
  sites to use the cached method instead of recreating the string each time.
- Replaced f-strings with string concatenation for better performance on these
  hot paths.
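
A rough sketch of the serializer change (the real serialize_object() lives in the proxy utilities and may differ): passing default=str to orjson keeps the stdlib-style fallback of coercing non-serializable values to strings.

```python
from datetime import datetime
from typing import Any

import orjson


def serialize_object_sketch(obj: Any) -> str:
    # default=str mirrors json.dumps(obj, default=str): values orjson cannot
    # encode natively (e.g. Decimal, custom classes) are coerced to their
    # string form instead of raising. orjson returns bytes, so decode once
    # to keep a str-returning contract for callers.
    return orjson.dumps(obj, default=str).decode("utf-8")


# Example: datetime is handled natively by orjson; an arbitrary object
# falls back to str().
print(serialize_object_sketch({"ts": datetime(2025, 10, 25), "model": object()}))
```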
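
And a sketch of the cooldown-key caching; the real get_cooldown_cache_key() signature and key format in the router may differ, so the model_id parameter and key layout here are assumptions.

```python
from functools import lru_cache


@lru_cache(1024)
def get_cooldown_cache_key(model_id: str) -> str:
    # lru_cache memoizes the key per model_id, so repeated calls from the
    # four call sites reuse one cached string instead of rebuilding it on
    # every request; plain concatenation replaces an f-string on this hot
    # path. The "deployment:<id>:cooldown" layout is illustrative only.
    return "deployment:" + model_id + ":cooldown"
```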

Results: get_cooldown_cache_key dropped from 47MB to a few bytes and is no
longer a top consumer. The memory leak is still present.

Next: Optimize heavy memory consumers so memory leaks become more obvious.
@vercel

vercel bot commented Oct 26, 2025

The latest updates on your projects.

Project   Deployment   Preview   Comments   Updated (UTC)
litellm   Ready        Preview   Comment    Nov 1, 2025 8:22pm

