Releases: BerriAI/litellm
Releases · BerriAI/litellm
v1.79.3.dev2
What's Changed
- [Infra] CI/CD - Bump up docker version for e2e ui testing by @yuneng-jiang in #16506
- Add Zscaler AI Guard hook by @jwang-gif in #15691
New Contributors
- @jwang-gif made their first contribution in #15691
Full Changelog: v1.79.dev.1...v1.79.3.dev2
v1.79.dev.1
What's Changed
- Fix container api link in release page by @Sameerlite in #16440
- Add softgen to projects that are using litellm by @artplan1 in #16423
- [Feature] UI - Model Info Page Health Check by @yuneng-jiang in #16416
- add kimi k2 thinking by @artplan1 in #16445
- docs: fix streaming example in README by @Chesars in #16461
- [Fix] Management Endpoints - Fixes inconsistent error responses in customer management endpoints. Non-existent user errors now return proper 404 status codes with consistent error schema format across all endpoints. by @ishaan-jaff in #16450
- [Infra] UI - Show Deprecation Warning for Model Analytics Tab by @yuneng-jiang in #16417
- fix: allow internal users to access video generation routes by @JehandadK in #16472
- [Bug Fix] - LiteLLM Usage shows key_hash- by @ishaan-jaff in #16471
- [Feature] UI - Test Key Page show models based on selected endpoint by @yuneng-jiang in #16452
- Add GET list of providers endpoint by @Sameerlite in #16432
- [Feature] UI - Invite User Searchable Team Select by @yuneng-jiang in #16454
- Add sdk focused examples for custom prompt management by @Sameerlite in #16441
- Fix magistral streaming to emit reasoning chunks by @Sameerlite in #16434
- Add docs for tracking callback failure by @Sameerlite in #16474
- fix[16428]: remove strict master_key check in add_deployment by @vmiscenko in #16453
- fix(proxy): Correct date range filtering in /spend/logs endpoint by @AlanPonnachan in #16443
- fix: update model_cost_map_url to use environment variable by @mcowger in #16429
- feat(router): Support default fallbacks for unknown models by @AlanPonnachan in #16419
- fix(langfuse): Handle null usage values to prevent validation errors by @AlanPonnachan in #16396
- fix: apply provided timeout value to ClientTimeout.total by @yellowsubmarine372 in #16395
- [Bug] Updated spend would not be sent to CloudZero by @Hebruwu in #16201
- fix: unable to delete MCP server from permission settings by @uc4w6c in #16407
- [Fix] Bedrock Knowledge bases - ensure users can access
search_resultsfor both stream + non stream response to /chat/completions by @ishaan-jaff in #16459 - [AI Gateway] - End User Budgets - Allow pointing max_end_user budget to an id, so the default ID applies to all end users by @ishaan-jaff in #16456
New Contributors
- @artplan1 made their first contribution in #16423
- @JehandadK made their first contribution in #16472
- @vmiscenko made their first contribution in #16453
- @mcowger made their first contribution in #16429
- @yellowsubmarine372 made their first contribution in #16395
- @Hebruwu made their first contribution in #16201
Full Changelog: v1.79.3-nightly...v1.79.dev.1
v1.79.3.rc.1
What's Changed
- Litellm dev 10 29 2025 p1 by @krrishdholakia in #16404
- Revert "Litellm dev 10 29 2025 p1" by @krrishdholakia in #16409
- fix: install runtime node for prisma by @AlexsanderHamir in #16410
- Add Vertex and Gemini Videos API with Cost Tracking + UI support by @Sameerlite in #16323
- Adds support for returning Azure Content Policy error information when exceptions from Azure OpenAI occur by @Sameerlite in #16231
- [Fix] UI - Various Small Issues by @yuneng-jiang in #16406
- Litellm dev 10 29 2025 p1 by @krrishdholakia in #16411
- (feat) audio transcription - add gpt-4o-transcribe cost tracking by @krrishdholakia in #16412
- [Bug Fix] Content Filter Guard by @ishaan-jaff in #16414
- [Docs] litellm content filter guard by @ishaan-jaff in #16413
- add: performance improvements to release notes by @AlexsanderHamir in #16401
- [Docs] Litellm 1 79 2 rc by @ishaan-jaff in #16415
Full Changelog: v1.79.2-nightly...v1.79.3.rc.1
v1.79.3-nightly
What's Changed
- Litellm dev 10 29 2025 p1 by @krrishdholakia in #16404
- Revert "Litellm dev 10 29 2025 p1" by @krrishdholakia in #16409
- fix: install runtime node for prisma by @AlexsanderHamir in #16410
- Add Vertex and Gemini Videos API with Cost Tracking + UI support by @Sameerlite in #16323
- Adds support for returning Azure Content Policy error information when exceptions from Azure OpenAI occur by @Sameerlite in #16231
- [Fix] UI - Various Small Issues by @yuneng-jiang in #16406
- Litellm dev 10 29 2025 p1 by @krrishdholakia in #16411
- (feat) audio transcription - add gpt-4o-transcribe cost tracking by @krrishdholakia in #16412
- [Bug Fix] Content Filter Guard by @ishaan-jaff in #16414
- [Docs] litellm content filter guard by @ishaan-jaff in #16413
- add: performance improvements to release notes by @AlexsanderHamir in #16401
- [Docs] Litellm 1 79 2 rc by @ishaan-jaff in #16415
Full Changelog: v1.79.2-nightly...v1.79.3-nightly
v1.79.2-nightly
What's Changed
- (feat) Milvus - search vector store support + (fix) Passthrough Endpoints - support multi-part form data on passthrough by @krrishdholakia in #16035
- (feat) Azure AI Vector Stores - support "virtual" indexes + create vector store on passthrough API by @krrishdholakia in #16160
- Add
custom_llm_providersupport for video endpoints (non-generation) by @Sameerlite in #16121 - fix: resolve memory accumulation caused by Pydantic 2.11+ deprecation warnings by @AlexsanderHamir in #16110
- add: minimum resource requirement for production by @AlexsanderHamir in #16146
- Remove minimum validation for cache control injection index by @steve-gore-snapdocs in #16149
- Fix empty assistant message handling in AWS Bedrock Converse API to prevent 400 Bad Request errors by @Sameerlite in #15850
- Add LLM provider response headers to Responses API by @Sameerlite in #16091
- Fix azure doesn't accept extra body param by @Sameerlite in #16116
- Add E2E Container API Support by @Sameerlite in #16136
- Add new model pricing for nemotron-nano-9b-v2 by @timbmg in #16120
- docs - v1.79.1 by @ishaan-jaff in #16163
- UI - Fix regression where Guardrail Entity Could not be selected and entity was not displayed by @ishaan-jaff in #16165
- Fix Anthropic token counting for VertexAI by @steve-gore-snapdocs in #16171
- [Feature] UI - Delete Team Member with friction by @yuneng-jiang in #16167
- [Feature] UI - Add Model Existing Credentials Improvement by @yuneng-jiang in #16166
- [Feature] UI - Guardrail Info Page Show PII Config by @yuneng-jiang in #16164
- strip base64 in s3 by @deepanshululla in #16157
- fix(ui_sso.py): support dot notation on ui sso by @krrishdholakia in #16135
- Support responses API streaming in langfuse otel by @eycjur in #16153
- fix: return the diarized transcript when it's required in the request by @gvioss in #16133
- build: Squashed commit of the following: by @krrishdholakia in #16176
- feat: change guardrail_information to list type by @uc4w6c in #16127
- Milvus - Passthrough API support - adds create + read vector store support via passthrough API's by @krrishdholakia in #16170
- Fix index field not populated in streaming mode with n>1 and tool calls by @Sameerlite in #15962
- Update perplexity cost tracking by @Sameerlite in #15743
- Fix image_config.aspect_ratio not working for gemini-2.5-flash-image by @Sameerlite in #15999
- [Feat] s3 logger, add support for ssl_verify when using minio logger by @ishaan-jaff in #16211
- [Patch] - Revert Noma Apply Guardrail implementation by @ishaan-jaff in #16214
- [Feat] /ocr - Add VertexAI OCR provider support + cost tracking by @ishaan-jaff in #16216
- fix: broken link on model_management.md by @AlexsanderHamir in #16217
- [Feat] Add Azure AI Doc Intelligence OCR by @ishaan-jaff in #16219
- Add Prometheus metric to track callback logging failures in S3 by @Sameerlite in #16209
- fix(anthropic-adapter): properly translate Anthropic image format to OpenAI by @Nivg in #16202
- Feat: Enable automated prompt caching message format for Claude on Databricks by @anthonyivn2 in #16200
- Add gemini live audio model cost in model map by @Sameerlite in #16183
- Fix translation problem with Gemini parallel tool calls by @pablobgar in #16194
- feat: Generalize tiered pricing in generic cost calculator by @AlanPonnachan in #16150
- [MCP] configure static mcp header by @uc4w6c in #16179
- add: benchmark comparison with other gateways by @AlexsanderHamir in #16145
- Revert "add: benchmark comparison with other gateways" by @krrishdholakia in #16247
- [Feature] UI - Litellm test key audio by @yuneng-jiang in #16251
- [Feature] UI - Prevent trailing slash in sso proxy base url input by @yuneng-jiang in #16244
- [Feature] UI - Tag Usage Top Model Table View and Label Fix by @yuneng-jiang in #16249
- fix(proxy): handle None values in daily spend sort key by @AlexsanderHamir in #16245
- fix: Fix OpenAI Responses API streaming tests usage field names and cost calculation by @Chesars in #16236
- [Feature] UI - Initial changes for supporting prompts to multiple models by @yuneng-jiang in #16223
- Add tags and descriptions support to aws secrets manager by @deepanshululla in #16224
- [Feat] Add Bedrock Agentcore as a provider on LiteLLM Python SDK and LiteLLM AI Gateway by @ishaan-jaff in #16252
- [Feat] /search API - add firecrawl search API support by @ishaan-jaff in #16257
- [Feat] add serxng search API provider by @ishaan-jaff in #16259
- feat: Support
reasoning_effortfor watsonx chat models by @AlanPonnachan in #16261 - add: shared_session support to responses API by @AlexsanderHamir in #16260
- fix: typo of the word orginal by @bowenliang123 in #16255
- fix(openai): Remove automatic summary from reasoning_effort transformation by @Chesars in #16210
- fix(redis): handle float redis_version from AWS ElastiCache Valkey by @AlexsanderHamir in #16207
- Fix langfuse input tokens logic for cached tokens by @fabriciojoc in #16203
- Add gpt-image-1-mini pricing for azure by @Sameerlite in #16182
- Fix: Send Gemini API key via x-goog-api-key header with custom api_base by @Sameerlite in #16085
- Litellm noma guardrail support images by @dean-zavad in #16199
- [Feat] Add CyberArk Secrets Manager Integration by @ishaan-jaff in #16278
- [Feat] Cyber Ark - Add Key Rotations support by @ishaan-jaff in #16289
- Add allowing Key based prefix to s3 path by @deepanshululla in #16237
- Add: benchmark comparison with other AI gateways by @AlexsanderHamir in #16248
- [Feat] Email Notifications - Ensure Users get Key Rotated Email by @ishaan-jaff in #16292
- [Fix] UI - Fixed Label for vLLM in Model Create Flow by @yuneng-jiang in #16285
- feat: Add configurable mount name and path prefix for HashiCorp Vault by @AlexsanderHamir in #16253
- Stabilize main branch by @Sameerlite in #16243
- OpikLogger: fix the bug with not incorrect attachment to existing trace & refactor by @alexkuzmik in #15529
- [Feat] Add Custom Secret Manager - Allow users to define and write a custom secret manager by @ishaan-jaff in #16297
- [Feature] UI - Include Model Access Group Models on Team Models Table by @yuneng-jiang in #16298
- Gracefully handle connection closed errors during streaming by @AlexsanderHamir in #16294
- [Fix] Litellm non root docker Model Hub Table fix by @yuneng-jiang in #16282
- [Fix] /model_group/info Returning Entire Model List for SSO Users by @yuneng-jiang in #16296
- Update MCP version from 1.10.1 to 1.20.0 by @Granine in #16281
- feat: add parallel execution handling in during_call_hook by @Oodapow in #16279
- fix: Resolve MyPy type checking errors and CI linting by @Chesars in #16277
- [LiteLLM-16250] Proxy to Bedrock will add name to file content, breaks when cache_control in use. by @jgoodyear in #16275
- Add azure gpt-5-pro in model map by @Sameerlite in #16235
- Revert "Update MCP version from 1.10.1 to 1.20.0" by @krrishdholakia in https://git...
v1.79.1-stable
Full Changelog: v1.79.1.rc.2...v1.79.1-stable
v1.79.1.dev6
What's Changed
- (feat) Milvus - search vector store support + (fix) Passthrough Endpoints - support multi-part form data on passthrough by @krrishdholakia in #16035
- (feat) Azure AI Vector Stores - support "virtual" indexes + create vector store on passthrough API by @krrishdholakia in #16160
- Add
custom_llm_providersupport for video endpoints (non-generation) by @Sameerlite in #16121 - fix: resolve memory accumulation caused by Pydantic 2.11+ deprecation warnings by @AlexsanderHamir in #16110
- add: minimum resource requirement for production by @AlexsanderHamir in #16146
- Remove minimum validation for cache control injection index by @steve-gore-snapdocs in #16149
- Fix empty assistant message handling in AWS Bedrock Converse API to prevent 400 Bad Request errors by @Sameerlite in #15850
- Add LLM provider response headers to Responses API by @Sameerlite in #16091
- Fix azure doesn't accept extra body param by @Sameerlite in #16116
- Add E2E Container API Support by @Sameerlite in #16136
- Add new model pricing for nemotron-nano-9b-v2 by @timbmg in #16120
- docs - v1.79.1 by @ishaan-jaff in #16163
- UI - Fix regression where Guardrail Entity Could not be selected and entity was not displayed by @ishaan-jaff in #16165
- Fix Anthropic token counting for VertexAI by @steve-gore-snapdocs in #16171
- [Feature] UI - Delete Team Member with friction by @yuneng-jiang in #16167
- [Feature] UI - Add Model Existing Credentials Improvement by @yuneng-jiang in #16166
- [Feature] UI - Guardrail Info Page Show PII Config by @yuneng-jiang in #16164
- strip base64 in s3 by @deepanshululla in #16157
- fix(ui_sso.py): support dot notation on ui sso by @krrishdholakia in #16135
- Support responses API streaming in langfuse otel by @eycjur in #16153
- fix: return the diarized transcript when it's required in the request by @gvioss in #16133
- build: Squashed commit of the following: by @krrishdholakia in #16176
- feat: change guardrail_information to list type by @uc4w6c in #16127
- Milvus - Passthrough API support - adds create + read vector store support via passthrough API's by @krrishdholakia in #16170
- Fix index field not populated in streaming mode with n>1 and tool calls by @Sameerlite in #15962
- Update perplexity cost tracking by @Sameerlite in #15743
- Fix image_config.aspect_ratio not working for gemini-2.5-flash-image by @Sameerlite in #15999
- [Feat] s3 logger, add support for ssl_verify when using minio logger by @ishaan-jaff in #16211
- [Patch] - Revert Noma Apply Guardrail implementation by @ishaan-jaff in #16214
- [Feat] /ocr - Add VertexAI OCR provider support + cost tracking by @ishaan-jaff in #16216
- fix: broken link on model_management.md by @AlexsanderHamir in #16217
- [Feat] Add Azure AI Doc Intelligence OCR by @ishaan-jaff in #16219
- Add Prometheus metric to track callback logging failures in S3 by @Sameerlite in #16209
- fix(anthropic-adapter): properly translate Anthropic image format to OpenAI by @Nivg in #16202
- Feat: Enable automated prompt caching message format for Claude on Databricks by @anthonyivn2 in #16200
- Add gemini live audio model cost in model map by @Sameerlite in #16183
- Fix translation problem with Gemini parallel tool calls by @pablobgar in #16194
- feat: Generalize tiered pricing in generic cost calculator by @AlanPonnachan in #16150
- [MCP] configure static mcp header by @uc4w6c in #16179
- add: benchmark comparison with other gateways by @AlexsanderHamir in #16145
- Revert "add: benchmark comparison with other gateways" by @krrishdholakia in #16247
- [Feature] UI - Litellm test key audio by @yuneng-jiang in #16251
- [Feature] UI - Prevent trailing slash in sso proxy base url input by @yuneng-jiang in #16244
- [Feature] UI - Tag Usage Top Model Table View and Label Fix by @yuneng-jiang in #16249
- fix(proxy): handle None values in daily spend sort key by @AlexsanderHamir in #16245
- fix: Fix OpenAI Responses API streaming tests usage field names and cost calculation by @Chesars in #16236
- [Feature] UI - Initial changes for supporting prompts to multiple models by @yuneng-jiang in #16223
- Add tags and descriptions support to aws secrets manager by @deepanshululla in #16224
- [Feat] Add Bedrock Agentcore as a provider on LiteLLM Python SDK and LiteLLM AI Gateway by @ishaan-jaff in #16252
- [Feat] /search API - add firecrawl search API support by @ishaan-jaff in #16257
- [Feat] add serxng search API provider by @ishaan-jaff in #16259
- feat: Support
reasoning_effortfor watsonx chat models by @AlanPonnachan in #16261 - add: shared_session support to responses API by @AlexsanderHamir in #16260
- fix: typo of the word orginal by @bowenliang123 in #16255
- fix(openai): Remove automatic summary from reasoning_effort transformation by @Chesars in #16210
- fix(redis): handle float redis_version from AWS ElastiCache Valkey by @AlexsanderHamir in #16207
- Fix langfuse input tokens logic for cached tokens by @fabriciojoc in #16203
- Add gpt-image-1-mini pricing for azure by @Sameerlite in #16182
- Fix: Send Gemini API key via x-goog-api-key header with custom api_base by @Sameerlite in #16085
New Contributors
- @steve-gore-snapdocs made their first contribution in #16149
- @timbmg made their first contribution in #16120
- @Nivg made their first contribution in #16202
- @pablobgar made their first contribution in #16194
- @AlanPonnachan made their first contribution in #16150
- @Chesars made their first contribution in #16236
- @bowenliang123 made their first contribution in #16255
Full Changelog: v1.79.1-nightly...v1.79.1.dev6
1.78.5-stable-patch-1
Full Changelog: v1.78.5-nightly...1.78.5-stable-patch-1
v1.79.1.rc.2
Full Changelog: v1.79.1-nightly...v1.79.1.rc.2
v1.79.1.dev5
What's Changed
- (feat) Milvus - search vector store support + (fix) Passthrough Endpoints - support multi-part form data on passthrough by @krrishdholakia in #16035
- (feat) Azure AI Vector Stores - support "virtual" indexes + create vector store on passthrough API by @krrishdholakia in #16160
- Add
custom_llm_providersupport for video endpoints (non-generation) by @Sameerlite in #16121 - fix: resolve memory accumulation caused by Pydantic 2.11+ deprecation warnings by @AlexsanderHamir in #16110
- add: minimum resource requirement for production by @AlexsanderHamir in #16146
- Remove minimum validation for cache control injection index by @steve-gore-snapdocs in #16149
- Fix empty assistant message handling in AWS Bedrock Converse API to prevent 400 Bad Request errors by @Sameerlite in #15850
- Add LLM provider response headers to Responses API by @Sameerlite in #16091
- Fix azure doesn't accept extra body param by @Sameerlite in #16116
- Add E2E Container API Support by @Sameerlite in #16136
- Add new model pricing for nemotron-nano-9b-v2 by @timbmg in #16120
- docs - v1.79.1 by @ishaan-jaff in #16163
- UI - Fix regression where Guardrail Entity Could not be selected and entity was not displayed by @ishaan-jaff in #16165
- Fix Anthropic token counting for VertexAI by @steve-gore-snapdocs in #16171
- [Feature] UI - Delete Team Member with friction by @yuneng-jiang in #16167
- [Feature] UI - Add Model Existing Credentials Improvement by @yuneng-jiang in #16166
- [Feature] UI - Guardrail Info Page Show PII Config by @yuneng-jiang in #16164
- strip base64 in s3 by @deepanshululla in #16157
- fix(ui_sso.py): support dot notation on ui sso by @krrishdholakia in #16135
- Support responses API streaming in langfuse otel by @eycjur in #16153
- fix: return the diarized transcript when it's required in the request by @gvioss in #16133
- build: Squashed commit of the following: by @krrishdholakia in #16176
- feat: change guardrail_information to list type by @uc4w6c in #16127
- Milvus - Passthrough API support - adds create + read vector store support via passthrough API's by @krrishdholakia in #16170
- Fix index field not populated in streaming mode with n>1 and tool calls by @Sameerlite in #15962
- Update perplexity cost tracking by @Sameerlite in #15743
- Fix image_config.aspect_ratio not working for gemini-2.5-flash-image by @Sameerlite in #15999
New Contributors
- @steve-gore-snapdocs made their first contribution in #16149
- @timbmg made their first contribution in #16120
Full Changelog: v1.79.1-nightly...v1.79.1.dev5