v1.79.1.dev6
What's Changed
- (feat) Milvus - search vector store support + (fix) Passthrough Endpoints - support multi-part form data on passthrough by @krrishdholakia in #16035
- (feat) Azure AI Vector Stores - support "virtual" indexes + create vector store on passthrough API by @krrishdholakia in #16160
- Add `custom_llm_provider` support for video endpoints (non-generation) by @Sameerlite in #16121
- fix: resolve memory accumulation caused by Pydantic 2.11+ deprecation warnings by @AlexsanderHamir in #16110
- add: minimum resource requirement for production by @AlexsanderHamir in #16146
- Remove minimum validation for cache control injection index by @steve-gore-snapdocs in #16149
- Fix empty assistant message handling in AWS Bedrock Converse API to prevent 400 Bad Request errors by @Sameerlite in #15850
- Add LLM provider response headers to Responses API by @Sameerlite in #16091
- Fix azure doesn't accept extra body param by @Sameerlite in #16116
- Add E2E Container API Support by @Sameerlite in #16136
- Add new model pricing for nemotron-nano-9b-v2 by @timbmg in #16120
- docs - v1.79.1 by @ishaan-jaff in #16163
- UI - Fix regression where Guardrail Entity Could not be selected and entity was not displayed by @ishaan-jaff in #16165
- Fix Anthropic token counting for VertexAI by @steve-gore-snapdocs in #16171
- [Feature] UI - Delete Team Member with friction by @yuneng-jiang in #16167
- [Feature] UI - Add Model Existing Credentials Improvement by @yuneng-jiang in #16166
- [Feature] UI - Guardrail Info Page Show PII Config by @yuneng-jiang in #16164
- strip base64 in s3 by @deepanshululla in #16157
- fix(ui_sso.py): support dot notation on ui sso by @krrishdholakia in #16135
- Support responses API streaming in langfuse otel by @eycjur in #16153
- fix: return the diarized transcript when it's required in the request by @gvioss in #16133
- build: Squashed commit of the following: by @krrishdholakia in #16176
- feat: change guardrail_information to list type by @uc4w6c in #16127
- Milvus - Passthrough API support - adds create + read vector store support via passthrough API's by @krrishdholakia in #16170
- Fix index field not populated in streaming mode with n>1 and tool calls by @Sameerlite in #15962
- Update perplexity cost tracking by @Sameerlite in #15743
- Fix image_config.aspect_ratio not working for gemini-2.5-flash-image by @Sameerlite in #15999
- [Feat] s3 logger, add support for ssl_verify when using minio logger by @ishaan-jaff in #16211
- [Patch] - Revert Noma Apply Guardrail implementation by @ishaan-jaff in #16214
- [Feat] /ocr - Add VertexAI OCR provider support + cost tracking by @ishaan-jaff in #16216
- fix: broken link on model_management.md by @AlexsanderHamir in #16217
- [Feat] Add Azure AI Doc Intelligence OCR by @ishaan-jaff in #16219
- Add Prometheus metric to track callback logging failures in S3 by @Sameerlite in #16209
- fix(anthropic-adapter): properly translate Anthropic image format to OpenAI by @Nivg in #16202
- Feat: Enable automated prompt caching message format for Claude on Databricks by @anthonyivn2 in #16200
- Add gemini live audio model cost in model map by @Sameerlite in #16183
- Fix translation problem with Gemini parallel tool calls by @pablobgar in #16194
- feat: Generalize tiered pricing in generic cost calculator by @AlanPonnachan in #16150
- [MCP] configure static mcp header by @uc4w6c in #16179
- add: benchmark comparison with other gateways by @AlexsanderHamir in #16145
- Revert "add: benchmark comparison with other gateways" by @krrishdholakia in #16247
- [Feature] UI - Litellm test key audio by @yuneng-jiang in #16251
- [Feature] UI - Prevent trailing slash in sso proxy base url input by @yuneng-jiang in #16244
- [Feature] UI - Tag Usage Top Model Table View and Label Fix by @yuneng-jiang in #16249
- fix(proxy): handle None values in daily spend sort key by @AlexsanderHamir in #16245
- fix: Fix OpenAI Responses API streaming tests usage field names and cost calculation by @Chesars in #16236
- [Feature] UI - Initial changes for supporting prompts to multiple models by @yuneng-jiang in #16223
- Add tags and descriptions support to aws secrets manager by @deepanshululla in #16224
- [Feat] Add Bedrock Agentcore as a provider on LiteLLM Python SDK and LiteLLM AI Gateway by @ishaan-jaff in #16252
- [Feat] /search API - add firecrawl search API support by @ishaan-jaff in #16257
- [Feat] add serxng search API provider by @ishaan-jaff in #16259
- feat: Support `reasoning_effort` for watsonx chat models by @AlanPonnachan in #16261
- add: shared_session support to responses API by @AlexsanderHamir in #16260
- fix: typo of the word orginal by @bowenliang123 in #16255
- fix(openai): Remove automatic summary from reasoning_effort transformation by @Chesars in #16210
- fix(redis): handle float redis_version from AWS ElastiCache Valkey by @AlexsanderHamir in #16207
- Fix langfuse input tokens logic for cached tokens by @fabriciojoc in #16203
- Add gpt-image-1-mini pricing for azure by @Sameerlite in #16182
- Fix: Send Gemini API key via x-goog-api-key header with custom api_base by @Sameerlite in #16085
New Contributors
- @steve-gore-snapdocs made their first contribution in #16149
- @timbmg made their first contribution in #16120
- @Nivg made their first contribution in #16202
- @pablobgar made their first contribution in #16194
- @AlanPonnachan made their first contribution in #16150
- @Chesars made their first contribution in #16236
- @bowenliang123 made their first contribution in #16255
Full Changelog: v1.79.1-nightly...v1.79.1.dev6