Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
234 commits
Select commit Hold shift + click to select a range
29057ba
Added support for azure anthopic models via chat completion
Sameerlite Nov 19, 2025
3e58fe4
fix: Support response_format parameter in completion -> responses bridge
Chesars Nov 19, 2025
57544f1
[Feat] Adds IAM role assumption support for AWS Secret Manager (#16887)
ishaan-jaff Nov 20, 2025
0d812f9
new u build
ishaan-jaff Nov 20, 2025
93affcb
[Feat] mcp resources support (#16800)
uc4w6c Nov 20, 2025
e7751f0
docs: fix docs bug
krrishdholakia Nov 21, 2025
7225fc0
Fix key model alias (#16896)
yuneng-jiang Nov 21, 2025
041ac05
feat: allow custom violation message for tool-permission guardrail (#…
uc4w6c Nov 21, 2025
be9f2e0
feat(pricing): Add Gemini 3 Pro Preview for OpenRouter (#16905)
TsurumaruTsuyoshi Nov 21, 2025
b96179a
Upgrade pydantic to version 2.11.0 (#16909)
superpoussin22 Nov 21, 2025
4156672
[Feat] UI - Prompt Management - Allow testing prompts with Chat UI (…
ishaan-jaff Nov 21, 2025
3c789ac
feat: Add vector store create and search call types (#16859)
ishaan-jaff Nov 21, 2025
97d9da9
[Feat] Prompt Management - Allow viewing version history (#16901)
ishaan-jaff Nov 21, 2025
caddc6d
fix images being dropped from tool results for bedrock (#16492)
cyberjunk Nov 21, 2025
f9d8eea
[stripe] gemini 3 thought signatures in tool call id (#16895)
colinlin-stripe Nov 21, 2025
4b25398
[Infra] CI/CD Fixes (#16937)
yuneng-jiang Nov 21, 2025
6ae2290
[Feat] Prompt Versioning - Allow specifying prompt version in code (…
ishaan-jaff Nov 21, 2025
5dd2ee0
Change /public fields to honor server root path (#16930)
yuneng-jiang Nov 21, 2025
c9ac194
[Fix] Prompt Management - UI, allow seeing model, prompt id for Promp…
ishaan-jaff Nov 21, 2025
fb38763
[Feat] UI - Show "get code" section for prompt management + minor pol…
ishaan-jaff Nov 21, 2025
1812eba
fix: Correct Cerebras GPT-OSS-120B model name (#16939)
Chesars Nov 21, 2025
01ea6c8
[New model] Add GLM 4.6 from together.ai (#16942)
ishaan-jaff Nov 21, 2025
ed6c3b4
[Bug Fix]: Search APIs - error in firecrawl-search "Invalid request b…
ishaan-jaff Nov 21, 2025
7c4ef09
docs: fix mcp url format (#16940)
uc4w6c Nov 21, 2025
22ef7ab
feat: Add support for Gemini 3 Pro Image model (#16938)
Chesars Nov 21, 2025
1c65800
Feat: add support for Grok 4.1 Fast models (#16936)
Chesars Nov 21, 2025
b6b8f46
Change Public Model Hub to use proxyBaseUrl (#16892)
yuneng-jiang Nov 21, 2025
5b4a848
fix anthropic pass-through endpoint (#16883)
sureshdsk Nov 22, 2025
d88580f
fix(gemini-video): inherit BaseVideoConfig to enable async content re…
eiliyaabedini Nov 22, 2025
8e318dd
[Feat] New LLM Provider - Docker Model Runner (#16948)
ishaan-jaff Nov 22, 2025
0abfb07
Remove UI Session Token from user/info return (#16851)
yuneng-jiang Nov 22, 2025
4205b7c
fix install litellm
ishaan-jaff Nov 22, 2025
2226450
test_ensure_initialize_azure_sdk_client_always_used
ishaan-jaff Nov 22, 2025
4e8f1d0
fix prompt manager
ishaan-jaff Nov 22, 2025
4a9f163
TestPromptVersionsEndpoint
ishaan-jaff Nov 22, 2025
69da15e
test_fal_ai_image_generation_basic
ishaan-jaff Nov 22, 2025
9b5a655
fix _encode_tool_call_id_with_signature
ishaan-jaff Nov 22, 2025
5cfacb9
Exclude litellm_credential_name from sensitive masker (#16950)
yuneng-jiang Nov 22, 2025
49e3313
Remove console logs and errors from model tab (#16455)
yuneng-jiang Nov 22, 2025
1ebe1fe
Docs for Model Compare UI and Org Usage (#16928)
yuneng-jiang Nov 22, 2025
473fec8
fix _get_allowed_mcp_servers
ishaan-jaff Nov 22, 2025
e7a32c1
docker test fixes
ishaan-jaff Nov 22, 2025
6439aed
snowflake test fix
ishaan-jaff Nov 22, 2025
8b8b31e
fix img gen
ishaan-jaff Nov 22, 2025
34f0c3c
Remove cost tracking disabled tooltip in chat ui (#16953)
ishaan-jaff Nov 22, 2025
3296ffd
test fixes
ishaan-jaff Nov 22, 2025
1f36fad
TestDockerModelRunnerIntegration
ishaan-jaff Nov 22, 2025
58a56ba
test fixes for masker
ishaan-jaff Nov 22, 2025
eb48d5c
Revert "Exclude litellm_credential_name from sensitive masker (#16950…
yuneng-jiang Nov 22, 2025
db58f6a
fix: arize phoenix logging (#16301)
mubashir1osmani Nov 22, 2025
703f619
feat(bedrock): Add Claude 4.5 to US Gov Cloud (#16957)
justin-tahara Nov 22, 2025
6881594
[Fix] Exclude litellm_credential_name from Sensitive Data Masker (Upd…
yuneng-jiang Nov 22, 2025
f542011
fix: cache cooldown key (#16954)
AlexsanderHamir Nov 22, 2025
cdb46f9
fix: cache SSL contexts to prevent excessive memory allocation (#16955)
AlexsanderHamir Nov 22, 2025
b074c79
Allow partial matches for user id in user table (#16952)
yuneng-jiang Nov 22, 2025
6e70c27
[Fix] - Router's Cache: Fix routing for requests with same cacheable …
AlexsanderHamir Nov 22, 2025
42c883d
fix redis event loop closed at first call (#16913)
wangsoft Nov 22, 2025
a0d4d0b
Gemini models: capture image_tokens and support cost_per_output_image…
Dima-Mediator Nov 22, 2025
696974b
fix: add mcp server ids (#16904)
mubashir1osmani Nov 22, 2025
f56c7e1
Change Bulk Invite User Roles to match backend (#16906)
yuneng-jiang Nov 22, 2025
bbaf0af
Grayswan guardrail passthrough on flagged (#16891)
dsduenas Nov 22, 2025
7b05a5f
Add full information to vertex_ai/gemini-2.5-flash-image key (#16882)
jlan-nl Nov 22, 2025
671c219
[Infra] Building UI for Testing (#16968)
yuneng-jiang Nov 22, 2025
8fbf060
ArizePhoenixConfig
ishaan-jaff Nov 22, 2025
6611176
Revert "remove deprecated embedding model (#16724)" (#16970)
ishaan-jaff Nov 22, 2025
82dc035
Litellm sameer nov 3 stable branch (#16963)
Sameerlite Nov 22, 2025
eb10dd4
fix bedrock model info
ishaan-jaff Nov 22, 2025
badbadb
fix img URL for tests
ishaan-jaff Nov 22, 2025
cfcd597
Fix tests (#16972)
yuneng-jiang Nov 22, 2025
eb5031d
[Perf] Fix bottlenecks degrading realtime endpoint performance (#16670)
AlexsanderHamir Nov 22, 2025
0ed443f
fix claude-sonnet-4-5-20250929-v1:0
ishaan-jaff Nov 22, 2025
c34d8af
test fix
ishaan-jaff Nov 22, 2025
5b23b09
async def test_auth_callback_new_user(mock_google_sso, mock_env_vars,…
ishaan-jaff Nov 22, 2025
b2812af
fix MCP tests
ishaan-jaff Nov 22, 2025
fc0eac2
test_get_tools_from_mcp_servers
ishaan-jaff Nov 22, 2025
0c28af8
test MCP server
ishaan-jaff Nov 22, 2025
3235807
test prompt manager
ishaan-jaff Nov 22, 2025
ee75891
test docker model runner
ishaan-jaff Nov 22, 2025
9a06580
Fix SSL test failures due to caching and test isolation issues (#16973)
AlexsanderHamir Nov 22, 2025
be71138
fix build bad db url
ishaan-jaff Nov 22, 2025
5c289df
test url with format
ishaan-jaff Nov 22, 2025
2613b7b
fix security
ishaan-jaff Nov 22, 2025
65b842b
test fix
ishaan-jaff Nov 22, 2025
2a92e97
installing_litellm_on_python
ishaan-jaff Nov 22, 2025
ca2a27c
fix: add missing mock attributes in websocket and realtime tests (#16…
AlexsanderHamir Nov 22, 2025
6810e06
docs: Add mini-swe-agent to Projects built on LiteLLM (#16971)
Chesars Nov 22, 2025
9817347
fix govcloud
ishaan-jaff Nov 22, 2025
1b88cfb
test_router_get_deployment_credentials_with_provider
ishaan-jaff Nov 22, 2025
9ea74a1
TestDockerModelRunnerIntegration
ishaan-jaff Nov 22, 2025
aa95443
fix mypy linting
ishaan-jaff Nov 22, 2025
9be76be
_apply_prompt_template_core
ishaan-jaff Nov 22, 2025
1adaf04
fix TYPE_CHECKING + security
ishaan-jaff Nov 22, 2025
725982f
test_dotprompt_with_prompt_version
ishaan-jaff Nov 22, 2025
f5c8136
fix docker model runner tests
ishaan-jaff Nov 22, 2025
4f707ed
bump: version 1.80.3 → 1.80.4
ishaan-jaff Nov 22, 2025
0429ca4
test_package_dependencies
ishaan-jaff Nov 22, 2025
ff99f93
fix req.txt
ishaan-jaff Nov 22, 2025
3ba3fae
fix sec scan
ishaan-jaff Nov 22, 2025
d92c5bb
fix pkg lock
ishaan-jaff Nov 22, 2025
dc08e2d
fix pkg lock
ishaan-jaff Nov 22, 2025
ac3aa74
(feat) Anthropic - support Structured Outputs `output_format` for Cla…
krrishdholakia Nov 22, 2025
2f0ec47
GHSA-5j98-mcp5-4vw2 fix
ishaan-jaff Nov 22, 2025
5f78ea7
async_health_check arize fix
ishaan-jaff Nov 22, 2025
37460b6
bump: version 1.80.4 → 1.80.5
ishaan-jaff Nov 22, 2025
c7fefcc
bump v
ishaan-jaff Nov 22, 2025
dd32519
ui testing fixes
ishaan-jaff Nov 22, 2025
a06e7ed
docs 1.80.5
ishaan-jaff Nov 22, 2025
3162048
ui unit test fix
ishaan-jaff Nov 22, 2025
b02baf5
Fix: prevent memory blowout in LoggingWorker (#16559)
AlexsanderHamir Nov 22, 2025
b43b68a
docs fix
ishaan-jaff Nov 22, 2025
c6b8f19
ui unit tests fix
ishaan-jaff Nov 22, 2025
4fb9e33
fixes
ishaan-jaff Nov 22, 2025
22fd323
Calling team/permissions_list and team/permissions_update now returns…
yuneng-jiang Nov 22, 2025
825f61b
Remove expired proxy admin keys from cache (#16894)
yuneng-jiang Nov 22, 2025
1fc3baf
e2e ui testing fixes
ishaan-jaff Nov 22, 2025
e8ba4e3
fix ui unit tests
ishaan-jaff Nov 22, 2025
9194652
fix ui build
ishaan-jaff Nov 22, 2025
7cf1d30
fix ui unit tests fuck this test why is it so flaky
ishaan-jaff Nov 22, 2025
f7f4320
Revert to console outputs to reduce noise (#16981)
yuneng-jiang Nov 22, 2025
e11d34e
Permission Management - disable global guardrails by key/team (#16983)
krrishdholakia Nov 22, 2025
b9f2cc1
Model Armor - Logging guardrail response on llm responses (#16977)
krrishdholakia Nov 22, 2025
c966c12
feat: Add Presidio PII masking tutorial (#16969)
krrishdholakia Nov 22, 2025
270d239
(fix) litellm_logging.py: fix mcp tool call response logging + (fix) …
krrishdholakia Nov 22, 2025
c6e0a02
Docs for Model Compare UI (#16979)
yuneng-jiang Nov 22, 2025
478f9b0
fix: fix ruff checks
krrishdholakia Nov 22, 2025
de897ce
fix(vertex_ai/image_generation_handler.py): fix default sample count …
krrishdholakia Nov 22, 2025
5da77ac
[Fix] Revert UI - Organization Usage (#16980)
yuneng-jiang Nov 23, 2025
d0cb2db
docs(ai_hub.md): document mcp servers on ai hub
krrishdholakia Nov 23, 2025
7cbb159
v1.80.0-stable
ishaan-jaff Nov 23, 2025
93c2103
fix docs
ishaan-jaff Nov 23, 2025
cc5ecfd
test: fix tests
krrishdholakia Nov 23, 2025
023eefb
docs fix
ishaan-jaff Nov 23, 2025
815136f
perf release notes (#16978)
AlexsanderHamir Nov 23, 2025
f1e4242
docs fix
ishaan-jaff Nov 23, 2025
036b284
docs
ishaan-jaff Nov 23, 2025
2ebfc92
docs fix
ishaan-jaff Nov 23, 2025
7e338f1
docs sso roles
ishaan-jaff Nov 23, 2025
1c864dc
docs 1.80.5
ishaan-jaff Nov 23, 2025
f3cd0b0
docs - ai gateway prompt management (#16990)
ishaan-jaff Nov 23, 2025
16a5178
Ensure unique keys in navbar menu items (#16987)
yuneng-jiang Nov 23, 2025
06f2ece
feat: tool permission argument check (#16982)
uc4w6c Nov 23, 2025
02873cf
Minor cosmetic changes for buttons, add notification for delete team …
yuneng-jiang Nov 23, 2025
73d3d15
fix 1.80.5
ishaan-jaff Nov 23, 2025
90850bf
fix: add nodejs and npm to runtime dependencies for prisma generate (…
DmitriyAlergant Nov 23, 2025
f0b10b8
chore: remove unused MCP_PROTOCOL_VERSION_HEADER_NAME constant (#17008)
uc4w6c Nov 24, 2025
b72b497
feat: add backend support for OAuth2 auth_type registration via UI (#…
uc4w6c Nov 24, 2025
5b07290
docs: cleanup README and improve agent guides (#17003)
Chesars Nov 24, 2025
f37d648
feat: add UI support for registering MCP OAuth2 auth_type (#17007)
uc4w6c Nov 24, 2025
45e921d
fix: Update broken documentation links in README (#17002)
Chesars Nov 24, 2025
7cc92d1
fix(vertex_ai): handle global location in context caching (#16997)
prawaan Nov 24, 2025
a2a45ce
fix: prevent duplicate spend logs in Responses API for non-OpenAI pro…
hula-la Nov 24, 2025
013dcd8
Change provider create fields to JSON (#16985)
yuneng-jiang Nov 24, 2025
adfdcf1
[Fix] UI - Hide Default Team Settings From Proxy Admin Viewers (#16900)
yuneng-jiang Nov 24, 2025
61fed95
OCI Provider: Fix pydantic validation errors during tool call with st…
gotsysdba Nov 24, 2025
760bce2
feat(proxy_server.py): preserve content field even if null (#16988)
krrishdholakia Nov 24, 2025
f18cec2
Update release notes to v1.80.5.rc.2 (#17032)
krrishdholakia Nov 24, 2025
c368b4c
Document model pricing contribution process (#17031)
krrishdholakia Nov 24, 2025
7c1dfd4
docs: Clarify optional user_allowed_email_domain (#17034)
krrishdholakia Nov 24, 2025
559142a
Add table explaining different callback hooks (#17035)
krrishdholakia Nov 24, 2025
62b84d6
Prompt security litellm (#16365)
lior-ps Nov 24, 2025
a807fe4
add claude opus 4.5 (#17043)
mubashir1osmani Nov 24, 2025
4e195d6
[Feat] New API - Claude Skills API (Anthropic) (#17042)
ishaan-jaff Nov 24, 2025
d2b90e6
[Docs] Claude Skills API (#17052)
ishaan-jaff Nov 24, 2025
f5a3349
feat: add UI support for configuring tool permission guardrails (#17050)
uc4w6c Nov 25, 2025
aec6590
add strands tutorial (#17039)
mubashir1osmani Nov 25, 2025
629404a
Add cost tracking for cohere embed passthrough endpoint (#17029)
Sameerlite Nov 25, 2025
84e8b9a
fix: handle None or empty contents in Gemini token counter (#17020)
HaiyiMei Nov 25, 2025
3b6c170
Fix the azure auth format for videos (#17009)
Sameerlite Nov 25, 2025
c6fbdc7
fix bedrock passthrough auth issue (#16879)
Sameerlite Nov 25, 2025
35bfcac
Add header forwarding in embedding (#16869)
Sameerlite Nov 25, 2025
fc219c7
Integrate eleven labs text-to-speech (#16573)
Sameerlite Nov 25, 2025
282ac87
Add temperature support for 5.1 models (#17011)
Sameerlite Nov 25, 2025
d53bc7b
Change modals to reusable component
yuneng-jiang Nov 25, 2025
bd8196f
(fix) propagate x-litellm-model-id in responses (#16986)
raghav-stripe Nov 25, 2025
262fb74
Fix: Distinguish permission errors from idempotent errors in Prisma m…
yuya2017 Nov 25, 2025
e371ff4
Non root docker build fix (#17060)
yuneng-jiang Nov 25, 2025
3f5a34d
Deleting a user from team deletes key user created for team (#17057)
yuneng-jiang Nov 25, 2025
1ae8095
Docs: Add link to logging payload spec (#17049)
krrishdholakia Nov 25, 2025
d2b3ef0
Add aws_bedrock_runtime_endpoint into Credential Types (#17053)
yuneng-jiang Nov 25, 2025
597fa4d
Fix image edit endpoint (#17046)
emerzon Nov 25, 2025
777ef62
Enhancement(helm): ServiceMonitor template rendering (#17038)
saar-win Nov 25, 2025
3aba6d9
[Fix] UI - Add No Default Models for Team and User Settings (#17037)
yuneng-jiang Nov 25, 2025
650b189
fix(gemini): skip thinking config for image models (#17027)
Chesars Nov 25, 2025
cfd35d3
Metadata: fix 401 when audio/transcriptions (#17023)
saar-win Nov 25, 2025
046b7ef
Make Bedrock image generation more consistent (#17021)
komarovd95 Nov 25, 2025
babee43
Merge pull request #17068 from BerriAI/litellm_additional_delete_reso…
yuneng-jiang Nov 25, 2025
6dcb542
fix(vertex): fix CreateCachedContentRequest enum error (#16965)
sdip15fa Nov 25, 2025
29ab291
Add vertex ai image support
Sameerlite Nov 25, 2025
f52f057
Update docs related to vertex ai image gen
Sameerlite Nov 25, 2025
883cfae
Add tests
Sameerlite Nov 25, 2025
b0d5111
remove unsused imports
Sameerlite Nov 25, 2025
a50083a
Remove none support from reasoning param
Sameerlite Nov 25, 2025
c149ade
Add tests related to reasoning param none
Sameerlite Nov 25, 2025
67d69d1
Add cost tracking and logging support
Sameerlite Nov 25, 2025
afe540e
Fix auth issue
Sameerlite Nov 25, 2025
dd4c8ec
Add v1/messages support for azure anthropic models
Sameerlite Nov 25, 2025
e2f2ccd
Add tests related messages api
Sameerlite Nov 25, 2025
e0396e5
Merge pull request #17082 from BerriAI/main
Sameerlite Nov 25, 2025
255d1bc
fix lint errors
Sameerlite Nov 25, 2025
1c61228
fix lint errors
Sameerlite Nov 25, 2025
00e17c8
Add enforce user param functionality (#17088)
krrishdholakia Nov 25, 2025
2e50db8
Merge pull request #17071 from BerriAI/litellm_azure_gpt_5_reasoning
Sameerlite Nov 25, 2025
59bcf07
Merge pull request #17078 from BerriAI/litellm_add_search_logging
Sameerlite Nov 25, 2025
59b4b9a
fix documentation of anthropic azure
Sameerlite Nov 25, 2025
83a9dcd
Merge pull request #16886 from BerriAI/litellm_anthopic_azure_support
Sameerlite Nov 25, 2025
3249f6d
Merge pull request #17070 from BerriAI/litellm_add_vertex_ai_image_su…
Sameerlite Nov 25, 2025
67622fb
Add day 0 support for anthropic new feat (#17091)
Sameerlite Nov 25, 2025
db2c8e3
docs: initial doc cleanup
krrishdholakia Nov 25, 2025
44cde2e
Disable edit, delete, info, for dynamically generated spend tags
yuneng-jiang Nov 25, 2025
be71290
[Feat] Add OpenAI compatible bedrock imported models. - qwen etc (#17…
ishaan-jaff Nov 25, 2025
52f1bf1
fix: missing await (#17103)
uc4w6c Nov 25, 2025
f3d5775
fix: fix doc load issue
krrishdholakia Nov 25, 2025
db58792
Sorting changes, pending tests and loading state
yuneng-jiang Nov 25, 2025
c0288d8
Fix bedrock claude opus 4.5 inference profile - only global currently…
reflection Nov 25, 2025
8637d74
include `server_tool_use` in streaming usage (#16826)
KeremTurgutlu Nov 25, 2025
70a1325
docs: more doc cleanup
krrishdholakia Nov 25, 2025
3da9974
Tests
yuneng-jiang Nov 25, 2025
8ee6812
docs: cleanup launch post
krrishdholakia Nov 25, 2025
5cb5c2a
docs: more doc cleanup
krrishdholakia Nov 26, 2025
6e5c7c0
fix transcription exception handling - /audio/transcriptions (#16791)
otaviofbrito Nov 26, 2025
5ec3f19
Make model select required for team, add checks for all-proxy-models
yuneng-jiang Nov 26, 2025
5c192a2
[Feat] Add new RAG API on LiteLLM AI Gateway (#17109)
ishaan-jaff Nov 26, 2025
cd65a84
Merge pull request #16844 from Chesars/fix/response-format-to-text-fo…
Sameerlite Nov 26, 2025
b50fcc4
vertex ai: use the correct domain for the global location when counti…
CAFxX Nov 26, 2025
7227747
Improve Wording for Config Models in Model Table (#17100)
yuneng-jiang Nov 26, 2025
7c09187
downgrade grpcio (#17090)
AlexsanderHamir Nov 26, 2025
e6e1e8f
feat(pillar): add automatic LiteLLM context headers (#17076)
eagle-p Nov 26, 2025
a727f71
Optimize date filtering for spend logs queries (#17073)
CAFxX Nov 26, 2025
be97073
feat: Add gemini-3-pro-image-preview model support for imageSize para…
choigawoon Nov 26, 2025
e9ab206
Merge pull request #17098 from BerriAI/litellm_broken_links_ui
yuneng-jiang Nov 26, 2025
6c79240
Merge pull request #17108 from BerriAI/litellm_user_table_sort_ui
yuneng-jiang Nov 26, 2025
38bac31
Merge pull request #17110 from BerriAI/litellm_org_admin_access_fix
yuneng-jiang Nov 26, 2025
31c3913
Fix videos lint errors
Sameerlite Nov 26, 2025
3c2623e
Merge pull request #17125 from BerriAI/litellm_fix_videos_lint
Sameerlite Nov 26, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
6 changes: 4 additions & 2 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -1785,7 +1785,7 @@ jobs:
- audio_coverage
installing_litellm_on_python:
docker:
- image: circleci/python:3.8
- image: cimg/python:3.11
auth:
username: ${DOCKERHUB_USERNAME}
password: ${DOCKERHUB_PASSWORD}
Expand Down Expand Up @@ -3389,7 +3389,9 @@ jobs:
nvm use 20

cd ui/litellm-dashboard
npm ci || npm install
# Remove node_modules and package-lock to ensure clean install (fixes optional deps issue)
rm -rf node_modules package-lock.json
npm install

# CI run, with both LCOV (Codecov) and HTML (artifact you can click)
CI=true npm run test -- --run --coverage \
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/test-litellm.yml
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,7 @@ jobs:
- name: Setup litellm-enterprise as local package
run: |
cd enterprise
python -m pip install -e .
poetry run pip install -e .
cd ..
- name: Run tests
run: |
Expand Down
19 changes: 19 additions & 0 deletions AGENTS.md
Original file line number Diff line number Diff line change
Expand Up @@ -98,6 +98,25 @@ LiteLLM supports MCP for agent workflows:

Use `poetry run python script.py` to run Python scripts in the project environment (for non-test files).

## GITHUB TEMPLATES

When opening issues or pull requests, follow these templates:

### Bug Reports (`.github/ISSUE_TEMPLATE/bug_report.yml`)
- Describe what happened vs. expected behavior
- Include relevant log output
- Specify LiteLLM version
- Indicate if you're part of an ML Ops team (helps with prioritization)

### Feature Requests (`.github/ISSUE_TEMPLATE/feature_request.yml`)
- Clearly describe the feature
- Explain motivation and use case with concrete examples

### Pull Requests (`.github/pull_request_template.md`)
- Add at least 1 test in `tests/litellm/`
- Ensure `make test-unit` passes


## TESTING CONSIDERATIONS

1. **Provider Tests**: Test against real provider APIs when possible
Expand Down
16 changes: 16 additions & 0 deletions CLAUDE.md
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,22 @@ This file provides guidance to Claude Code (claude.ai/code) when working with co
### Running Scripts
- `poetry run python script.py` - Run Python scripts (use for non-test files)

### GitHub Issue & PR Templates
When contributing to the project, use the appropriate templates:

**Bug Reports** (`.github/ISSUE_TEMPLATE/bug_report.yml`):
- Describe what happened vs. what you expected
- Include relevant log output
- Specify your LiteLLM version

**Feature Requests** (`.github/ISSUE_TEMPLATE/feature_request.yml`):
- Describe the feature clearly
- Explain the motivation and use case

**Pull Requests** (`.github/pull_request_template.md`):
- Add at least 1 test in `tests/litellm/`
- Ensure `make test-unit` passes

## Architecture Overview

LiteLLM is a unified interface for 100+ LLM providers with two main components:
Expand Down
2 changes: 1 addition & 1 deletion Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -48,7 +48,7 @@ FROM $LITELLM_RUNTIME_IMAGE AS runtime
USER root

# Install runtime dependencies
RUN apk add --no-cache openssl tzdata
RUN apk add --no-cache openssl tzdata nodejs npm

# Upgrade pip to fix CVE-2025-8869
RUN pip install --upgrade pip>=24.3.1
Expand Down
19 changes: 19 additions & 0 deletions GEMINI.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,6 +25,25 @@ This file provides guidance to Gemini when working with code in this repository.
- `poetry run pytest tests/path/to/test_file.py -v` - Run specific test file
- `poetry run pytest tests/path/to/test_file.py::test_function -v` - Run specific test

### Running Scripts
- `poetry run python script.py` - Run Python scripts (use for non-test files)

### GitHub Issue & PR Templates
When contributing to the project, use the appropriate templates:

**Bug Reports** (`.github/ISSUE_TEMPLATE/bug_report.yml`):
- Describe what happened vs. what you expected
- Include relevant log output
- Specify your LiteLLM version

**Feature Requests** (`.github/ISSUE_TEMPLATE/feature_request.yml`):
- Describe the feature clearly
- Explain the motivation and use case

**Pull Requests** (`.github/pull_request_template.md`):
- Add at least 1 test in `tests/litellm/`
- Ensure `make test-unit` passes

## Architecture Overview

LiteLLM is a unified interface for 100+ LLM providers with two main components:
Expand Down
12 changes: 5 additions & 7 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@
<p align="center">Call all LLM APIs using the OpenAI format [Bedrock, Huggingface, VertexAI, TogetherAI, Azure, OpenAI, Groq etc.]
<br>
</p>
<h4 align="center"><a href="https://docs.litellm.ai/docs/simple_proxy" target="_blank">LiteLLM Proxy Server (LLM Gateway)</a> | <a href="https://docs.litellm.ai/docs/hosted" target="_blank"> Hosted Proxy (Preview)</a> | <a href="https://docs.litellm.ai/docs/enterprise"target="_blank">Enterprise Tier</a></h4>
<h4 align="center"><a href="https://docs.litellm.ai/docs/simple_proxy" target="_blank">LiteLLM Proxy Server (LLM Gateway)</a> | <a href="https://docs.litellm.ai/docs/enterprise#hosted-litellm-proxy" target="_blank"> Hosted Proxy</a> | <a href="https://docs.litellm.ai/docs/enterprise"target="_blank">Enterprise Tier</a></h4>
<h4 align="center">
<a href="https://pypi.org/project/litellm/" target="_blank">
<img src="https://img.shields.io/pypi/v/litellm.svg" alt="PyPI Version">
Expand Down Expand Up @@ -40,18 +40,14 @@ LiteLLM manages:
LiteLLM Performance: **8ms P95 latency** at 1k RPS (See benchmarks [here](https://docs.litellm.ai/docs/benchmarks))

[**Jump to LiteLLM Proxy (LLM Gateway) Docs**](https://github.com/BerriAI/litellm?tab=readme-ov-file#litellm-proxy-server-llm-gateway---docs) <br>
[**Jump to Supported LLM Providers**](https://github.com/BerriAI/litellm?tab=readme-ov-file#supported-providers-docs)
[**Jump to Supported LLM Providers**](https://docs.litellm.ai/docs/providers)

🚨 **Stable Release:** Use docker images with the `-stable` tag. These have undergone 12 hour load tests, before being published. [More information about the release cycle here](https://docs.litellm.ai/docs/proxy/release_cycle)

Support for more providers. Missing a provider or LLM Platform, raise a [feature request](https://github.com/BerriAI/litellm/issues/new?assignees=&labels=enhancement&projects=&template=feature_request.yml&title=%5BFeature%5D%3A+).

# Usage ([**Docs**](https://docs.litellm.ai/docs/))

> [!IMPORTANT]
> LiteLLM v1.0.0 now requires `openai>=1.0.0`. Migration guide [here](https://docs.litellm.ai/docs/migration)
> LiteLLM v1.40.14+ now requires `pydantic>=2.0.0`. No changes required.

<a target="_blank" href="https://colab.research.google.com/github/BerriAI/litellm/blob/main/cookbook/liteLLM_Getting_Started.ipynb">
<img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
</a>
Expand Down Expand Up @@ -114,6 +110,8 @@ print(response)
}
```

> **Note:** LiteLLM also supports the [Responses API](https://docs.litellm.ai/docs/response_api) (`litellm.responses()`)

Call any model supported by a provider, with `model=<provider_name>/<model_name>`. There might be provider-specific details here, so refer to [provider docs for more information](https://docs.litellm.ai/docs/providers)

## Async ([Docs](https://docs.litellm.ai/docs/completion/stream#async-completion))
Expand Down Expand Up @@ -210,7 +208,7 @@ response = completion(model="openai/gpt-4o", messages=[{"role": "user", "content

Track spend + Load Balance across multiple projects

[Hosted Proxy (Preview)](https://docs.litellm.ai/docs/hosted)
[Hosted Proxy](https://docs.litellm.ai/docs/enterprise#hosted-litellm-proxy)

The proxy provides:

Expand Down
3 changes: 3 additions & 0 deletions ci_cd/security_scans.sh
Original file line number Diff line number Diff line change
Expand Up @@ -69,10 +69,13 @@ run_grype_scans() {
# Allowlist of CVEs to be ignored in failure threshold/reporting
# - CVE-2025-8869: Not applicable on Python >=3.13 (PEP 706 implemented); pip fallback unused; no OS-level fix
# - GHSA-4xh5-x5gv-qwph: GitHub Security Advisory alias for CVE-2025-8869
# - GHSA-5j98-mcp5-4vw2: glob CLI command injection via -c/--cmd; glob CLI is not used in the litellm runtime image,
# and the vulnerable versions are pulled in only via OS-level/node tooling outside of our application code
ALLOWED_CVES=(
"CVE-2025-8869"
"GHSA-4xh5-x5gv-qwph"
"CVE-2025-8291" # no fix available as of Oct 11, 2025
"GHSA-5j98-mcp5-4vw2"
)

# Build JSON array of allowlisted CVE IDs for jq
Expand Down
2 changes: 1 addition & 1 deletion cookbook/LiteLLM_HuggingFace.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -131,7 +131,7 @@
" {\n",
" \"type\": \"image_url\",\n",
" \"image_url\": {\n",
" \"url\": \"https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg\",\n",
" \"url\": \"https://awsmp-logos.s3.amazonaws.com/seller-xw5kijmvmzasy/c233c9ade2ccb5491072ae232c814942.png\",\n",
" },\n",
" },\n",
" ],\n",
Expand Down
70 changes: 50 additions & 20 deletions cookbook/misc/RELEASE_NOTES_GENERATION_INSTRUCTIONS.md
Original file line number Diff line number Diff line change
Expand Up @@ -43,6 +43,14 @@ hide_table_of_contents: false
## Key Highlights
[3-5 bullet points of major features - prioritize MCP OAuth 2.0, scheduled key rotations, and major model updates]

## New Providers and Endpoints

### New Providers
[Table with Provider, Supported Endpoints, Description columns]

### New LLM API Endpoints
[Optional table for new endpoint additions with Endpoint, Method, Description, Documentation columns]

## New Models / Updated Models
#### New Model Support
[Model pricing table]
Expand All @@ -53,9 +61,6 @@ hide_table_of_contents: false
### Bug Fixes
[Provider-specific bug fixes organized by provider]

#### New Provider Support
[New provider integrations]

## LLM API Endpoints
#### Features
[API-specific features organized by API type]
Expand All @@ -70,16 +75,20 @@ hide_table_of_contents: false
#### Bugs
[Management-related bug fixes]

## Logging / Guardrail / Prompt Management Integrations
#### Features
[Organized by integration provider with proper doc links]
## AI Integrations

### Logging
[Logging integrations organized by provider with proper doc links, includes General subsection]

#### Guardrails
### Guardrails
[Guardrail-specific features and fixes]

#### Prompt Management
### Prompt Management
[Prompt management integrations like BitBucket]

### Secret Managers
[Secret manager integrations - AWS, HashiCorp Vault, CyberArk, etc.]

## Spend Tracking, Budgets and Rate Limiting
[Cost tracking, service tier pricing, rate limiting improvements]

Expand Down Expand Up @@ -149,26 +158,34 @@ hide_table_of_contents: false
- Admin settings updates
- Management routes and endpoints

**Logging / Guardrail / Prompt Management Integrations:**
**AI Integrations:**
- **Structure:**
- `#### Features` - organized by integration provider with proper doc links
- `#### Guardrails` - guardrail-specific features and fixes
- `#### Prompt Management` - prompt management integrations
- `#### New Integration` - major new integrations
- **Integration Categories:**
- `### Logging` - organized by integration provider with proper doc links, includes **General** subsection
- `### Guardrails` - guardrail-specific features and fixes
- `### Prompt Management` - prompt management integrations
- `### Secret Managers` - secret manager integrations
- **Logging Categories:**
- **[DataDog](../../docs/proxy/logging#datadog)** - group all DataDog-related changes
- **[Langfuse](../../docs/proxy/logging#langfuse)** - Langfuse-specific features
- **[Prometheus](../../docs/proxy/logging#prometheus)** - monitoring improvements
- **[PostHog](../../docs/observability/posthog)** - observability integration
- **[SQS](../../docs/proxy/logging#sqs)** - SQS logging features
- **[Opik](../../docs/proxy/logging#opik)** - Opik integration improvements
- **[Arize Phoenix](../../docs/observability/arize_phoenix)** - Arize Phoenix integration
- **General** - miscellaneous logging features like callback controls, sensitive data masking
- Other logging providers with proper doc links
- **Guardrail Categories:**
- LakeraAI, Presidio, Noma, and other guardrail providers
- LakeraAI, Presidio, Noma, Grayswan, IBM Guardrails, and other guardrail providers
- **Prompt Management:**
- BitBucket, GitHub, and other prompt management integrations
- Prompt versioning, testing, and UI features
- **Secret Managers:**
- **[AWS Secrets Manager](../../docs/secret_managers)** - AWS secret manager features
- **[HashiCorp Vault](../../docs/secret_managers)** - Vault integrations
- **[CyberArk](../../docs/secret_managers)** - CyberArk integrations
- **General** - cross-secret-manager features
- Use bullet points under each provider for multiple features
- Separate logging features from guardrails and prompt management clearly
- Separate logging, guardrails, prompt management, and secret managers clearly

### 4. Documentation Linking Strategy

Expand Down Expand Up @@ -232,6 +249,9 @@ From git diff analysis, create tables like:
- **Cost breakdown in logging** → Spend Tracking section
- **MCP configuration/OAuth** → MCP Gateway (NOT General Proxy Improvements)
- **All documentation PRs** → Documentation Updates section for visibility
- **Callback controls/logging features** → AI Integrations > Logging > General
- **Secret manager features** → AI Integrations > Secret Managers
- **Video generation tag-based routing** → LLM API Endpoints > Video Generation API

### 7. Writing Style Guidelines

Expand Down Expand Up @@ -370,10 +390,20 @@ This release has a known issue...
- **Virtual Keys** - Key rotation and management
- **Models + Endpoints** - Provider and endpoint management

**Logging Section Expansion:**
- Rename to "Logging / Guardrail / Prompt Management Integrations"
- Add **Prompt Management** subsection for BitBucket, GitHub integrations
- Keep guardrails separate from logging features
**AI Integrations Section Expansion:**
- Renamed from "Logging / Guardrail / Prompt Management Integrations" to "AI Integrations"
- Structure with four main subsections:
- **Logging** - with **General** subsection for miscellaneous logging features
- **Guardrails** - separate from logging features
- **Prompt Management** - BitBucket, GitHub integrations, versioning features
- **Secret Managers** - AWS, HashiCorp Vault, CyberArk, etc.

**New Providers and Endpoints Section:**
- Add section after Key Highlights and before New Models / Updated Models
- Include tables for:
- **New Providers** - Provider name, supported endpoints, description
- **New LLM API Endpoints** (optional) - Endpoint, method, description, documentation link
- Only include major new provider integrations, not minor provider updates

## Example Command Workflow

Expand Down
1 change: 1 addition & 0 deletions cookbook/misc/update_json_caching.py
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,7 @@
"gpt-4o-2024-05-13",
"text-embedding-3-small",
"text-embedding-3-large",
"text-embedding-ada-002-v2",
"ft:gpt-4o-2024-08-06",
"ft:gpt-4o-mini-2024-07-18",
"ft:gpt-3.5-turbo",
Expand Down
9 changes: 9 additions & 0 deletions deploy/charts/litellm-helm/templates/deployment.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -129,6 +129,10 @@ spec:
args:
- --config
- /etc/litellm/config.yaml
{{ if .Values.numWorkers }}
- --num_workers
- {{ .Values.numWorkers | quote }}
{{- end }}
ports:
- name: http
containerPort: {{ .Values.service.port }}
Expand Down Expand Up @@ -208,3 +212,8 @@ spec:
tolerations:
{{- toYaml . | nindent 8 }}
{{- end }}
terminationGracePeriodSeconds: {{ .Values.terminationGracePeriodSeconds | default 90 }}
{{- if .Values.topologySpreadConstraints }}
topologySpreadConstraints:
{{- toYaml .Values.topologySpreadConstraints | nindent 8 }}
{{- end }}
39 changes: 39 additions & 0 deletions deploy/charts/litellm-helm/templates/servicemonitor.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
{{- with .Values.serviceMonitor }}
{{- if and (eq .enabled true) }}
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
name: {{ include "litellm.fullname" $ }}
labels:
{{- include "litellm.labels" $ | nindent 4 }}
{{- if .labels }}
{{- toYaml .labels | nindent 4 }}
{{- end }}
{{- if .annotations }}
annotations:
{{- toYaml .annotations | nindent 4 }}
{{- end }}
spec:
selector:
matchLabels:
{{- include "litellm.selectorLabels" $ | nindent 6 }}
namespaceSelector:
matchNames:
# if not set, use the release namespace
{{- if not .namespaceSelector.matchNames }}
- {{ $.Release.Namespace | quote }}
{{- else }}
{{- toYaml .namespaceSelector.matchNames | nindent 4 }}
{{- end }}
endpoints:
- port: http
path: /metrics/
interval: {{ .interval }}
scrapeTimeout: {{ .scrapeTimeout }}
scheme: http
{{- if .relabelings }}
relabelings:
{{- toYaml .relabelings | nindent 4 }}
{{- end }}
{{- end }}
{{- end }}
Loading
Loading