feat: Add Azure CLI authentication support for Azure OpenAI #1426

shubhamdeodia · 2025-11-13T15:44:22Z

Azure CLI Authentication Implementation

Pull Request Documentation

Summary

This PR adds support for Azure CLI-based authentication (azure_auth_mode: "azure_cli") to the Portkey AI Gateway for both Azure OpenAI and Azure AI Inference services. This enhancement enables developers to authenticate using their existing Azure CLI credentials, simplifying local development and testing workflows.

Changes Overview

Files Modified

src/providers/azure-openai/utils.ts
- Added getAzureCliToken() function to obtain access tokens via Azure CLI
src/providers/azure-openai/api.ts
- Imported getAzureCliToken function
- Added azure_cli authentication mode handler in headers function
src/providers/azure-ai-inference/api.ts
- Imported getAzureCliToken function
- Added azure_cli authentication mode handler in headers function
plugins/azure/utils.ts
- Added getAzureCliToken() function for Azure plugin support
- Updated getAccessToken() to handle azure_cli mode
plugins/azure/types.ts
- Updated AzureCredentials interface with strict union type
- Added 'azure_cli' to valid authentication modes
src/types/requestBody.ts
- Added JSDoc documentation for azureAuthMode in 3 interfaces (Options, Targets, ShortConfig)
- Documented all valid authentication modes including azure_cli

Authentication Flow Sequence Diagram

sequenceDiagram
    participant Client
    participant Gateway as Portkey Gateway<br/>(Node.js)
    participant Utils as getAzureCliToken()<br/>Function
    participant CLI as Azure CLI<br/>(Local Process)
    participant Azure as Azure OpenAI API

    Client->>Gateway: Request with config<br/>{azure_auth_mode: "azure_cli"}
    
    Gateway->>Gateway: Detect azure_auth_mode === "azure_cli"<br/>& runtime === "node"
    
    Gateway->>Utils: Call getAzureCliToken(scope)
    
    Utils->>CLI: execSync('az account get-access-token<br/>--resource https://cognitiveservices.azure.com/')
    
    alt Azure CLI Success
        CLI->>Utils: Return JSON<br/>{accessToken: "eyJ0...", expiresOn: "..."}
        Utils->>Utils: Parse JSON and extract accessToken
        Utils->>Gateway: Return access token
        Gateway->>Gateway: Add Authorization header<br/>"Bearer eyJ0..."
        Gateway->>Azure: Forward request with Bearer token
        Azure->>Gateway: Response
        Gateway->>Client: Response
    else Azure CLI Error
        CLI->>Utils: Error: "az not found" or<br/>"Not logged in"
        Utils->>Utils: Log error message
        Utils->>Gateway: Return undefined
        Gateway->>Gateway: Fall back to API key authentication
        Gateway->>Azure: Forward request with api-key header
        Azure->>Gateway: Response (or error if no API key)
        Gateway->>Client: Response
    end

Technical Implementation Details

1. Token Acquisition Function

Location: src/providers/azure-openai/utils.ts

export function getAzureCliToken(
  scope = 'https://cognitiveservices.azure.com/.default'
): string | undefined {
  try {
    const { execSync } = require('child_process');
    
    // Execute Azure CLI command to get access token
    const command = `az account get-access-token --resource ${scope.replace('/.default', '')}`;
    const output = execSync(command, { encoding: 'utf-8' });
    
    const tokenData = JSON.parse(output);
    return tokenData.accessToken;
  } catch (error: any) {
    console.error('getAzureCliToken error: ', error?.message || error);
    console.error(
      'Make sure Azure CLI is installed and you are logged in using "az login"'
    );
    return undefined;
  }
}

Key Design Decisions:

Synchronous Execution: Uses execSync to block until token is retrieved (simplifies async flow)
Scope Handling: Removes /.default suffix as Azure CLI expects raw resource URL
Error Handling: Returns undefined on failure, allowing fallback to API key authentication
Logging: Provides helpful error messages guiding users to install/login to Azure CLI

2. Azure OpenAI Integration

Location: src/providers/azure-openai/api.ts

// Azure CLI authentication mode - only available in Node.js runtime
if (azureAuthMode === 'azure_cli' && runtime === 'node') {
  const scope = 'https://cognitiveservices.azure.com/.default';
  const accessToken = getAzureCliToken(scope);
  if (accessToken) {
    return {
      Authorization: `Bearer ${accessToken}`,
    };
  }
}

Integration Points:

Placed after workload auth mode and before API key fallback
Runtime check ensures it only executes in Node.js environment
Returns immediately if token is successfully obtained
Falls through to API key auth if token retrieval fails

3. Azure AI Inference Integration

Location: src/providers/azure-ai-inference/api.ts

// Azure CLI authentication mode - only available in Node.js runtime
if (azureAuthMode === 'azure_cli' && runtime === 'node') {
  const scope = 'https://cognitiveservices.azure.com/.default';
  const accessToken = getAzureCliToken(scope);
  if (accessToken) {
    headers['Authorization'] = `Bearer ${accessToken}`;
    return headers;
  }
}

Differences from OpenAI Implementation:

Modifies existing headers object rather than returning new object
Maintains consistency with other auth modes in the same file
Same runtime and error handling logic

4. Azure Plugin Support

Location: plugins/azure/utils.ts

export function getAzureCliToken(
  scope = 'https://cognitiveservices.azure.com/.default',
  check: string
): { token: string; error: string | null } {
  const result: { token: string; error: string | null } = {
    token: '',
    error: null,
  };
  
  try {
    // Note: Azure CLI auth only works in Node.js runtime
    if (typeof process === 'undefined' || !process.versions?.node) {
      result.error = 'Azure CLI authentication requires Node.js runtime';
      return result;
    }

    const { execSync } = require('child_process');
    const command = `az account get-access-token --resource ${scope.replace('/.default', '')}`;
    const output = execSync(command, { encoding: 'utf-8' });
    
    const tokenData = JSON.parse(output);
    result.token = tokenData.accessToken;
  } catch (error: any) {
    result.error = error?.message || String(error);
    console.error('getAzureCliToken error: ', result.error);
    console.error(
      'Make sure Azure CLI is installed and you are logged in using "az login"'
    );
  }
  
  return result;
}

Integration in getAccessToken():

if (azureAuthMode === 'azure_cli') {
  tokenResult = getAzureCliToken(scope, check);
}

Key Features:

Returns object with token and error properties for consistent error handling
Runtime validation to ensure Node.js environment
Compatible with Azure plugin architecture
Enables Azure CLI auth for Content Safety and other Azure plugins

5. Type Definitions

Location: plugins/azure/types.ts

export interface AzureCredentials {
  resourceName: string;
  azureAuthMode: 'apiKey' | 'entra' | 'managed' | 'azure_cli';
  apiKey?: string;
  clientId?: string;
  clientSecret?: string;
  tenantId?: string;
  customHost?: string;
}

Location: src/types/requestBody.ts

/** Azure authentication mode: 'apiKey' | 'entra' | 'managed' | 'workload' | 'azure_cli' */
azureAuthMode?: string;

Type Safety Features:

Strict union type in AzureCredentials for compile-time checking
JSDoc documentation in request body types for IDE support
Added to 3 interfaces: Options, Targets, and ShortConfig
Provides autocomplete and hover documentation in IDEs

Authentication Mode Precedence

The authentication logic follows this order:

azureAdToken (if provided directly)
entra mode (client credentials flow)
managed mode (managed identity)
workload mode (workload identity)
azure_cli mode (Azure CLI tokens) ← New
API Key (fallback)

Runtime Requirements

The azure_cli mode is only available in Node.js runtime because:

Requires child_process.execSync to execute shell commands
Needs access to locally installed Azure CLI
Not compatible with serverless environments (Cloudflare Workers, Vercel Edge, etc.)

Runtime Detection:

import { getRuntimeKey } from 'hono/adapter';
const runtime = getRuntimeKey();

if (azureAuthMode === 'azure_cli' && runtime === 'node') {
  // Only executes in Node.js
}

Security Considerations

Token Security

Tokens are obtained fresh for each request (no caching in this implementation)
Tokens are scoped specifically to Azure Cognitive Services
Token lifetime is managed by Azure CLI (typically 1 hour)

Credential Management

No credentials stored in code or configuration
Leverages Azure CLI's secure credential storage
Uses OS-level keychain/credential manager

Audit Trail

All authentication attempts logged through Azure's audit systems
Failed authentications generate console errors for debugging

Error Scenarios and Handling

Scenario	Detection	Handling	User Message
Azure CLI not installed	`execSync` throws error	Log error, return `undefined`, fall back to API key	"Make sure Azure CLI is installed"
Not logged in	Azure CLI returns error	Log error, return `undefined`, fall back to API key	"Make sure you are logged in using 'az login'"
Insufficient permissions	Azure returns 401/403	Propagated to client	Azure error message
Wrong subscription	Token for wrong subscription	Azure returns 404	"Resource not found"
Serverless runtime	Runtime check	Skip azure_cli auth	Silent - uses other auth methods

Configuration Examples

Minimal Configuration

{
  "provider": "azure-openai",
  "azure_auth_mode": "azure_cli",
  "resource_name": "my-openai",
  "deployment_id": "gpt-4",
  "api_version": "2024-02-15-preview"
}

With Fallback to API Key

{
  "provider": "azure-openai",
  "azure_auth_mode": "azure_cli",
  "api_key": "${AZURE_OPENAI_API_KEY}",
  "resource_name": "my-openai",
  "deployment_id": "gpt-4",
  "api_version": "2024-02-15-preview"
}

Multiple Providers with Loadbalancing

{
  "strategy": {
    "mode": "loadbalance"
  },
  "targets": [
    {
      "provider": "azure-openai",
      "azure_auth_mode": "azure_cli",
      "resource_name": "openai-dev",
      "deployment_id": "gpt-4",
      "api_version": "2024-02-15-preview",
      "weight": 1
    },
    {
      "provider": "azure-openai",
      "azure_auth_mode": "entra",
      "azure_entra_client_id": "${AZURE_CLIENT_ID}",
      "azure_entra_client_secret": "${AZURE_CLIENT_SECRET}",
      "azure_entra_tenant_id": "${AZURE_TENANT_ID}",
      "resource_name": "openai-prod",
      "deployment_id": "gpt-4",
      "api_version": "2024-02-15-preview",
      "weight": 2
    }
  ]
}

Testing

Manual Testing Steps

Prerequisites:

# Install Azure CLI
brew install azure-cli  # macOS

# Login
az login

# Verify login
az account show

Test Basic Authentication:

# Create test config
cat > config.json << EOF
{
  "provider": "azure-openai",
  "azure_auth_mode": "azure_cli",
  "resource_name": "your-resource-name",
  "deployment_id": "gpt-4",
  "api_version": "2024-02-15-preview"
}
EOF

# Make request
curl -X POST http://localhost:8787/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "x-portkey-config: $(cat config.json | base64)" \
  -d '{
    "messages": [{"role": "user", "content": "Hello!"}],
    "model": "gpt-4"
  }'

Test Error Handling:

# Logout from Azure CLI
az logout

# Attempt request (should fail gracefully)
curl -X POST http://localhost:8787/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "x-portkey-config: $(cat config.json | base64)" \
  -d '{
    "messages": [{"role": "user", "content": "Hello!"}],
    "model": "gpt-4"
  }'

Performance Considerations

Token Acquisition Time

First Request: ~100-300ms (executing Azure CLI command)
Subsequent Requests: Same time (no caching in current implementation)

Future Optimization Opportunities

Token Caching: Cache tokens until 5 minutes before expiration
Background Refresh: Proactively refresh tokens in background
Shared Token Pool: Share tokens across multiple gateway instances

Backwards Compatibility

This change is 100% backwards compatible:

No changes to existing authentication modes
No changes to type definitions (uses existing azureAuthMode?: string)
Falls back gracefully to API key auth if Azure CLI auth fails
Only activates when explicitly configured with azure_auth_mode: "azure_cli"

Migration Guide

For users wanting to switch from API key to Azure CLI authentication:

Step 1: Install and configure Azure CLI

az login
az account set --subscription "your-subscription-id"

Step 2: Update configuration

{
  "provider": "azure-openai",
- "api_key": "sk-***",
+ "azure_auth_mode": "azure_cli",
  "resource_name": "my-openai-resource",
  "deployment_id": "gpt-4",
  "api_version": "2024-02-15-preview"
}

Step 3: Test the connection

curl -X POST http://localhost:8787/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "x-portkey-config: ..." \
  -d '{"messages": [{"role": "user", "content": "Test"}], "model": "gpt-4"}'

Deployment Considerations

Local Development

✅ Recommended Use Case

Perfect for local testing and development
No need to manage API keys in local environment

CI/CD Pipelines

✅ Supported

Can use service principal login in CI/CD
Requires Node.js runtime in CI/CD environment

Production Serverless

❌ Not Supported

Azure CLI not available in Cloudflare Workers, Vercel Edge, etc.
Use entra, managed, or workload modes instead

Production VMs/Containers

⚠️ Use with Caution

Works but not recommended for production
Prefer managed or entra modes for production workloads
If using, ensure Azure CLI is properly configured in container

Breaking Changes

None - This is a purely additive change with no breaking changes to existing functionality.

Summary of Changes

This PR adds comprehensive Azure CLI authentication support across the entire gateway:

Core Provider Support:

✅ Azure OpenAI provider (src/providers/azure-openai/*)
✅ Azure AI Inference provider (src/providers/azure-ai-inference/*)

Plugin Support:

✅ Azure plugins (plugins/azure/*)
✅ Content Safety and other Azure services

Type Safety:

✅ Strict union types in AzureCredentials
✅ JSDoc documentation in request body types
✅ Full IntelliSense and autocomplete support

Documentation:

✅ Feature overview and use cases
✅ Technical implementation details with sequence diagram
✅ Type definitions documentation
✅ Quick reference guide

Total Lines Changed: ~150+ lines added across 6 code files + 4 documentation files

Related Issues: N/A (Feature Request)
Breaking Changes: None
Deployment Notes: Requires Node.js runtime for functionality

…unctions

…ken function

narengogi

Hey @shubhamdeodia thank you for the PR, but I'm having a tough time understanding the need for this implementation.

What are you trying to achieve here.

From what I understand, is this so that people can deploy the gateway on azure and use configured environment variables to generate a temporary token and make the inference request with it?

I believe what you are looking for is the azureManagedIdentity implementation, which serves the same puprose
https://github.com/Portkey-AI/gateway-enterprise-node/blob/66ea88c8c00d5b177e4940e4f79b1bd0e2369998/src/providers/azure-openai/utils.ts#L54

spawning a child process for every request and expecting azure cli (which is an external dependency on the system the gateway is deployed on) is a red flag and anti pattern.

can you please detail what exactly is your requirement here?

shubhamdeodia · 2025-11-17T14:05:35Z

@narengogi

This implementation is not intended for production deployments, Azure-hosted gateways, or situations where Managed Identity / Entra ID already exists.

The primary purpose is to support local development workflows where developers are:

running the Portkey Gateway on their local machine (Node.js)
already authenticated via az login
accessing Azure OpenAI or Azure AI Inference without needing to store or manage API keys

In our company, we never stored secrets of any sorts (not even for Testing)

narengogi · 2025-11-17T14:25:03Z

got it @shubhamdeodia
The right approach for this would be to programmatically generate the token and pass it in your request with the Authorization header
It does make sense that it is an added step for you but I've discussed with the team and they're not in favour of corrupting the gateway code with development workflows,
thankyou for taking the time to implement this but I do not see it as desired change at this point

shubhamdeodia · 2025-11-17T14:35:08Z

@narengogi thanks, I think that make sense. Seems like I missed that it supports the Auth Headers.

feat: add Azure CLI authentication mode and related token retrieval f…

7f45c3c

…unctions

shubhamdeodia force-pushed the feat/azure-cli-auth branch from 4013886 to 7f45c3c Compare November 13, 2025 15:51

shubhamdeodia and others added 2 commits November 13, 2025 11:04

feat: integrate Azure CLI for access token retrieval in getAzureCliTo…

7e68f6f

…ken function

Merge branch 'main' into feat/azure-cli-auth

d18f807

rafik43 approved these changes Nov 14, 2025

View reviewed changes

Merge branch 'main' into feat/azure-cli-auth

049e717

shubhamdeodia mentioned this pull request Nov 14, 2025

[Feature] Azure CLI Authentication for Azure OpenAI #1425

Closed

narengogi requested a review from b4s36t4 November 17, 2025 13:38

narengogi requested changes Nov 17, 2025

View reviewed changes

Merge branch 'main' into feat/azure-cli-auth

cb1db5b

narengogi linked an issue Nov 17, 2025 that may be closed by this pull request

[Feature] Azure CLI Authentication for Azure OpenAI #1425

Closed

narengogi closed this Nov 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add Azure CLI authentication support for Azure OpenAI #1426

feat: Add Azure CLI authentication support for Azure OpenAI #1426

shubhamdeodia commented Nov 13, 2025 •

edited

Loading

Uh oh!

narengogi left a comment

Uh oh!

shubhamdeodia commented Nov 17, 2025

Uh oh!

narengogi commented Nov 17, 2025

Uh oh!

shubhamdeodia commented Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: Add Azure CLI authentication support for Azure OpenAI #1426

feat: Add Azure CLI authentication support for Azure OpenAI #1426

Conversation

shubhamdeodia commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Azure CLI Authentication Implementation

Pull Request Documentation

Summary

Changes Overview

Files Modified

Authentication Flow Sequence Diagram

Technical Implementation Details

1. Token Acquisition Function

2. Azure OpenAI Integration

3. Azure AI Inference Integration

4. Azure Plugin Support

5. Type Definitions

Authentication Mode Precedence

Runtime Requirements

Security Considerations

Token Security

Credential Management

Audit Trail

Error Scenarios and Handling

Configuration Examples

Minimal Configuration

With Fallback to API Key

Multiple Providers with Loadbalancing

Testing

Manual Testing Steps

Performance Considerations

Token Acquisition Time

Future Optimization Opportunities

Backwards Compatibility

Migration Guide

Deployment Considerations

Local Development

CI/CD Pipelines

Production Serverless

Production VMs/Containers

Breaking Changes

Summary of Changes

Uh oh!

narengogi left a comment

Choose a reason for hiding this comment

Uh oh!

shubhamdeodia commented Nov 17, 2025

Uh oh!

narengogi commented Nov 17, 2025

Uh oh!

shubhamdeodia commented Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shubhamdeodia commented Nov 13, 2025 •

edited

Loading