AI Gateway

Gateway settings

Manage API keys, budget limits, and guardrail configuration.

Overview

The Settings page covers 3 areas: API keys for LLM providers, monthly budget limits and guardrail configuration. Changes take effect immediately.

API keys

API keys connect the gateway to LLM providers. Each key is encrypted at rest using AES-256-CBC and only decrypted at the moment a request is proxied to the provider. Keys are scoped to your organization and can be referenced by multiple endpoints.

Adding a key

Click "Add key"
Enter a name (e.g., "Production OpenAI key")
Select the provider from the dropdown. The top 10 providers are listed first, with 100+ additional providers below the divider.
Paste your API key
Click "Add key"

Key security

API keys are never displayed in full after creation. The key list shows a masked version (first and last few characters). Keys are never logged, even in error messages.

Budget

Budgets set a monthly spending limit across all endpoints. The budget section shows current spend with a progress bar, alert threshold and hard limit status. Spend resets automatically on the 1st of each month.

Budget options

Setting	Description
Monthly limit	Maximum spend in USD per month. Must be a positive number.
Alert threshold	Percentage (0-100) at which the progress bar turns red as a visual warning.
Hard limit	When enabled, requests are rejected with HTTP 429 once the budget is exceeded. When disabled, requests continue but the progress bar shows the overage.

Budget alerts

When spend crosses the alert threshold percentage, the system logs an alert. Alerts are deduplicated per month (only 1 alert per threshold crossing per month). The budget spend resets automatically on the 1st of each month via a background job.

Guardrail settings

These settings control the global behavior of guardrail scanning. They apply to all guardrail rules regardless of type.

Error behavior

Controls what happens when the guardrail scanner itself fails (e.g., the AI Gateway service is temporarily unavailable):

PII scan on error: Block (default): If PII scanning fails, all requests are blocked. Fail-closed. Recommended for regulated environments.
Content filter on error: Allow (default): If content filtering fails, requests go through. Fail-open. Prevents a scanner outage from blocking all AI traffic.

Replacement text

When a guardrail masks content, these settings control the replacement text:

PII replacement format: Default: "<ENTITY_TYPE>". The placeholder ENTITY_TYPE is replaced with the detected type (e.g., "<EMAIL_ADDRESS>", "<CREDIT_CARD>").
Content filter replacement: Default: "[REDACTED]". A fixed string that replaces any matched keyword or regex pattern.

Audit log retention

Guardrail detection logs record every blocked or masked request for compliance auditing (EU AI Act Art. 12). The retention period controls how long these logs are kept. Default: 90 days. Use the "Purge old logs" button to immediately delete logs older than the retention period.

Log purge is irreversible

Purged logs cannot be recovered. Ensure your retention period meets your organization's compliance requirements before reducing it.

Guardrails

Create and manage the PII and content filter rules configured here.

Endpoints

Endpoints reference the API keys and budgets managed in settings.

Virtual keys

Virtual keys have their own per-key budgets separate from the org budget.

Logs

Request and response body logging is controlled by settings on this page.

PreviousGuardrails

NextVirtual keys