Rate limiting

Rate limit policies let you cap how many verification requests a key can serve within a time window. Talos stores the policy on the key, returns it in verification responses, and -- in the Commercial edition -- enforces it server-side. For background on how enforcement works in each edition, see the rate limiting concepts page.

Prerequisites

A running Talos server with rate limiting enabled. See the quickstart to start one locally.

Attach a rate limit policy

Set a rate limit policy when issuing a key. The policy defines a quota (maximum requests) and a window (time window as a duration string, e.g. "60s"):

CLI:

RESPONSE=$(talos keys issue "rate-limited-key" \
  --actor service_api \
  --rate-limit-quota 100 \
  --rate-limit-window "60s" \
  --format json \
  -e "$TALOS_URL" 2>/dev/null)

echo "$RESPONSE" | jq .

export API_SECRET=$(echo "$RESPONSE" | jq -er '.secret')
export KEY_ID=$(echo "$RESPONSE" | jq -er '.issued_api_key.key_id')

curl:

RESPONSE=$(curl -s -X POST "$TALOS_URL/v2alpha1/admin/issuedApiKeys" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "rate-limited-key",
    "actor_id": "service_api",
    "rate_limit_policy": {
      "quota": 100,
      "window": "60s"
    }
  }')

echo "$RESPONSE" | jq .

export API_SECRET=$(echo "$RESPONSE" | jq -er '.secret')
export KEY_ID=$(echo "$RESPONSE" | jq -er '.key_id')

The response includes the full key metadata with the rate_limit_policy attached. For the complete request and response field reference, see the IssueAPIKey API reference.

Verify a rate-limited key

Verify the key as you would any other credential. When the key has a rate limit policy, the response includes the policy metadata:

CLI:

talos keys verify "$API_SECRET" -e "$TALOS_URL"

curl:

curl -s -X POST "$TALOS_URL/v2alpha1/admin/apiKeys:verify" \
  -H "Content-Type: application/json" \
  -d "{\"credential\":\"$API_SECRET\"}" | jq .

When rate limiting is enabled (Commercial), the response also includes rate_limit_remaining (approximate requests available before the limit is reached) and rate_limit_reset_time (when full capacity is recovered). For the complete response field reference, see the VerifyAPIKey API reference.

Exceeding the limit

In the Commercial edition, when a key's quota is exhausted, verification returns is_active: false with the error code VERIFICATION_ERROR_RATE_LIMITED. The response body includes the error code and a human-readable message:

{
  "is_active": false,
  "error_code": "VERIFICATION_ERROR_RATE_LIMITED",
  "error_message": "Rate limit exceeded"
}

The HTTP response also includes a Retry-After header indicating how many seconds the client should wait before retrying. In the OSS edition, enforcement is external -- Talos always returns the policy metadata but does not reject requests based on quota.

For the complete list of verification error codes, see the error codes reference.
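
A client can honor the Retry-After header with a small helper. The following is a minimal sketch, not part of Talos: the function name retry_after_seconds and the fallback delay are assumptions for illustration, and it handles only the delay-in-seconds form of the header.

```shell
# Hypothetical helper (not part of Talos): print the Retry-After delay in
# seconds found in a block of raw HTTP response headers. If the header is
# missing, print the fallback passed as the second argument.
retry_after_seconds() {
  headers=$1
  fallback=$2
  value=$(printf '%s\n' "$headers" \
    | tr -d '\r' \
    | sed -n 's/^[Rr]etry-[Aa]fter: *\([0-9][0-9]*\).*$/\1/p' \
    | head -n 1)
  printf '%s\n' "${value:-$fallback}"
}

# Example with sample headers; in practice they could be captured with
# curl's -D option while calling the verification endpoint.
sample_headers='Content-Type: application/json
Retry-After: 18'
retry_after_seconds "$sample_headers" 5   # prints 18
```

A retry loop could then run sleep "$(retry_after_seconds "$headers" 5)" before verifying the key again.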

Update rate limit policy

Use PATCH to change a key's rate limit policy without rotating the secret. Include rate_limit_policy in the update_mask:

CLI:

talos keys issued update "$KEY_ID" \
  --rate-limit-quota 500 \
  --rate-limit-window "120s" \
  -e "$TALOS_URL"

curl:

curl -s -X PATCH "$TALOS_URL/v2alpha1/admin/issuedApiKeys/$KEY_ID" \
  -H "Content-Type: application/json" \
  -d '{
    "rate_limit_policy": {
      "quota": 500,
      "window": "120s"
    },
    "update_mask": {"paths": ["rate_limit_policy"]}
  }' | jq .

The updated policy takes effect on the next verification request (subject to cache TTL). For the complete update field reference, see the UpdateIssuedAPIKey API reference.

Remove rate limit policy

To remove a rate limit policy entirely, set rate_limit_policy to an empty object in the API request, or pass a zero quota and window with the CLI:

CLI:

talos keys issued update "$KEY_ID" \
  --rate-limit-quota 0 \
  --rate-limit-window "0s" \
  -e "$TALOS_URL"

curl:

curl -s -X PATCH "$TALOS_URL/v2alpha1/admin/issuedApiKeys/$KEY_ID" \
  -H "Content-Type: application/json" \
  -d '{
    "rate_limit_policy": {},
    "update_mask": {"paths": ["rate_limit_policy"]}
  }' | jq .

Once removed, the key is no longer subject to rate limiting.

HTTP response headers

When a key has a rate limit policy, the HTTP gateway adds rate limit headers to verification responses, following the IETF draft "RateLimit header fields for HTTP":

| Header | When present | Description |
| --- | --- | --- |
| `RateLimit-Policy` | Always (with policy) | Declares the quota and window: `100;w=60` |
| `RateLimit` | Always (with policy) | Current state: `limit=100, remaining=42, reset=18` |
| `Retry-After` | Only when limited | Seconds to wait before the next allowed request |

These headers are present in both editions. In the OSS edition, your API gateway can read them to apply enforcement. In the Commercial edition, clients can use them for backoff and retry logic.
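
As an illustration of reading these headers, the sketch below extracts fields from a RateLimit header value and makes an allow-or-reject decision, as an external gateway script might in the OSS edition. The helper name ratelimit_field is hypothetical, and treating a missing header as "allow" is an assumption of this sketch, not documented Talos behavior.

```shell
# Hypothetical helper (not part of Talos): print one field of a RateLimit
# header value such as "limit=100, remaining=42, reset=18".
ratelimit_field() {
  printf '%s\n' "$1" | tr -d '\r' | tr ',' '\n' | sed -n "s/^ *$2=//p"
}

header='limit=100, remaining=42, reset=18'
remaining=$(ratelimit_field "$header" remaining)
reset=$(ratelimit_field "$header" reset)

# Decide whether to forward the request. If the header is absent, the
# default of 1 lets the request through (an assumption of this sketch).
if [ "${remaining:-1}" -gt 0 ]; then
  echo "allow: $remaining requests left, resets in ${reset}s"
else
  echo "reject: retry in ${reset}s"
fi
```

With the sample value from the table, this prints "allow: 42 requests left, resets in 18s".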

Behavior notes

Fail-open on limiter errors -- if the rate limiter backend is unavailable (e.g., Redis is down), verification succeeds but rate limit metadata is omitted. Limiter failures never block legitimate traffic.
Cache interaction -- rate limit checks happen after cache resolution. If a verification result is served from cache, the rate limiter is not consulted. This means cached responses do not decrement the counter.
Per-key isolation -- each key maintains its own counter. Keys do not share rate limit budgets, even if they belong to the same actor.
Policy changes -- updated policies take effect on the next cache miss. To force immediate effect, use the Cache-Control: no-cache header on verification requests.

Next steps

Rate limiting concepts -- how enforcement works in OSS vs. Commercial
Key lifecycle -- update, rotate, and revoke keys
Error handling -- error response format and retry logic
