Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.auriko.ai/llms.txt

Use this file to discover all available pages before exploring further.

Rate limits scale with your plan. See Plans and billing for tier thresholds, requests-per-minute (RPM) limits, and monthly caps.

Prerequisites

Check rate limit headers

Chat completion responses include request-level rate limit headers:
HeaderDescription
Retry-AfterSeconds until rate limit resets; present on 429 responses only (RFC 7231)
X-RateLimit-Limit-RequestsRequests allowed per window
X-RateLimit-Remaining-RequestsRequests remaining in current window
X-RateLimit-Reset-RequestsISO 8601 timestamp when the window resets

Handle 429 responses

When you exceed a rate limit, the API returns a 429 Too Many Requests response with a Retry-After header:
{
  "error": {
    "message": "Rate limit exceeded. Retry after 12 seconds.",
    "type": "rate_limit_error",
    "param": null,
    "code": "rate_limit_exceeded",
    "doc_url": "https://docs.auriko.ai/errors/rate_limit_exceeded"
  }
}
The Auriko SDK (Python, TypeScript) retries failed requests with exponential backoff (up to 2 retries by default). For manual handling, see Error handling — Retry manually.