Claude Code features work through Auriko: tool use, MCP servers, streaming, extended thinking, and prompt caching. Only LLM inference routes through Auriko; agentic operations (file I/O, bash, MCP) run locally.
Prerequisites
- Claude Code installed
- An Auriko API key
Set up Claude Code
Add 3 environment variables to your shell profile (~/.zshrc or ~/.bashrc):
ANTHROPIC_API_KEY="" must be explicitly set to empty. If it contains a value, Claude Code uses it directly against Anthropic, bypassing Auriko.Verify setup
Use different models
You can pass an Auriko model ID with the--model flag:
400 error. Browse per-model capabilities in the directory API.
Available models include:
| Model | Author | Context |
|---|---|---|
claude-sonnet-4-6 | Anthropic | 1M |
claude-opus-4-6 | Anthropic | 1M |
claude-opus-4-7 | Anthropic | 1M |
deepseek-v4-flash | DeepSeek | 1M |
deepseek-v4-pro | DeepSeek | 1M |
gemini-2.5-flash | 1M | |
gemini-2.5-pro | 1M | |
gemini-3.1-pro-preview | 1M | |
glm-5.1 | Z.AI | 200K |
grok-4.3 | xAI | 1M |
kimi-k2.5 | Moonshot | 262K |
kimi-k2.6 | Moonshot | 262K |
minimax-m2-7 | MiniMax | 205K |
minimax-m2-7-highspeed | MiniMax | 205K |
qwen-3.6-plus | Alibaba | 1M |
/model picker in interactive sessions lists only Claude tier names (Opus, Sonnet, and Haiku). To switch to a non-Claude model mid-session, type the full ID: /model deepseek-v4-flash.
You can override which model each tier maps to:
Set up Claude Agent SDK
The Claude Agent SDK is Python-only. For TypeScript, use the Anthropic SDK directly (see below). The Claude Agent SDK spawns Claude Code as a subprocess. Pass Auriko credentials throughClaudeAgentOptions.env:
To prevent filesystem settings from overriding your
env values, pass setting_sources=[] in options.Use the Anthropic SDK directly
You can point the Anthropic SDK at Auriko’s API:Configure routing
Add agateway object to the request body:
Troubleshoot
| Symptom | Cause | Fix |
|---|---|---|
| ”model may not exist or you may not have access” | Model ID isn’t exact (e.g., claude-haiku-4-5 instead of claude-haiku-4-5-20251001) | Use the full model ID from GET /v1/models |
| Requests go to Anthropic directly, not Auriko | ANTHROPIC_API_KEY contains a value | Set ANTHROPIC_API_KEY="" (empty string, not unset) |
| “Invalid API Key” or auth errors | Cached Anthropic OAuth credentials | Run claude auth logout, then verify env vars are set |
| Requests hang or timeout | ANTHROPIC_BASE_URL includes /v1 | Use https://api.auriko.ai only |
| ”does not support reasoning/extended thinking” | Claude Code requires reasoning support but this model doesn’t have it | Use a reasoning-capable model (see “Use different models” above) |
apiKeySource: none in session events | Claude Code doesn’t classify ANTHROPIC_AUTH_TOKEN as a key source | Expected behavior. Requests authenticate correctly |