Skip to main content
Auriko’s POST /v1/chat/completions endpoint accepts the parameters below. The endpoint is OpenAI-compatible with Auriko-specific extensions for routing and metadata.
Not all providers support every optional parameter. Auriko drops unsupported ones and includes a warning in routing_metadata.warnings.
To require full parameter support, set require_parameters to true. Check which parameters each provider supports via the model directory endpoint.

Supported parameters

Auriko accepts and forwards 32 parameters to providers.
ParameterTypeRequiredDescription
extensionsExtensionsNo
frequency_penaltynumberNoFrequency penalty (-2 to 2). Not supported by all providers.
gatewayobjectConditionalAuriko routing, metadata, and multi-model configuration. Omit for default single-model routing.
logit_biasobjectNoToken logit bias. Not supported by all providers.
logprobsbooleanNoReturn log probabilities. Not supported by all providers.
max_completion_tokensintegerNoMaximum tokens to generate. Reasoning models (o1/o3) use this field instead of max_tokens.
max_tokensintegerNoMaximum tokens to generate (legacy, use max_completion_tokens)
messagesMessage[]YesThe messages to generate a completion for
min_pnumberNoMin-P sampling. Supported by some vLLM providers.
modelstringConditionalModel to route to, required for single-model requests.
nintegerNoNumber of completions to generate. Not supported by all providers.
parallel_tool_callsbooleanNoAllow parallel tool calls
presence_penaltynumberNoPresence penalty (-2 to 2). Not supported by all providers.
prompt_cache_keystringNoPrompt caching identifier. Supported by OpenAI.
reasoning_effortstringNoControls reasoning effort for supported models.
repetition_penaltynumberNoRepetition penalty. Supported by vLLM providers.
response_formatResponseFormatNo
safety_identifierstringNoSafety policy identifier. Supported by OpenAI.
seedintegerNoRandom seed for reproducibility
stopstring | arrayNoStop sequences. Restrictions vary by provider and model.
streambooleanNoEnable streaming responses
stream_optionsStreamOptionsNo
temperaturenumberNoSampling temperature (0-2). Some providers restrict this value when reasoning is enabled.
tool_choiceToolChoiceNo
toolsTool[]NoTools the model can call
top_anumberNoTop-A sampling. Supported by some vLLM providers.
top_kintegerNoTop-K sampling. Restricted by some providers when reasoning is enabled. Supported by Anthropic, Google, and vLLM.
top_logprobsintegerNoNumber of top logprobs to return. Requires logprobs support.
top_pnumberNoNucleus sampling parameter. Some providers restrict this value when reasoning is enabled.
userstringNoUser identifier for abuse detection
verbositystringNoOutput verbosity control. Supported by OpenAI.
web_search_optionsobjectNoWeb search configuration. Supported by OpenAI.

Deprecated with auto-transform

These legacy OpenAI parameters are accepted and converted to their modern equivalents.
ParameterTransforms ToReason
function_calltool_choicedeprecated by OpenAI, converted to tool_choice
functionstoolsdeprecated by OpenAI, converted to tools

Accepted and dropped

These parameters are accepted but not forwarded to providers. A warning is included in routing_metadata.warnings.
ParameterWarning Message
metadataunsupported_field: metadata
predictionunsupported_field: prediction
service_tierunsupported_field: service_tier (use gateway.routing.optimize instead)
storeunsupported_field: store

Rejected with error

These parameters cause the request to fail with an error response.
ParameterStatusError CodeConditionMessage
audio400unsupported_modalitiesalwaysaudio output is not supported
modalities400unsupported_modalitieswhen requesting audio outputaudio output is not supported