Every successful response includes a routing_metadata object with details about how the request was routed and its cost.
| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `cost` | CostInfo | No | Cost for the request |
| `cost.cache_savings_percent` | integer | No | Cache savings as integer percentage (0-100). Present only when savings > 0. |
| `cost.cache_savings_usd` | number | No | Cache savings in USD. Present only when savings > 0. |
| `cost.usd` | number | Yes | Billable cost in USD |
| `model_canonical` | string | Yes | Canonical model ID requested |
| `provider` | string | Yes | Provider name (e.g., `fireworks_ai`, `anthropic`) |
| `provider_model_id` | string | Yes | Provider's model ID |
| `routing_strategy` | string | Yes | Strategy used for routing. Known values: `cost`, `cost-focus`, `ttft`, `ttft-focus`, `tps`, `tps-focus`, `balanced`, `custom`. `custom` is returned when explicit `routing.weights` are provided. Additional strategies may be added in future versions. |
| `throughput_tps` | number | No | Output throughput (tokens per second) |
| `ttft_ms` | number | No | Time to first token (streaming only) |
| `warnings` | StructuredWarning[] | No | Structured warnings emitted when the gateway modifies or ignores part of the request (e.g., unsupported parameters, blocked fields). |
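
As a rough illustration of how a client might read these fields, the sketch below summarizes a `routing_metadata` object from an already-parsed response body. The helper name `summarize_routing` and the sample values are hypothetical and not part of the API. It assumes the response has been decoded into a plain `dict`, and it only counts `warnings` entries because the shape of `StructuredWarning` is not described here.

```python
from typing import Any

# Strategy names listed in the field reference; treat this set as open-ended.
KNOWN_STRATEGIES = {
    "cost", "cost-focus", "ttft", "ttft-focus",
    "tps", "tps-focus", "balanced", "custom",
}


def summarize_routing(response: dict[str, Any]) -> str:
    """Build a one-line summary from a parsed response body (hypothetical helper)."""
    meta = response["routing_metadata"]

    # Required fields: index directly so a missing key fails loudly.
    parts = [
        f"{meta['model_canonical']} via {meta['provider']} ({meta['provider_model_id']})",
    ]

    strategy = meta["routing_strategy"]
    # New strategies may be added, so label unknown values instead of raising.
    suffix = "" if strategy in KNOWN_STRATEGIES else " (unrecognized)"
    parts.append(f"strategy={strategy}{suffix}")

    # Optional fields: check for presence and skip anything absent.
    cost = meta.get("cost")
    if cost is not None:
        # Assumption: cost.usd is always set whenever the cost object is present.
        parts.append(f"cost=${cost['usd']:.6f}")
        if "cache_savings_usd" in cost:
            parts.append(f"cache_savings=${cost['cache_savings_usd']:.6f}")

    if meta.get("ttft_ms") is not None:
        parts.append(f"ttft={meta['ttft_ms']}ms")
    if meta.get("throughput_tps") is not None:
        parts.append(f"throughput={meta['throughput_tps']}tps")

    warnings = meta.get("warnings") or []
    if warnings:
        parts.append(f"warnings={len(warnings)}")

    return " | ".join(parts)


# Made-up sample payload, for illustration only.
sample = {
    "routing_metadata": {
        "cost": {"usd": 0.0012},
        "model_canonical": "example-model",
        "provider": "anthropic",
        "provider_model_id": "example-provider-model",
        "routing_strategy": "balanced",
    }
}
print(summarize_routing(sample))
```

Reading the optional fields defensively means the same code handles non-streaming responses (no `ttft_ms`) and requests without cache savings, and tolerating unknown `routing_strategy` values keeps the client working if new strategies are introduced.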