Price Comparison API

GET /v1/prices

Public endpoint — no authentication required. Returns a price comparison matrix across all providers for every model in the marketplace.

Query Parameters

Name	Type	Description
`model`	string	Filter to a single model (e.g. claude-opus-4.6)

Examples

bash

curl https://api.surplusintelligence.ai/v1/prices

bash

curl "https://api.surplusintelligence.ai/v1/prices?model=claude-opus-4.6"

json

{
  "models": [
    {
      "model": "claude-opus-4.6",
      "displayName": "Claude Opus 4.6",
      "provider": "Anthropic",
      "contextLength": 1000000,
      "providers": {
        "venice": { "input": 6.00, "output": 30.00, "cacheRead": 0.60 },
        "openrouter": { "input": 5.00, "output": 25.00, "cacheRead": 0.50, "cacheWrite": 6.25 },
        "anthropic": { "input": 5.00, "output": 25.00, "cacheRead": 0.50, "cacheWrite": 6.25 },
        "bankr": { "input": 5.00, "output": 25.00, "cacheRead": 0.50 }
      },
      "cheapest": { "provider": "openrouter", "input": 5.00, "output": 25.00 }
    }
  ],
  "providers": ["venice", "openrouter", "anthropic", "deepseek", "bankr"],
  "updated_at": "2026-04-14T11:00:00.000Z"
}
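
The two curl calls above can also be made from a script. The following is a minimal Python sketch using only the standard library; the helper names `build_url` and `fetch_prices` and the 10-second timeout are illustrative assumptions, not part of the API.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional

BASE_URL = "https://api.surplusintelligence.ai/v1/prices"  # taken from the curl examples


def build_url(model: Optional[str] = None) -> str:
    """Return the endpoint URL, adding ?model=... only when a filter is given."""
    if model is None:
        return BASE_URL
    return BASE_URL + "?" + urllib.parse.urlencode({"model": model})


def fetch_prices(model: Optional[str] = None, timeout: float = 10.0) -> dict:
    """GET the price matrix and decode the JSON body (no auth header is sent)."""
    with urllib.request.urlopen(build_url(model), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_prices("claude-opus-4.6")` should return a dictionary shaped like the sample response, with `models`, `providers`, and `updated_at` keys.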

Fields

Model Entry

Name	Type	Description
`model`	string	Standard marketplace model name
`displayName`	string	Human-readable name
`provider`	string	Model creator (Anthropic, OpenAI, Google, etc.)
`contextLength`	number	Maximum context window in tokens
`providers`	object	Price data keyed by provider
`cheapest`	object	Cheapest provider for this model

Price Entry (per provider)

All prices in USD per 1M tokens.

Name	Type	Description
`input`	number	Input/prompt token price
`output`	number	Output/completion token price
`cacheRead`	number	Cache hit price (optional)
`cacheWrite`	number	Cache creation price (optional)
`reasoning`	number	Thinking/reasoning token price (optional)
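
Because prices are quoted per 1M tokens, a per-request estimate needs a division by 1,000,000. The sketch below shows one way to do that in Python. The function name `estimate_cost` and the handling of cached tokens are assumptions for illustration: cached tokens are treated as a subset of the input tokens and are billed at `cacheRead` when the provider lists it, otherwise at the normal `input` price.

```python
from typing import Mapping

TOKENS_PER_UNIT = 1_000_000  # prices are quoted in USD per 1M tokens


def estimate_cost(price: Mapping[str, float], input_tokens: int,
                  output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request from a per-provider price entry.

    Assumption: cached_tokens is the part of input_tokens served from cache.
    It is billed at cacheRead when present, else at the input price.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    cache_rate = price.get("cacheRead", price["input"])
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * price["input"]
             + cached_tokens * cache_rate
             + output_tokens * price["output"])
    return total / TOKENS_PER_UNIT


# Price entry copied from the sample response (openrouter).
openrouter = {"input": 5.00, "output": 25.00, "cacheRead": 0.50, "cacheWrite": 6.25}
print(estimate_cost(openrouter, input_tokens=1_000_000, output_tokens=500_000))  # 17.5
```

The optional `cacheWrite` and `reasoning` prices are left out of this sketch, since how a given provider meters those tokens is not described here.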

Cheapest

Name	Type	Description
`provider`	string	Provider key with the lowest weighted cost
`input`	number	Input price at that provider
`output`	number	Output price at that provider

The cheapest provider is the one with the lowest weighted cost, defined as input + 3 × output. Weighting the output price three times as heavily approximates a typical chat workload where output tokens outnumber input tokens.
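
The weighted-cost rule can be reproduced on the client side, for example to rank every provider instead of reading only the winner. This is a minimal Python sketch; `weighted_cost` and `pick_cheapest` are hypothetical helper names, and the tie-breaking rule (first provider in iteration order) is an assumption, since the text does not say how the API resolves ties.

```python
from typing import Dict, Mapping

OUTPUT_WEIGHT = 3  # weight on the output price in the weighted-cost formula


def weighted_cost(price: Mapping[str, float]) -> float:
    """Weighted cost of one price entry: input + 3 * output."""
    return price["input"] + OUTPUT_WEIGHT * price["output"]


def pick_cheapest(providers: Mapping[str, Mapping[str, float]]) -> Dict[str, object]:
    """Build a `cheapest`-shaped object from a `providers` mapping.

    Assumption: ties go to the first provider in iteration order.
    """
    key = min(providers, key=lambda name: weighted_cost(providers[name]))
    return {"provider": key,
            "input": providers[key]["input"],
            "output": providers[key]["output"]}


# `providers` object copied from the sample response.
providers = {
    "venice": {"input": 6.00, "output": 30.00, "cacheRead": 0.60},
    "openrouter": {"input": 5.00, "output": 25.00, "cacheRead": 0.50, "cacheWrite": 6.25},
    "anthropic": {"input": 5.00, "output": 25.00, "cacheRead": 0.50, "cacheWrite": 6.25},
    "bankr": {"input": 5.00, "output": 25.00, "cacheRead": 0.50},
}
print(pick_cheapest(providers))
# {'provider': 'openrouter', 'input': 5.0, 'output': 25.0}
```

In the sample data venice scores 6.00 + 3 × 30.00 = 96.00, while openrouter, anthropic, and bankr all score 5.00 + 3 × 25.00 = 80.00, so the result depends on the tie-breaking assumption noted above.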

Caching

The response is cached for 1 hour (ISR). The `updated_at` field indicates when the snapshot was generated.
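
A client that cares about freshness can compare `updated_at` with the current time. The Python sketch below is one way to do so; the helper names are illustrative, and it assumes the timestamp is always ISO 8601 in UTC with a trailing `Z`, as in the sample response.

```python
from datetime import datetime, timedelta, timezone

CACHE_TTL = timedelta(hours=1)  # cache lifetime stated in this section


def parse_updated_at(value: str) -> datetime:
    """Parse an ISO 8601 UTC timestamp such as 2026-04-14T11:00:00.000Z."""
    # Swap the trailing "Z" for an explicit offset so that older Python
    # versions of datetime.fromisoformat accept the string.
    if value.endswith("Z"):
        value = value[:-1] + "+00:00"
    return datetime.fromisoformat(value)


def snapshot_age(updated_at: str, now: datetime) -> timedelta:
    """Return how old the snapshot is relative to a timezone-aware `now`."""
    return now - parse_updated_at(updated_at)


def is_past_ttl(updated_at: str, now: datetime) -> bool:
    """True when the snapshot is older than the one-hour cache lifetime."""
    return snapshot_age(updated_at, now) > CACHE_TTL


now = datetime(2026, 4, 14, 11, 30, tzinfo=timezone.utc)
print(snapshot_age("2026-04-14T11:00:00.000Z", now))  # 0:30:00
```

In real use, `now` would be `datetime.now(timezone.utc)`; a fixed value is used here so the example is reproducible.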

Providers Tracked

The live price response currently includes 10 provider pricing sources. The seller allowlist has 11 provider domains; OpenAI direct is allowlisted but may not appear in the live price matrix if no current pricing rows are available.

Key	Provider
`venice`	Venice AI
`openrouter`	OpenRouter
`anthropic`	Anthropic (direct)
`openai`	OpenAI (direct)
`google`	Google (Gemini)
`deepseek`	DeepSeek (direct)
`mistral`	Mistral AI
`groq`	Groq
`together`	Together AI
`fireworks`	Fireworks AI
`bankr`	Bankr
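
Since an allowlisted provider may be absent from the live matrix, a client may want to check which keys are missing before relying on them. The following Python sketch compares the keys in the table above with the `providers` array of a response; `missing_providers` is a hypothetical helper, and the sample array is the one from the example response, which lists fewer keys than the live count quoted above.

```python
from typing import Iterable, List

# Provider keys from the table above (the 11 allowlisted providers).
ALLOWLISTED = [
    "venice", "openrouter", "anthropic", "openai", "google", "deepseek",
    "mistral", "groq", "together", "fireworks", "bankr",
]


def missing_providers(live_keys: Iterable[str]) -> List[str]:
    """Return allowlisted keys absent from a response's `providers` array."""
    live = set(live_keys)
    return [key for key in ALLOWLISTED if key not in live]


# `providers` array copied from the sample response.
sample = ["venice", "openrouter", "anthropic", "deepseek", "bankr"]
print(missing_providers(sample))
# ['openai', 'google', 'mistral', 'groq', 'together', 'fireworks']
```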

Interactive Price Table

See the live price comparison API for the current machine-readable matrix. Treat that response as the source of truth for current provider rows and counts.