Models & Providers
Surplus Intelligence has two related but different counts:
- Catalog models:
GET /v1/modelsreturns every model currently known to the SI registry. Current live snapshot: 145 catalog models. - Active marketplace models: the Market Prices page and
GET /api/marketsshow models with active seller offers. Current live snapshot: 76 active marketplace models.
Current catalog breakdown from a recent live /v1/models snapshot:
- text and vision-capable chat/completion models
- image-generation models
- text-to-speech and speech-to-text models
- embedding models
- video-generation models backed by the async Venice video adapter
- music/audio-generation models backed by the async Venice audio adapter
The live /v1/prices endpoint tracks pricing data across 10 provider pricing sources. The seller allowlist has 11 provider domains. Active production offers are currently concentrated on Venice and Bankr LLM Gateway; other allowed providers are supported by the routing/normalization layer but may not have active seller liquidity at all times.
Use these endpoints for current counts instead of copying numbers from docs:
GET https://api.surplusintelligence.ai/v1/models
GET https://api.surplusintelligence.ai/api/markets
GET https://api.surplusintelligence.ai/v1/prices
Supported Provider Domains
Sellers may list OpenAI-compatible endpoints from the provider allowlist. Native non-OpenAI providers are only supported where the router has an adapter/translator.
- Venice AI —
api.venice.ai— active production liquidity - Bankr LLM Gateway —
llm.bankr.bot— active production liquidity - OpenRouter —
openrouter.ai - OpenAI —
api.openai.com - Anthropic native API —
api.anthropic.com— OpenAI ⇄ Anthropic translation path exists, but no active native Anthropic offers are currently listed in production - Together AI —
api.together.xyz - Fireworks AI —
api.fireworks.ai - DeepSeek —
api.deepseek.com - Mistral AI —
api.mistral.ai - Groq —
api.groq.com - Google Gemini native API —
generativelanguage.googleapis.com
Model Name Resolution
Request models using any of these formats — the marketplace normalizes automatically:
| Format | Example |
|---|---|
| Canonical (recommended) | claude-opus-4.6 |
| OpenRouter format | anthropic/claude-opus-4.6 |
| Venice internal | claude-opus-4-6 |
| Common aliases | gemini-3-flash → gemini-3-flash-preview |
Live Model Catalog
GET /v1/models
Returns all catalog models with pricing, context lengths, capabilities, and provider mappings in an OpenRouter-compatible format. For tool-calling compatibility, check each model's supported_parameters; tools must be present for tool calls to be forwarded.
Cross-Provider Translation
The marketplace accepts buyer requests in OpenAI format. OpenAI-compatible sellers receive OpenAI-format requests after model-name and tool-format normalization. Native provider APIs require translation.
Native Anthropic Translation
When a request routes to an Anthropic-format seller (api.anthropic.com), the following translations are applied:
role: "system"/role: "developer"→ Anthropicsystemparameter- User messages with
image_url→ Anthropic image content blocks (base64 + URL) toolsarray → Anthropic tool format (name,description,input_schema)tool_choice→ Anthropic tool choice (auto,tool)- Assistant messages with
tool_calls→ Anthropictool_usecontent blocks role: "tool"messages → Anthropictool_resultcontent blocksmax_tokens/max_completion_tokens→ Anthropicmax_tokensstop→stop_sequences
tool_usecontent blocks → OpenAItool_callsarrayend_turn→finish_reason: "stop"tool_usestop reason →finish_reason: "tool_calls"input_tokens/output_tokens→prompt_tokens/completion_tokens
content_block_deltawith text →choices[0].delta.contentcontent_block_startwithtool_use→choices[0].delta.tool_calls[i]with function name and idinput_json_delta→choices[0].delta.tool_calls[i].function.argumentschunksmessage_deltawith usage → final usage chunk
This translation layer is implemented and covered by tests, including multi-turn function-tool transcripts. It is not the primary active production path today because current Claude seller liquidity is mainly through OpenAI-compatible providers. Anthropic-specific beta features and non-function computer-use tools are not guaranteed.
Media Models
Media models (images, TTS, STT, embeddings, video, music/audio) use provider-specific endpoints rather than /chat/completions. The marketplace maps standard model names to provider-specific IDs via the media model registry.
Images are synchronous. Video and music/audio are asynchronous jobs: submit to /v1/video/generations or /v1/music/generations, then poll the job URL until terminal. Seller onboarding validates video with Venice /video/quote and music/audio with Venice /audio/quote, so health checks avoid full paid generations.
Sellers listing Venice as a provider will see media models (Whisper, TTS, Flux, Runway/Seedance/PixVerse video, ACE-Step/ElevenLabs/MiniMax/Stable Audio, etc.) in the model discovery flow alongside text models.