Skip to Content
ConceptsSupported ModelsOverview

Supported Models

Convoy supports a wide range of models. Use the Convoy Model ID in your requests — Convoy handles routing to the underlying provider automatically.

Building or testing an integration? Use convoy-mock — a free, synthetic model that returns a callback in ~60 seconds and is never billed. It runs the entire batch pipeline so you can verify your wiring without burning real credits, then swap to a production model below when you’re ready.

Testing

  • convoy-mock — free, synthetic, ~60s callback round-trip — use this for all integration testing

Browse by Provider

  • Anthropic — Claude 3, 3.5, 3.7, Sonnet 4 / 4.5 / 4.6, Opus 4.5 / 4.6, Haiku 4.5, plus extended-context (*-long) variants
  • Amazon Nova — Nova Micro, Lite, Pro, Premier, Nova 2 Lite, Nova 2 Pro / Omni (Preview), and Titan Text Embeddings v2
  • Meta Llama — Llama 3.1, 3.2, 3.3, Llama 4 Maverick / Scout
  • Mistral AI — Mistral Large 2407 / 3, Devstral 2, Magistral, Ministral 3, Voxtral
  • DeepSeek — DeepSeek V3.1, V3.2
  • Google — Gemma 3 (4B, 12B, 27B)
  • MiniMax — MiniMax M2, M2.1
  • NVIDIA — Nemotron Nano 9B v2, 12B v2, Nano 3 30B
  • Moonshot AI — Kimi K2 Thinking, K2.5
  • OpenAI — GPT OSS 20B / 120B, GPT OSS Safeguard 20B / 120B
  • Qwen — Qwen3 32B, 235B, Coder, Coder Next, Next 80B, VL
  • Z.AI — GLM 4.7, GLM 4.7 Flash

Example Request

curl -X POST https://api.cnvy.ai/cargo/load \ -H "Content-Type: application/json" \ -H "X-API-Key: convoy_sk_your_key_here" \ -d '{ "params": { "model": "claude-3-haiku" }, "records": [ { "recordId": "rec_001", "modelInput": { "max_tokens": 1024, "messages": [ { "role": "user", "content": "Summarize this document." } ] } } ] }'
Last updated on