Skip to Content
Introduction

Convoy

Convoy simplifies batch processing for AI inference. Send individual requests and Convoy automatically groups them into batches.

Convoy Cloud is live — Get started instantly with a fully managed API. Sign up at cnvy.ai  to create your account and receive your API key.

How It Works

  1. Sign up at cnvy.ai  and get your API key from the dashboard
  2. Submit requests via the /cargo/load endpoint with your API key
  3. Convoy batches them automatically (100 requests or 1 hour, whichever comes first)
  4. Provider processes the batch
  5. Results delivered to your callback URL

Key Features

  • Multi-tenant - Projects with isolated API keys
  • Automatic batching - No manual batch management needed
  • Multiple providers - Anthropic and more
  • Reliable delivery - Callbacks with exponential backoff retry
  • Status tracking - Monitor your requests through the lifecycle

Quick Example

# Submit a request curl -X POST https://api.cnvy.ai/cargo/load \ -H "Content-Type: application/json" \ -H "X-API-Key: convoy_sk_your_key_here" \ -d '{ "params": { "model": "claude-sonnet-4-5", "max_tokens": 1024, "messages": [{"role": "user", "content": "Hello"}] }, "callback_url": "https://your-server.com/callback" }'

Response:

{ "cargo_id": "crg_abc123", "status": "success", "message": "Cargo loaded successfully" }

Next Steps

Last updated on