Convoy

Convoy simplifies batch processing for AI inference. Send individual requests and Convoy automatically groups them into batches.

Convoy Cloud is live — Get started instantly with a fully managed API. Sign up at cnvy.ai to create your account and receive your API key.

How It Works

Sign up at cnvy.ai and get your API key from the dashboard
Submit requests via the /cargo/load endpoint with your API key
Convoy batches them automatically (100 requests or 1 hour, whichever comes first)
Provider processes the batch
Results delivered to your callback URL

Key Features

Multi-tenant - Projects with isolated API keys
Automatic batching - No manual batch management needed
Multiple providers - Anthropic and more
Reliable delivery - Callbacks with exponential backoff retry
Status tracking - Monitor your requests through the lifecycle

Quick Example


# Submit a request
curl -X POST https://api.cnvy.ai/cargo/load \
  -H "Content-Type: application/json" \
  -H "X-API-Key: convoy_sk_your_key_here" \
  -d '{
    "params": {
      "model": "claude-sonnet-4-5",
      "max_tokens": 1024,
      "messages": [{"role": "user", "content": "Hello"}]
    },
    "callback_url": "https://your-server.com/callback"
  }'

Response:


{
  "cargo_id": "crg_abc123",
  "status": "success",
  "message": "Cargo loaded successfully"
}

Next Steps

Getting Started - Set up your account and make your first request
Authentication - Understand projects and API keys
API Reference - Full API documentation
Concepts - Understand how Convoy works

Last updated on May 31, 2026