Convoy
Convoy simplifies batch processing for AI inference. Send individual requests and Convoy automatically groups them into batches.
Convoy Cloud is live — Get started instantly with a fully managed API. Sign up at cnvy.ai to create your account and receive your API key.
How It Works
- Sign up at cnvy.ai and get your API key from the dashboard
- Submit requests via the
/cargo/loadendpoint with your API key - Convoy batches them automatically (100 requests or 1 hour, whichever comes first)
- Provider processes the batch
- Results delivered to your callback URL
Key Features
- Multi-tenant - Projects with isolated API keys
- Automatic batching - No manual batch management needed
- Multiple providers - Anthropic and more
- Reliable delivery - Callbacks with exponential backoff retry
- Status tracking - Monitor your requests through the lifecycle
Quick Example
# Submit a request
curl -X POST https://api.cnvy.ai/cargo/load \
-H "Content-Type: application/json" \
-H "X-API-Key: convoy_sk_your_key_here" \
-d '{
"params": {
"model": "claude-sonnet-4-5",
"max_tokens": 1024,
"messages": [{"role": "user", "content": "Hello"}]
},
"callback_url": "https://your-server.com/callback"
}'Response:
{
"cargo_id": "crg_abc123",
"status": "success",
"message": "Cargo loaded successfully"
}Next Steps
- Getting Started - Set up your account and make your first request
- Authentication - Understand projects and API keys
- API Reference - Full API documentation
- Concepts - Understand how Convoy works
Last updated on