Reference

Rate limits

Rate limits are applied per workspace and per model.

Default limits during beta:

  • Free tier: 60 req/min, 10 concurrent jobs
  • Basic: 120 req/min, 20 concurrent jobs
  • Pro: 300 req/min, 40 concurrent jobs
  • Creator: 600 req/min, 80 concurrent jobs
  • Enterprise: custom

Exceeding a limit returns HTTP 429 with Retry-After and X-RateLimit-Remaining headers. Per-model concurrency is also enforced upstream at the provider; we retry with backoff on 429 responses from providers, so your call sees a provider 429 only if it persists.
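A client can honor the Retry-After header rather than retrying immediately. The following is a minimal sketch, not an official client: `call_with_retry`, `retry_after_seconds`, and the `send` callable (which returns a status, headers, and body tuple) are hypothetical names introduced for illustration, and the one-second fallback and five-attempt cap are assumptions, not documented values.

```python
import time
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from typing import Callable, Mapping, Tuple

# Hypothetical response shape: (status code, headers, body).
Response = Tuple[int, Mapping[str, str], bytes]


def retry_after_seconds(headers: Mapping[str, str], default: float = 1.0) -> float:
    """Return the wait suggested by Retry-After, or `default` if absent or unparseable."""
    lowered = {k.lower(): v for k, v in headers.items()}  # header names are case-insensitive
    value = lowered.get("retry-after")
    if value is None:
        return default
    try:
        return max(0.0, float(value))  # delay-seconds form, e.g. "2"
    except ValueError:
        pass
    try:
        when = parsedate_to_datetime(value)  # HTTP-date form
    except (TypeError, ValueError):
        return default
    if when.tzinfo is None:
        when = when.replace(tzinfo=timezone.utc)
    return max(0.0, (when - datetime.now(timezone.utc)).total_seconds())


def call_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 5,  # assumed cap, not a documented value
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call `send`, waiting and retrying while it returns 429, up to `max_attempts` calls."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        status, headers, body = send()
        if status != 429 or attempt == max_attempts:
            # Success, a non-rate-limit error, or a 429 that persisted.
            return status, headers, body
        sleep(retry_after_seconds(headers))
    raise AssertionError("unreachable")
```

Passing `sleep` as a parameter keeps the sketch testable without real delays. A persistent 429 is returned to the caller after the final attempt, so the caller can decide whether to queue the job or surface the error.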

Last updated 2026-04-17