
Overview

The SO1 API implements rate limiting to ensure fair usage and system stability across all users. Rate limits are enforced per API key and vary by subscription tier.
Rate Limit Headers: All API responses include rate limit information in response headers to help you track your usage.

Rate Limit Tiers

SO1 API rate limits are organized by subscription tier. Higher tiers receive increased limits and priority processing.
| Tier | Requests/Minute | Requests/Hour | Requests/Day | Burst Limit |
| --- | --- | --- | --- | --- |
| Free | 10 | 300 | 5,000 | 15 |
| Starter | 60 | 2,000 | 40,000 | 100 |
| Professional | 300 | 12,000 | 200,000 | 500 |
| Enterprise | 1,000 | 50,000 | 1,000,000 | 2,000 |
| Custom | Custom | Custom | Custom | Custom |
Burst Limits: Allow short bursts of traffic above the per-minute rate. Useful for batch operations or webhook processing.
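Burst-over-steady-rate behavior like this is commonly implemented as a token bucket: the bucket holds up to the burst limit in tokens and refills at the per-minute rate. The sketch below illustrates the general technique only; it is not SO1's server-side implementation, and the capacity and refill numbers are taken from the Professional tier purely as an example.

```typescript
// Illustrative token bucket: capacity = burst limit, refill rate = per-minute limit.
// This is a sketch of how burst limiting typically works, not SO1's actual implementation.
class TokenBucket {
  private tokens: number;
  private lastRefill: number;

  constructor(
    private capacity: number,        // burst limit, e.g. 500 for Professional
    private refillPerMinute: number, // steady rate, e.g. 300 for Professional
    now: number = Date.now()
  ) {
    this.tokens = capacity;
    this.lastRefill = now;
  }

  tryConsume(now: number = Date.now()): boolean {
    // Refill proportionally to elapsed time, capped at capacity
    const elapsedMinutes = (now - this.lastRefill) / 60_000;
    this.tokens = Math.min(this.capacity, this.tokens + elapsedMinutes * this.refillPerMinute);
    this.lastRefill = now;

    if (this.tokens >= 1) {
      this.tokens -= 1;
      return true; // request allowed
    }
    return false; // request would be rejected (429)
  }
}
```

A full bucket lets a batch job fire up to the burst limit immediately; sustained traffic above the refill rate then drains the bucket and requests start failing until tokens accumulate again.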

Rate Limit Headers

Every API response includes headers indicating your current rate limit status:
```http
X-RateLimit-Limit: 300
X-RateLimit-Remaining: 287
X-RateLimit-Reset: 1678901234
X-RateLimit-Tier: professional
X-RateLimit-Burst-Remaining: 450
```

Header Reference

| Header | Description | Example |
| --- | --- | --- |
| X-RateLimit-Limit | Maximum requests allowed per minute | 300 |
| X-RateLimit-Remaining | Requests remaining in current window | 287 |
| X-RateLimit-Reset | Unix timestamp when limit resets | 1678901234 |
| X-RateLimit-Tier | Current subscription tier | professional |
| X-RateLimit-Burst-Remaining | Burst capacity remaining | 450 |
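The headers above can be collected into a typed object in one place. This helper is hypothetical (not part of any SO1 SDK); it only assumes a `Headers`-like object with a `get` method, as returned by `fetch`:

```typescript
// Hypothetical helper (not part of an SO1 SDK): parse the rate limit
// headers from the table above into a typed object.
interface RateLimitInfo {
  limit: number;
  remaining: number;
  resetAt: Date; // derived from the Unix timestamp in X-RateLimit-Reset
  tier: string;
  burstRemaining: number;
}

function parseRateLimitHeaders(headers: { get(name: string): string | null }): RateLimitInfo {
  const num = (name: string) => parseInt(headers.get(name) || '0', 10);
  return {
    limit: num('X-RateLimit-Limit'),
    remaining: num('X-RateLimit-Remaining'),
    resetAt: new Date(num('X-RateLimit-Reset') * 1000), // header value is in seconds
    tier: headers.get('X-RateLimit-Tier') || 'unknown',
    burstRemaining: num('X-RateLimit-Burst-Remaining'),
  };
}
```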

Rate Limit Exceeded Response

When you exceed your rate limit, the API returns a 429 Too Many Requests response:
```json
{
  "error": {
    "code": "RATE_LIMIT_EXCEEDED",
    "message": "Rate limit exceeded. Please retry after 42 seconds.",
    "details": {
      "limit": 300,
      "window": "1m",
      "retryAfter": 42,
      "tier": "professional"
    }
  }
}
```
Retry-After Header: Always respect the Retry-After header value (in seconds) before making subsequent requests.
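A small helper keeps this logic in one place. Per the description above, `Retry-After` carries a delay in seconds; falling back to `X-RateLimit-Reset` when the header is absent is a defensive choice in this sketch, not documented SO1 behavior:

```typescript
// Compute how long to wait after a 429, in milliseconds.
// Retry-After is a delay in seconds per the docs; the X-RateLimit-Reset
// fallback is a defensive assumption, not documented behavior.
function retryDelayMs(
  headers: { get(name: string): string | null },
  nowMs: number = Date.now()
): number {
  const retryAfter = headers.get('Retry-After');
  if (retryAfter !== null) {
    return parseInt(retryAfter, 10) * 1000;
  }
  const reset = headers.get('X-RateLimit-Reset'); // Unix timestamp in seconds
  if (reset !== null) {
    return Math.max(0, parseInt(reset, 10) * 1000 - nowMs);
  }
  return 1000; // conservative default
}
```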

Endpoint-Specific Limits

Certain resource-intensive endpoints have additional rate limits:

Control Plane API

| Endpoint Category | Additional Limit | Reason |
| --- | --- | --- |
| Agent Execution | 50 concurrent executions | Prevents resource exhaustion |
| Workflow Triggers | 100 triggers/minute | Protects workflow infrastructure |
| Bulk Operations | 1,000 items/request | Limits payload size |
| Metrics Queries | 10 queries/minute | Database protection |

Veritas API

| Endpoint Category | Additional Limit | Reason |
| --- | --- | --- |
| Prompt Testing | 20 tests/minute | LLM quota management |
| Chain Execution | 30 chains/minute | Complex operation throttling |
| Batch Refinement | 50 prompts/batch | Processing limits |

n8n API

| Endpoint Category | Additional Limit | Reason |
| --- | --- | --- |
| Workflow Execution | 100 executions/minute | n8n instance protection |
| Webhook Registration | 10 webhooks/minute | External service rate limits |
| Bulk Workflow Import | 5 imports/hour | Validation overhead |

Rate Limiting Best Practices

Always check rate limit headers in your API client and implement proactive throttling:
```typescript
// Small helper used throughout these examples
const sleep = (ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms));

async function makeAPIRequest(endpoint: string, options: RequestInit) {
  const response = await fetch(endpoint, options);

  const remaining = parseInt(response.headers.get('X-RateLimit-Remaining') || '0', 10);
  const limit = parseInt(response.headers.get('X-RateLimit-Limit') || '0', 10);

  // Slow down when less than 10% of the window remains
  if (limit > 0 && remaining < limit * 0.1) {
    console.warn(`Approaching rate limit: ${remaining}/${limit} remaining`);
    await sleep(1000); // Add a 1s delay
  }

  return response;
}
```
When receiving 429 responses, implement exponential backoff with jitter:
```typescript
// Assumes fn throws an error carrying the response status and headers on 429,
// and the sleep helper defined above.
async function retryWithBackoff<T>(
  fn: () => Promise<T>,
  maxRetries: number = 5
): Promise<T> {
  for (let attempt = 0; attempt < maxRetries; attempt++) {
    try {
      return await fn();
    } catch (error: any) {
      if (error.status === 429 && attempt < maxRetries - 1) {
        const retryAfter = parseInt(error.headers.get('Retry-After') || '1', 10);
        const jitter = Math.random() * 1000; // 0-1000ms of jitter to avoid thundering herds
        const delay = retryAfter * 1000 + jitter;

        console.log(`Rate limited. Retrying after ${Math.round(delay)}ms...`);
        await sleep(delay);
      } else {
        throw error;
      }
    }
  }
  throw new Error('Max retries exceeded');
}
```
Implement a request queue to stay within rate limits:
```typescript
class RateLimitedQueue {
  private queue: Array<() => Promise<any>> = [];
  private processing = false;
  private requestsPerMinute: number;
  private lastRequest: number = 0;

  constructor(requestsPerMinute: number) {
    this.requestsPerMinute = requestsPerMinute;
  }

  async enqueue<T>(fn: () => Promise<T>): Promise<T> {
    return new Promise((resolve, reject) => {
      this.queue.push(async () => {
        try {
          const result = await fn();
          resolve(result);
        } catch (error) {
          reject(error);
        }
      });

      if (!this.processing) {
        this.process();
      }
    });
  }

  private async process() {
    this.processing = true;

    while (this.queue.length > 0) {
      const now = Date.now();
      const timeSinceLastRequest = now - this.lastRequest;
      // Spread requests evenly: e.g. 300 req/min = one request every 200ms
      const minInterval = 60000 / this.requestsPerMinute;

      if (timeSinceLastRequest < minInterval) {
        await sleep(minInterval - timeSinceLastRequest);
      }

      const task = this.queue.shift();
      if (task) {
        this.lastRequest = Date.now();
        await task();
      }
    }

    this.processing = false;
  }
}

// Usage
const queue = new RateLimitedQueue(300); // 300 req/min for Professional tier

await queue.enqueue(() =>
  fetch('https://api.so1.io/v1/agents/execute', { ... })
);
```
Use batch endpoints to reduce API calls:
```typescript
// ❌ Bad: Multiple individual requests
for (const agentId of agentIds) {
  await fetch(`https://api.so1.io/v1/agents/${agentId}`);
}

// ✅ Good: Single batch request
const response = await fetch('https://api.so1.io/v1/agents/batch', {
  method: 'POST',
  body: JSON.stringify({ agentIds }),
  headers: { 'Content-Type': 'application/json' }
});
```
Cache API responses to reduce redundant requests:
```typescript
const cache = new Map<string, { data: any; expiry: number }>();

async function cachedRequest(url: string, ttl: number = 60000) {
  const cached = cache.get(url);

  if (cached && Date.now() < cached.expiry) {
    return cached.data;
  }

  const response = await fetch(url);
  const data = await response.json();

  cache.set(url, {
    data,
    expiry: Date.now() + ttl
  });

  return data;
}
```

Increasing Rate Limits

Upgrade Your Tier

The most straightforward way to increase your rate limits is to upgrade your subscription tier; see the Rate Limit Tiers table above for the limits at each level.

Request Custom Limits

For specialized use cases requiring higher limits:
  1. Contact Sales: Email enterprise@so1.io with your use case
  2. Provide Usage Estimates: Include expected traffic patterns
  3. Describe Architecture: Explain how you’ll handle bursts
  4. Review SLA Requirements: Discuss uptime and latency needs
Enterprise Custom Tiers: Available for organizations with >1M requests/day or specialized requirements.

Rate Limit Monitoring

Using the Dashboard

Monitor your rate limit usage in the SO1 Dashboard:
  1. Navigate to Settings → API Keys
  2. View real-time rate limit metrics per key
  3. Set up alerts for approaching limits
  4. Review historical usage patterns

Using the API

Query your current rate limit status programmatically:
```bash
curl -X GET https://api.so1.io/v1/rate-limits/status \
  -H "Authorization: Bearer YOUR_API_KEY"
```
Response:
```json
{
  "tier": "professional",
  "limits": {
    "perMinute": { "limit": 300, "remaining": 287, "resetAt": "2024-03-10T15:45:00Z" },
    "perHour": { "limit": 12000, "remaining": 11456, "resetAt": "2024-03-10T16:00:00Z" },
    "perDay": { "limit": 200000, "remaining": 189234, "resetAt": "2024-03-11T00:00:00Z" }
  },
  "burst": {
    "limit": 500,
    "remaining": 450
  }
}
```
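One use for this endpoint is checking whether a batch job fits your remaining budget before starting it. The sketch below types the response shape shown above; the `hasBudget` helper is an illustrative addition, not part of the API:

```typescript
// Typed shape of the GET /v1/rate-limits/status response shown above.
interface WindowStatus {
  limit: number;
  remaining: number;
  resetAt: string;
}

interface RateLimitStatus {
  tier: string;
  limits: { perMinute: WindowStatus; perHour: WindowStatus; perDay: WindowStatus };
  burst: { limit: number; remaining: number };
}

// Hypothetical helper: does a batch of `needed` requests fit every window?
function hasBudget(status: RateLimitStatus, needed: number): boolean {
  const { perMinute, perHour, perDay } = status.limits;
  return [perMinute, perHour, perDay].every((w) => w.remaining >= needed);
}
```

With the sample response above, a 100-request batch fits (287 per-minute requests remain), but a 300-request batch does not.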

Troubleshooting

Common Issues

| Issue | Symptom | Resolution |
| --- | --- | --- |
| Unexpected 429s | Rate limited despite low usage | Check for multiple API keys using the same account; consolidate keys |
| Burst limit exhausted | 429s after rapid requests | Implement request pacing with minimum intervals |
| Reset time confusion | Unclear when limits reset | Use the X-RateLimit-Reset Unix timestamp, not Retry-After |
| Endpoint-specific limits | 429 on a specific endpoint only | Review the endpoint-specific limits above; use alternative endpoints |
| Caching not effective | High request volume for static data | Increase cache TTL; use ETags for conditional requests |
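The last row mentions ETags. A conditional request sends the cached `ETag` back as `If-None-Match`; if the resource is unchanged the server answers `304 Not Modified` with no body. This sketch assumes the endpoint returns an `ETag` header and honors `If-None-Match`; verify that against the actual API before relying on it:

```typescript
// Sketch of ETag-based conditional requests. Assumes the endpoint
// returns an ETag header and honors If-None-Match (verify per endpoint).
const etagCache = new Map<string, { etag: string; data: unknown }>();

async function conditionalGet(url: string): Promise<unknown> {
  const cached = etagCache.get(url);
  const headers: Record<string, string> = {};
  if (cached) headers['If-None-Match'] = cached.etag;

  const response = await fetch(url, { headers });

  if (response.status === 304 && cached) {
    return cached.data; // resource unchanged; reuse the cached body
  }

  const data = await response.json();
  const etag = response.headers.get('ETag');
  if (etag) etagCache.set(url, { etag, data });
  return data;
}
```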

Getting Help

If you’re experiencing rate limiting issues: