Rate limit support for both tokens and requests

You can now set rate limits on both tokens and requests for an API key.

Request-based limits: Control the number of requests per minute, hour, or day.

Token-based limits: Control the number of tokens consumed per minute, hour, or day.

This gives you more precise control over usage, budgets, and traffic patterns across teams and applications.
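
To illustrate how the two kinds of limit interact, here is a minimal sketch of a limiter that checks a request count and a token count against per-minute, per-hour, or per-day caps. It is not the product's implementation or API: the `DualLimiter` class, its `allow` method, and the fixed-window counting scheme are all hypothetical, chosen only to make the idea concrete.

```python
from dataclasses import dataclass, field

# Window lengths in seconds for the three periods named in the announcement.
WINDOW_SECONDS = {"minute": 60, "hour": 3600, "day": 86400}


@dataclass
class DualLimiter:
    """Hypothetical fixed-window limiter for one API key (illustration only)."""

    request_limits: dict  # e.g. {"minute": 2, "day": 1000}
    token_limits: dict    # e.g. {"minute": 100}
    _usage: dict = field(default_factory=dict)

    def allow(self, tokens: int, now: float) -> bool:
        """Return True and record usage if the call fits every configured limit."""
        pending = []
        for kind, limits, cost in (
            ("requests", self.request_limits, 1),
            ("tokens", self.token_limits, tokens),
        ):
            for window, limit in limits.items():
                # Each window is identified by its index since time zero.
                bucket = int(now // WINDOW_SECONDS[window])
                key = (kind, window, bucket)
                if self._usage.get(key, 0) + cost > limit:
                    return False  # rejected calls are not counted
                pending.append((key, cost))
        # Commit usage only after every limit has been checked.
        for key, cost in pending:
            self._usage[key] = self._usage.get(key, 0) + cost
        return True


limiter = DualLimiter(request_limits={"minute": 2}, token_limits={"minute": 100})
first = limiter.allow(60, now=0.0)   # fits both limits
second = limiter.allow(50, now=1.0)  # would exceed the token limit
```

In this sketch a call is accepted only when it fits every configured limit, so a key can be throttled by its token budget even when it is well under its request cap, and the reverse. A production limiter would also need to expire old windows and handle concurrent callers, which this example leaves out.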
