Rate limit support for both tokens and requests
You can now set both request-based and token-based rate limits for an API key.
Request-based limits: Control the number of requests per minute, hour, or day.
Token-based limits: Control the number of tokens consumed per minute, hour, or day.
This gives you more precise control over usage, budgets, and traffic patterns across teams and applications.
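
This note does not describe how the two limit types are enforced together, so the following is only a minimal sketch of one plausible approach. It assumes fixed time windows and assumes that a call is rejected when any configured limit would be exceeded. The names KeyLimits, DualRateLimiter, and allow are hypothetical and are not part of the product's API.

```python
from dataclasses import dataclass, field

# Window lengths in seconds for the three periods: minute, hour, day.
WINDOWS = {"minute": 60, "hour": 3600, "day": 86400}


@dataclass
class KeyLimits:
    """Hypothetical limits for one API key; a missing period means no limit."""
    requests: dict = field(default_factory=dict)  # e.g. {"minute": 60}
    tokens: dict = field(default_factory=dict)    # e.g. {"day": 1_000_000}


class DualRateLimiter:
    """Fixed-window counters for requests and tokens (illustrative only)."""

    def __init__(self, limits):
        self.limits = limits
        self.usage = {}  # (kind, period, window_index) -> amount used so far

    def allow(self, tokens, now):
        """Return True and record usage only if every configured limit has room."""
        pending = []
        for kind, amount, caps in (("requests", 1, self.limits.requests),
                                   ("tokens", tokens, self.limits.tokens)):
            for period, cap in caps.items():
                key = (kind, period, int(now // WINDOWS[period]))
                if self.usage.get(key, 0) + amount > cap:
                    return False  # rejected: nothing is recorded
                pending.append((key, amount))
        for key, amount in pending:
            self.usage[key] = self.usage.get(key, 0) + amount
        return True


# Assumed example limits: 2 requests and 1000 tokens per minute.
limiter = DualRateLimiter(KeyLimits(requests={"minute": 2}, tokens={"minute": 1000}))
print(limiter.allow(tokens=400, now=0.0))   # True
print(limiter.allow(tokens=700, now=1.0))   # False: would exceed the token limit
print(limiter.allow(tokens=500, now=2.0))   # True
print(limiter.allow(tokens=10, now=3.0))    # False: request limit already reached
print(limiter.allow(tokens=10, now=60.0))   # True: a new minute window has started
```

The sketch never evicts counters for past windows and keeps state in memory, which a real deployment would need to handle differently. A sliding window or token bucket would smooth bursts at window boundaries.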
