Smarter routing capabilities!
Sticky load balancing ensures that requests sharing the same identifier are consistently routed to the same target.
This is useful for:
- Maintaining conversation context across multiple requests
- Ensuring consistent model behavior during A/B testing
- Supporting session-based or user-specific routing
Read more about this here ->
