All Announcements

Smarter routing capabilities!

Sticky load balancing ensures that requests sharing the same identifier are consistently routed to the same target.

This is useful for:

  • Maintaining conversation context across multiple requests
  • Ensuring consistent model behavior during A/B testing
  • Supporting session-based or user-specific routing

Read more about this here ->

sticky-load-balancing