Announcements

Stay informed about Portkey's newest releases, features, and improvements

  1. New Integration: NVIDIA's Triton Inference Server on AI Gateway

    Announcement

    🎉 Exciting news!

    Portkey now supports NVIDIA's Triton Inference Server.

    Big thanks to Tapan Sharma for contributing this integration to Portkey's open-source AI Gateway.

    Triton is NVIDIA's inference server that makes AI model deployment lightning-fast and scalable across multiple GPUs and CPUs.

     Link to docs 

  2. New Integration: MindsDB 🤝 Portkey AI

    We're excited to share that MindsDB now integrates natively with Portkey!

    Easily connect your databases, vector stores, and apps to 250+ LLMs with enterprise-grade monitoring and reliability built-in.

     

    Transform your existing data infrastructure into advanced, AI-powered systems today! 🚀  

     Link to docs 

  3. 🚨 OpenAI Assistants API v1 Deprecation Alert!

    Announcement

    OpenAI's Assistants API v1 beta shuts down by Dec 18, 2024. It's time to migrate to v2 beta, which offers:

    - 500x more file capacity

    - Faster processing

    - Enhanced search with reranking

    - Customizable chunking

    - Query rewriting

    If you are using the Assistants API, make sure you migrate to the v2 beta API before the deadline.
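
For direct HTTP calls, the Assistants API version is selected with the `OpenAI-Beta` request header, so one concrete migration step is sending `assistants=v2` instead of `assistants=v1`. The sketch below only builds the headers; the helper name `assistants_headers` and the placeholder key are hypothetical, and recent official OpenAI SDK versions are expected to set this header for you.

```python
def assistants_headers(api_key: str, version: str = "v2") -> dict[str, str]:
    """Build HTTP headers for a direct Assistants API request.

    `assistants_headers` is an illustrative helper, not part of any SDK.
    """
    if version not in {"v1", "v2"}:
        raise ValueError(f"unknown Assistants API version: {version!r}")
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
        # This header selects the beta version of the Assistants API.
        "OpenAI-Beta": f"assistants={version}",
    }


headers = assistants_headers("sk-placeholder")  # placeholder key, not a real one
print(headers["OpenAI-Beta"])
```

Changing the header is only part of the move: v2 also renames the `retrieval` tool to `file_search` and attaches files through `tool_resources`, so request bodies may need updating as well.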

     

  4. Partnering with MongoDB

    Announcement

    We're thrilled to share that Portkey is now an official MongoDB partner, helping you take your AI apps to production. With Portkey + MongoDB, you get:

    → Streamlined data management

    → Enhanced LLM API control

    → Faster prototype-to-production pipeline

    → Improved security and scalability

    → Cost-effective AI operations

    All of this comes with secure, scalable, and private deployments to propel your AI apps into production.

    Read more on the partnership here: https://cloud.mongodb.com/ecosystem/portkey-ai 

  5. Cut RAG Retrieval Failures by up to 67% using Contextual Retrieval with Portkey's AI Gateway

    Anthropic recently released a guide on Contextual Retrieval, a method that cuts RAG retrieval failures by up to 67%.

    🔥 We've adapted this powerful technique to work with Portkey's AI Gateway.

    Now you can easily implement Contextual Retrieval across 250+ language models with just one interface.

    Key benefits of Portkey's integration:

    1. Access to multiple language models through a single API
    2. Simplified key management process
    3. Effortless switching between providers

     

     Link to the cookbook 
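
The core step of Contextual Retrieval is to prepend a short, chunk-specific context (written by an LLM that sees the whole document) to each chunk before it is embedded or indexed. Below is a minimal sketch of that step only, assuming a pluggable `generate_context` callable. The names `ContextualChunk`, `contextualize_chunks`, and `stub_context`, along with the sample document, are hypothetical and are not part of Portkey's or Anthropic's API; in practice the callable would wrap a chat completion request sent through the gateway.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ContextualChunk:
    original: str      # chunk text as split from the document
    context: str       # short description situating the chunk in the document
    indexed_text: str  # what gets embedded or indexed: context + chunk


def contextualize_chunks(
    document: str,
    chunks: List[str],
    generate_context: Callable[[str, str], str],
) -> List[ContextualChunk]:
    """Prepend a generated context to every chunk before indexing."""
    results = []
    for chunk in chunks:
        # The generator sees the full document plus the chunk to situate.
        context = generate_context(document, chunk).strip()
        results.append(ContextualChunk(chunk, context, f"{context}\n\n{chunk}"))
    return results


def stub_context(document: str, chunk: str) -> str:
    # Stand-in for an LLM call (assumption): reuses the document's first line.
    title = document.splitlines()[0]
    return f"From the document '{title}'."


# Made-up sample data, for illustration only.
doc = "ACME Q2 2023 report\nRevenue grew 3% over the previous quarter."
chunks = ["Revenue grew 3% over the previous quarter."]

contextual = contextualize_chunks(doc, chunks, stub_context)
print(contextual[0].indexed_text)
```

Keeping the generator as a plain callable means the model behind it can be swapped without touching the chunking or indexing code, which is where a single gateway interface helps.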

  6. OpenAI o1-preview and o1-mini on Portkey

    New Feature

    OpenAI's new o1-preview and o1-mini models are supported on Portkey.

    Compared to gpt-4o, these models spend more time reasoning before they respond, and they answer questions like "how many r's in the word strawberry?" and "how many words in your output?" exceptionally well.

    o1 models perform significantly better on maths, science, puzzle solving, and coding tasks.

     

    Try them on Portkey:

    OpenAI-o1

     
