Solo.io Releases Gloo AI Gateway
July 10, 2024

Solo.io announced the release of Gloo AI Gateway, which is designed to help organizations accelerate AI innovation.

With Gloo AI Gateway, Solo.io is building on Gloo Gateway, an Envoy-based API gateway and ingress controller that facilitates and secures application traffic at the edge, to bring the same speed, security, and scalability to modern AI applications.

“AI Gateway is the most significant new trend in the API management and API gateway space,” said Idit Levine, CEO and founder of Solo.io. “As customers continue to adopt AI, we’re providing the ability to leverage Gloo AI Gateway to handle this additional new set of use cases in addition to their existing API management needs. With Gloo AI Gateway, we’re empowering application developers to move forward with their AI initiatives, but providing guardrails to ensure long-term success.”

Gloo AI Gateway optimizes Solo.io’s customers’ AI journey by delivering:

- Speed to deployment: Eliminates development friction, boilerplate code, and avoidable errors in applications consuming LLM APIs.

- Security and control: Protects applications, models, and data from inappropriate access and ensures safe use of AI with governance controls, auditability, and visibility into consumption.

- Scalability: Leverages advanced AI integration patterns for data augmentation and integration with cloud-native gateway capabilities to support high-volume, zero-downtime AI connectivity.

Key use cases of Gloo AI Gateway include:

- Multi-LLM provider support: Simplifies LLM access for consumers and creates centralized control, visibility, and governance across LLM providers.

- API key management: Securely stores LLM API keys as secrets and also generates API keys to map to one or more LLM provider secrets.

- Consumption control & visibility: Monitors and tracks LLM consumption efficiently with logging, analytics, and reporting features that ensure optimal resource utilization and cost efficiency.

- Prompt management: Streamlines LLM application integration with prompt templating and prompt enrichment, and uses prompt guards and data exfiltration controls to reject inappropriate requests and sanitize LLM responses for consistent governance and control.

- Retrieval augmented generation (RAG): Ensures LLM responses are grounded in accurate and relevant information, dynamically retrieved from external sources.
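
As a rough illustration of the API key management and prompt management use cases above, the following Python sketch shows how a gateway-style layer might map its own keys to provider secrets, fill a prompt template, reject a disallowed request, and redact sensitive data in a response. This is not Gloo AI Gateway code or configuration; every name here (PROVIDER_SECRETS, GATEWAY_KEYS, the guard patterns, and the functions) is a hypothetical stand-in chosen for the example.

```python
import re
from string import Template

# Hypothetical provider secrets; a real gateway would keep these in a secret store.
PROVIDER_SECRETS = {
    "openai": "sk-provider-openai",
    "mistral": "sk-provider-mistral",
}

# Hypothetical gateway-issued keys, each mapped to one or more provider secrets.
GATEWAY_KEYS = {
    "team-a-key": ["openai", "mistral"],
    "team-b-key": ["openai"],
}

# Illustrative guard patterns only; real guards would be far more thorough.
BLOCKED_REQUEST = re.compile(r"ignore (all )?previous instructions", re.IGNORECASE)
CARD_NUMBER = re.compile(r"\b\d(?:[ -]?\d){12,15}\b")

# Assumed prompt template that wraps the caller's question.
PROMPT_TEMPLATE = Template("You are a support assistant for $product.\nUser: $question")


def resolve_provider_secret(gateway_key: str, provider: str) -> str:
    """Return the provider secret if the gateway key is mapped to that provider."""
    if provider not in GATEWAY_KEYS.get(gateway_key, []):
        raise PermissionError(f"key is not mapped to provider {provider!r}")
    return PROVIDER_SECRETS[provider]


def build_prompt(question: str, product: str) -> str:
    """Reject guarded requests, then fill the prompt template."""
    if BLOCKED_REQUEST.search(question):
        raise ValueError("request rejected by prompt guard")
    return PROMPT_TEMPLATE.substitute(product=product, question=question)


def sanitize_response(text: str) -> str:
    """Redact anything that looks like a payment card number."""
    return CARD_NUMBER.sub("[REDACTED]", text)
```

In this sketch the application only ever holds a gateway-issued key, so provider secrets can be rotated or remapped in one place without touching client code.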

“At Solo.io, we are focused on meeting our customers at their ever-changing inflection point with their IT operation needs and providing them with the solutions they need to succeed,” Levine added. “Gloo AI Gateway is another example of how we’re fueling innovation for developers and enterprises around the world as AI becomes a mainstay in today’s application development.”
