Rafay Launches Infrastructure Templates for Generative AI
November 07, 2023

Rafay Systems announced the availability of curated infrastructure templates for Generative AI (GenAI) use cases that many enterprises are exploring today.

These templates are designed to bring together the power of Rafay’s Environment Management and Kubernetes Management capabilities, along with best-in-class tools used by developers and data scientists to extract business value from GenAI.

Rafay’s GenAI templates empower platform teams to efficiently guide GenAI technology development and utilization, and include reference source code for a variety of use cases, pre-built cloud environment templates, and Kubernetes cluster blueprints pre-integrated with the GenAI ecosystem. Customers can easily experiment with services such as Amazon Bedrock, Microsoft Azure OpenAI and OpenAI’s ChatGPT. Support for high-performance, GPU-based computing environments is built into the templates. Traditional tools used by data scientists such as Simple Linux Utility for Resource Management (SLURM), Kubeflow and MLflow are also supported.

Rafay’s GenAI templates simplify the environment setup for deploying AI applications so platform teams can take the lead in AI development, and empower developers and data scientists to harness the full potential of AI. Developers gain a competitive advantage by being able to press a button and consume an enterprise-grade AI development sandbox in a controlled self-service manner, expediting the innovation process. Rafay’s native offering continues to deliver the control and efficiency platform teams need to maintain oversight while keeping costs in check.

“As platform teams lead the charge in enabling GenAI technologies and managing traditional AI and ML applications, Rafay’s GenAI focused templates expedite the development and time-to-market for all AI applications, ranging from chatbots to predictive analysis, delivering real-time benefits of GenAI to the business,” said Mohan Atreya, Rafay Systems SVP of Product and Solutions. “Platform teams can empower developers and data scientists to move fast with their GenAI experimentation and productization, while enforcing the necessary guardrails to ensure enterprise-grade governance and control. With Rafay, any enterprise can confidently start their GenAI journey today.”
Unlocking the Full Potential of AI with a Controlled Self-Service Approach

Rafay’s GenAI templates deliver autonomy to developers and data scientists, while streamlining the integration and resource management of AI infrastructure, such as cloud environments and Kubernetes clusters. Enterprise platform teams benefit from the following capabilities:

- Self-Service Experience: Rafay allows developers and data scientists to deploy, view and manage their GenAI applications and infrastructure in isolation using self-service workflows via Rafay & Backstage.

- AI/ML Ecosystem Support: Rafay provides out-of-the-box support for LLM providers including Amazon Bedrock, Azure OpenAI and OpenAI.

- AI Applications & Source Code: Includes several GenAI and AI workbench applications with source code, such as a text summarization app and a chatbot app.

- Any Orchestration, Any Cloud: Pre-built templates for Amazon ECS, EKS/A, Microsoft AKS and Google GKE on public clouds as well as private data centers and edge locations

- Cluster and Workflow Standardization: Rafay’s Environment templates for Kubernetes blueprints allow platform teams to create a set of standard GenAI environments and make them available enterprise-wide

- Secure RBAC: Each developer, data scientist, researcher, etc. can create and destroy environments (but not templates built by platform teams) and operate them in isolation, governed by RBAC

- Integrated GPU and Kubernetes Metrics: Rafay automatically captures and aggregates both Kubernetes and GPU metrics at the controller in a multi-tenant time series database.

- Multitenancy for AI/ML Apps: It is incredibly common for enterprises to have different teams share clusters – perhaps with specific LLM resources – in an effort to save costs. Rafay’s multi-modal multi-tenancy capabilities can easily support multiple AI/ML teams on the same Kubernetes cluster.

- Chargeback & Showback: Rafay provides each isolated unit with financial metrics, including chargeback and showback, for its AI applications across private and public clouds.

- Support for Traditional AI Platforms: Rafay also supports traditional AI tools such as SLURM, Kubeflow and MLflow.
