Red Hat and Elastic Partner on Retrieval Augmented Generation for GenAI
May 09, 2024

Red Hat and Elastic announced an expanded collaboration to deliver next-generation search experiences supporting retrieval augmented generation (RAG) patterns using Elasticsearch as a preferred vector database solution integrated on Red Hat OpenShift AI.

With this collaboration, Red Hat and Elastic are providing enterprises with the tools they need to deliver, maintain and refine RAG solutions over time on a single, consistent platform.

As organizations face the twin demands of adding AI solutions into their operations while also minimizing risk, RAG takes center stage for integrating large language models (LLM) into business applications. RAG enables IT teams to combine the benefits of LLMs with private data stores to train models with targeted, private data without modifying the underlying model itself. Strong search retrieval is key, as prompting LLMs with the correct information using private repositories at scale can be expensive. Retrieval with role-based controls helps maintain protections around sensitive data while still using it for training general-purpose LLMs.

Red Hat OpenShift AI and Elasticsearch can help organizations get the most out of RAG at both the MLOps infrastructure and application levels. Red Hat OpenShift AI provides a trusted machine learning operations (MLOps) platform to automate, build, tune, deploy and monitor models at scale. At the same time, Elasticsearch delivers a vector database and robust hybrid search solution for scaling and extracting AI responses, with advanced search and security features to make results more applicable to end users.

Red Hat supports Elasticsearch’s tools for RAG and generative AI (GenAI) application developers using the Elasticsearch Relevance EngineTM (ESRETM), which includes built-in vector search and transformer models, enabling developers to build next-generation search with proprietary enterprise data. ESRE enables organizations to create deployments that are optimized for security using their proprietary structured and unstructured data, and enables developers to build semantic search and RAG applications using a variety of third-party machine learning (ML) models, as well as ecosystem tooling from providers including Cohere, LangChain and LlamaIndex.

Red Hat OpenShift AI paired with Elasticsearch allows for deeper and more comprehensive customer support, as well as further innovation and integration with Red Hat’s vast ecosystem of AI partners. Successful implementations of GenAI help to build trust in AI solutions, leading to greater AI adoption and, ultimately, more user choice in the AI market.

This expansion of Red Hat’s existing collaboration with Elastic exemplifies the positive impact AI can have on business applications and the broader market. By meeting enterprises where they are in their adoption of AI, Red Hat is helping them harness often underutilized data, which can be a major differentiator for organizations.

Share this

Industry News

November 21, 2024

Red Hat announced the general availability of Red Hat Enterprise Linux 9.5, the latest version of the enterprise Linux platform.

November 21, 2024

Securiti announced a new solution - Security for AI Copilots in SaaS apps.

November 20, 2024

Spectro Cloud completed a $75 million Series C funding round led by Growth Equity at Goldman Sachs Alternatives with participation from existing Spectro Cloud investors.

November 20, 2024

The Cloud Native Computing Foundation® (CNCF®), which builds sustainable ecosystems for cloud native software, has announced significant momentum around cloud native training and certifications with the addition of three new project-centric certifications and a series of new Platform Engineering-specific certifications:

November 20, 2024

Red Hat announced the latest version of Red Hat OpenShift AI, its artificial intelligence (AI) and machine learning (ML) platform built on Red Hat OpenShift that enables enterprises to create and deliver AI-enabled applications at scale across the hybrid cloud.

November 20, 2024

Salesforce announced agentic lifecycle management tools to automate Agentforce testing, prototype agents in secure Sandbox environments, and transparently manage usage at scale.

November 19, 2024

OpenText™ unveiled Cloud Editions (CE) 24.4, presenting a suite of transformative advancements in Business Cloud, AI, and Technology to empower the future of AI-driven knowledge work.

November 19, 2024

Red Hat announced new capabilities and enhancements for Red Hat Developer Hub, Red Hat’s enterprise-grade developer portal based on the Backstage project.

November 19, 2024

Pegasystems announced the availability of new AI-driven legacy discovery capabilities in Pega GenAI Blueprint™ to accelerate the daunting task of modernizing legacy systems that hold organizations back.

November 19, 2024

Tricentis launched enhanced cloud capabilities for its flagship solution, Tricentis Tosca, bringing enterprise-ready end-to-end test automation to the cloud.

November 19, 2024

Rafay Systems announced new platform advancements that help enterprises and GPU cloud providers deliver developer-friendly consumption workflows for GPU infrastructure.

November 19, 2024

Apiiro introduced Code-to-Runtime, a new capability using Apiiro’s deep code analysis (DCA) technology to map software architecture and trace all types of software components including APIs, open source software (OSS), and containers to code owners while enriching it with business impact.

November 19, 2024

Zesty announced the launch of Kompass, its automated Kubernetes optimization platform.

November 18, 2024

MacStadium announced the launch of Orka Engine, the latest addition to its Orka product line.