Red Hat and Elastic Partner on Retrieval Augmented Generation for GenAI
May 09, 2024

Red Hat and Elastic announced an expanded collaboration to deliver next-generation search experiences supporting retrieval augmented generation (RAG) patterns using Elasticsearch as a preferred vector database solution integrated on Red Hat OpenShift AI.

With this collaboration, Red Hat and Elastic are providing enterprises with the tools they need to deliver, maintain and refine RAG solutions over time on a single, consistent platform.

As organizations face the twin demands of adding AI solutions into their operations while also minimizing risk, RAG takes center stage for integrating large language models (LLM) into business applications. RAG enables IT teams to combine the benefits of LLMs with private data stores to train models with targeted, private data without modifying the underlying model itself. Strong search retrieval is key, as prompting LLMs with the correct information using private repositories at scale can be expensive. Retrieval with role-based controls helps maintain protections around sensitive data while still using it for training general-purpose LLMs.

Red Hat OpenShift AI and Elasticsearch can help organizations get the most out of RAG at both the MLOps infrastructure and application levels. Red Hat OpenShift AI provides a trusted machine learning operations (MLOps) platform to automate, build, tune, deploy and monitor models at scale. At the same time, Elasticsearch delivers a vector database and robust hybrid search solution for scaling and extracting AI responses, with advanced search and security features to make results more applicable to end users.

Red Hat supports Elasticsearch’s tools for RAG and generative AI (GenAI) application developers using the Elasticsearch Relevance EngineTM (ESRETM), which includes built-in vector search and transformer models, enabling developers to build next-generation search with proprietary enterprise data. ESRE enables organizations to create deployments that are optimized for security using their proprietary structured and unstructured data, and enables developers to build semantic search and RAG applications using a variety of third-party machine learning (ML) models, as well as ecosystem tooling from providers including Cohere, LangChain and LlamaIndex.

Red Hat OpenShift AI paired with Elasticsearch allows for deeper and more comprehensive customer support, as well as further innovation and integration with Red Hat’s vast ecosystem of AI partners. Successful implementations of GenAI help to build trust in AI solutions, leading to greater AI adoption and, ultimately, more user choice in the AI market.

This expansion of Red Hat’s existing collaboration with Elastic exemplifies the positive impact AI can have on business applications and the broader market. By meeting enterprises where they are in their adoption of AI, Red Hat is helping them harness often underutilized data, which can be a major differentiator for organizations.

Share this

Industry News

May 16, 2024

Pegasystems announced the general availability of Pega Infinity ’24.1™.

May 16, 2024

Mend.io and Sysdig unveiled a joint solution to help developers, DevOps, and security teams accelerate secure software delivery from development to deployment.

May 16, 2024

GitLab announced new innovations in GitLab 17 to streamline how organizations build, test, secure, and deploy software.

May 16, 2024

Kobiton announced the beta release of mobile test management, a new feature within its test automation platform.

May 15, 2024

Gearset announced its new CI/CD solution, Long Term Projects in Pipelines.

May 15, 2024

Rafay Systems has extended the capabilities of its enterprise PaaS for modern infrastructure to support graphics processing unit- (GPU-) based workloads.

May 15, 2024

NodeScript, a free, low-code developer environment for workflow automation and API integration, is released by UBIO.

May 14, 2024

IBM announced IBM Test Accelerator for Z, a solution designed to revolutionize testing on IBM Z, a tool that expedites the shift-left approach, fostering smooth collaboration between z/OS developers and testers.

May 14, 2024

StreamNative launched Ursa, a Kafka-compatible data streaming engine built on top of lakehouse storage.

May 14, 2024

GitKraken acquired code health innovator, CodeSee.

May 13, 2024

ServiceNow introduced a new no‑code development studio and new automation capabilities to accelerate and scale digital transformation across the enterprise.

May 13, 2024

Security Innovation has added new skills assessments to its Base Camp training platform for software security training.

May 13, 2024

CAST introduced CAST Highlight Extensions Marketplace — an integrated marketplace for the software intelligence product where users can effortlessly browse and download a diverse range of extensions and plugins.

May 09, 2024

Red Hat and Elastic announced an expanded collaboration to deliver next-generation search experiences supporting retrieval augmented generation (RAG) patterns using Elasticsearch as a preferred vector database solution integrated on Red Hat OpenShift AI.

May 09, 2024

Traceable AI announced an Early Access Program for its new Generative AI API Security capabilities.