Red Hat Completes Acquisition of Neural Magic
January 13, 2025

Red Hat has completed its acquisition of Neural Magic, a provider of software and algorithms that accelerate generative AI (gen AI) inference workloads.

With Neural Magic, Red Hat adds expertise in inference performance engineering and model optimization, helping further the company’s vision of high-performing AI workloads that directly map to unique customer use cases, wherever needed across the hybrid cloud.

The large language models (LLMs) underpinning today’s gen AI use cases, while innovative, are often too expensive and resource-intensive for most organizations to use effectively. To address these challenges, Red Hat views smaller, optimized and open source-licensed models driven by open innovation across compute architectures and deployment environments as key to the future success of AI strategies.

Neural Magic’s commitment to making optimized and efficient AI models a reality furthers Red Hat’s ability to deliver on this vision for AI. Neural Magic is also a leading contributor to vLLM, an open source project developed by UC Berkeley for open model serving, which will help bring even greater choice and accessibility in how organizations build and deploy AI workloads.

With Neural Magic’s technology and performance engineering expertise, Red Hat aims to break through the challenges of wide-scale enterprise AI, using open source innovation to further democratize access to AI’s transformative power via:

- Open source-licensed models, from 1 billion to hundreds of billions of parameters, that can run anywhere and everywhere needed across the hybrid cloud - in corporate data centers, on multiple clouds and at the edge.

- Fine-tuning capabilities that enable organizations to more easily customize LLMs to their private data and use cases with a stronger security footprint.

- Inference performance engineering expertise, resulting in greater operational and infrastructure efficiencies.

- A partner and open source ecosystem and support structures that enable broader customer choice, from LLMs and tooling to certified server hardware and underlying chip architectures.

The concept of choice is as crucial for gen AI today as it was for cloud-native or containerized applications several years ago: the right environment (cloud, server, edge, etc.), accelerated compute and inference server are all critical for successful gen AI strategies. Red Hat remains firm in its commitment to customer choice across the hybrid cloud, including AI, with the acquisition of Neural Magic further supporting this promise.

The expertise and capabilities of Neural Magic will be incorporated into Red Hat AI, Red Hat’s portfolio of gen AI platforms. Built with the hybrid cloud in mind, Red Hat AI encompasses:

- Red Hat Enterprise Linux AI (RHEL AI), a foundation model platform to more seamlessly develop, test and run the IBM Granite family of open source-licensed LLMs for enterprise applications on Linux server deployments.

- Red Hat OpenShift AI, an AI platform that provides tools to rapidly develop, train, serve and monitor machine learning models across distributed Kubernetes environments on-site, in the public cloud or at the edge.

- InstructLab, an approachable open source AI community project created by Red Hat and IBM that enables anyone to shape the future of gen AI via the collaborative improvement of open source-licensed Granite LLMs using InstructLab's fine-tuning technology.

vLLM, LLM Compressor, pre-optimized models and more are all slated to be incorporated into Red Hat AI, making Neural Magic an integral piece of Red Hat’s AI platform offerings.

Supporting Quotes

Matt Hicks, president and CEO, Red Hat, said: “Efficiency, optimization and choice aren’t unique concepts when it comes to traditional enterprise IT, and we feel that gen AI should be no different. By adding Neural Magic’s expertise in gen AI performance engineering and optimization to Red Hat AI, we’re furthering our commitment to a gen AI that answers customers’ unique needs, from where workloads run to how they are tuned and trained.”

Brian Stevens, CEO, Neural Magic, said: “Neural Magic’s research and technical contributions to open source AI have significantly reduced the infrastructure required to deploy state-of-the-art large language models at scale. Red Hat shares our vision that the Future of AI is Open, and we are looking forward to together enabling enterprises to capture the value of GenAI without all of the friction.”
