Pulumi Launches New Infrastructure Libraries for GenAI Stack
February 21, 2024

Pulumi now offers native ways to manage Pinecone indexes, including its latest serverless indexes.

Pinecone is a serverless vector database with an easy-to-use API that allows developers to build and deploy high-performance AI applications. This matters because applications involving large language models, generative AI, and semantic search require a vector database to store and retrieve vector embeddings.

Pulumi also now has a template to launch and run LangChain’s LangServe in Amazon ECS, a container management service. This is in addition to Pulumi’s existing support for running Next.js frontend applications in Vercel, managing Apache Spark clusters in Databricks, and working with 150+ other cloud and SaaS services.

The GenAI tech stack is new and still emerging, but it has typically consisted of an LLM service and a vector data store. Running this stack on a laptop is fairly simple, but getting it to production is far harder. Most of that work is done by hand through a CLI or a web console, which introduces human error and repeatability problems that affect the security and reliability of the product.

Pulumi has made it easy to take a GenAI stack running locally and get it into production in the cloud with Pulumi AI, the fastest way to learn and build Infrastructure as Code (IaC). Because much of the complexity of GenAI comes down to provisioning and managing cloud infrastructure, Pulumi, which is purpose-built to manage that complexity, is easy to apply to this new AI use case.

Pulumi allows developers to tie together all the different pieces of infrastructure that go into their GenAI product and manage them from a simple Python program.
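To make the contrast with manual provisioning concrete, here is a minimal sketch of the general idea behind declaring a stack in one program. It does not use Pulumi's SDK; the `Resource` model, the `plan` function, and every name and value in it (`docs`, `langserve`, dimension 1536, port 8000) are hypothetical and chosen only for illustration. It shows how a declared stack can be compared with what already exists, so that running the same program twice produces no further changes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    """One declared piece of infrastructure (hypothetical model, not Pulumi's API)."""
    kind: str        # e.g. "vector-index" or "container-service"
    name: str
    settings: tuple  # sorted (key, value) pairs, so instances compare by value


def resource(kind, name, **settings):
    """Build a Resource from keyword settings."""
    return Resource(kind, name, tuple(sorted(settings.items())))


def plan(desired, current):
    """Compare declared resources with existing ones and list the actions needed."""
    existing = {(r.kind, r.name): r for r in current}
    wanted = {(r.kind, r.name): r for r in desired}
    actions = []
    for key, res in wanted.items():
        if key not in existing:
            actions.append(("create", key))
        elif existing[key] != res:
            actions.append(("update", key))
    for key in existing:
        if key not in wanted:
            actions.append(("delete", key))
    return actions


# The whole stack is declared in one place (illustrative values only).
stack = [
    resource("vector-index", "docs", dimension=1536, metric="cosine"),
    resource("container-service", "langserve", port=8000, replicas=2),
]

first_run = plan(stack, current=[])      # nothing exists yet: everything is created
second_run = plan(stack, current=stack)  # already in place: nothing to do
print(first_run)
print(second_run)
```

The second run returning an empty plan is the repeatability property that hand-run CLI or console steps lack. A real IaC tool layers provider calls, state storage, and dependency ordering on top of this kind of comparison.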

