Pulumi Launches New Infrastructure Libraries for GenAI Stack
February 21, 2024

Pulumi now offers native ways to manage Pinecone indexes, including its latest serverless indexes.

Pinecone is a serverless vector database with an easy-to-use API that allows developers to build and deploy high-performance AI applications. This matters because applications involving large language models, generative AI, and semantic search require a vector database to store and retrieve vector embeddings.

Pulumi also now has a template to launch and run LangChain’s LangServe in Amazon ECS, a container management service. This is in addition to Pulumi’s existing support for running Next.js frontend applications in Vercel, managing Apache Spark clusters in Databricks, and 150+ other cloud and SaaS services.

The GenAI tech stack is new and still emerging, but it has typically consisted of an LLM service and a vector data store. Running this stack on a laptop is fairly simple, but getting it to production is far harder. Most of that work is done manually through a CLI or a web console, which introduces human error and repeatability problems that affect the security and reliability of the product.

Pulumi has made it easy to take a GenAI stack running locally and get it into production in the cloud with Pulumi AI, the fastest way to learn and build Infrastructure as Code (IaC). Because GenAI complexity largely comes down to cloud infrastructure provisioning and management, Pulumi, which is purpose-built to manage cloud complexity, is easy to use for this new AI use case.

Pulumi allows developers to tie together all the different pieces of infrastructure that go into their GenAI product and manage them from a simple Python program.
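
To make the contrast with manual CLI or console work concrete, here is a minimal sketch of the general idea behind declaring infrastructure in a Python program: the desired pieces are written down as data, and a plan is computed by comparing them with what already exists. This is not Pulumi's API. The `Resource` class, the `resource` and `plan` helpers, and every resource name and setting below are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    """One declared piece of infrastructure (hypothetical model, not a Pulumi type)."""
    name: str
    kind: str
    settings: tuple = ()  # sorted (key, value) pairs, so two resources compare by value


def resource(name, kind, **settings):
    """Build a Resource from keyword settings (hypothetical helper)."""
    return Resource(name, kind, tuple(sorted(settings.items())))


def plan(desired, actual):
    """List the actions needed to move from the actual state to the desired state."""
    desired_by_name = {r.name: r for r in desired}
    actual_by_name = {r.name: r for r in actual}
    actions = []
    for name, res in desired_by_name.items():
        if name not in actual_by_name:
            actions.append(("create", name))
        elif actual_by_name[name] != res:
            actions.append(("update", name))
    for name in actual_by_name:
        if name not in desired_by_name:
            actions.append(("delete", name))
    return actions


# Assumed example stack: a vector index plus a container service for the LLM app.
desired = [
    resource("embeddings-index", "vector-index", metric="cosine", dimension=1536),
    resource("langserve-api", "container-service", cpu=512, memory=1024),
]

# Assumed current state, for example left over from manual experiments.
actual = [
    resource("embeddings-index", "vector-index", metric="cosine", dimension=768),
    resource("old-demo", "container-service", cpu=256, memory=512),
]

for action, name in plan(desired, actual):
    print(f"{action}: {name}")
```

Running the same program against an environment that already matches the declaration yields an empty plan, which is the repeatability that manual steps lack.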

Industry News

May 02, 2024

Parasoft announces the opening of its new office in Northeast Ohio.

May 02, 2024

Postman released v11, a significant update that speeds up development by reducing collaboration friction on APIs.

May 02, 2024

Sysdig announced the launch of the company’s Runtime Insights Partner Ecosystem, recognizing the leading security solutions that combine with Sysdig to help customers prioritize and respond to critical security risks.

May 02, 2024

Nokod Security announced the general availability of the Nokod Security Platform.

May 02, 2024

Drata has acquired oak9, a cloud native security platform, and released a new capability in beta to seamlessly bring continuous compliance into the software development lifecycle.

May 01, 2024

Amazon Web Services (AWS) announced the general availability of Amazon Q, a generative artificial intelligence (AI)-powered assistant for accelerating software development and leveraging companies’ internal data.

May 01, 2024

Red Hat announced the general availability of Red Hat Enterprise Linux 9.4, the latest version of the enterprise Linux platform.

May 01, 2024

ActiveState unveiled Get Current, Stay Current (GCSC) – a continuous code refactoring service that deals with breaking changes so enterprises can stay current with the pace of open source.

May 01, 2024

Lineaje released Open-Source Manager (OSM), a solution to bring transparency to open-source software components in applications and proactively manage and mitigate associated risks.

May 01, 2024

Synopsys announced the availability of Polaris Assist, an AI-powered application security assistant on the Synopsys Polaris Software Integrity Platform®.

April 30, 2024

Backslash Security announced the findings of its GPT-4 developer simulation exercise, designed and conducted by the Backslash Research Team, to identify security issues associated with LLM-generated code. The Backslash platform offers several core capabilities that address growing security concerns around AI-generated code, including open source code reachability analysis and phantom package visibility capabilities.

April 30, 2024

Azul announced that Azul Intelligence Cloud, Azul’s cloud analytics solution -- which provides actionable intelligence from production Java runtime data to dramatically boost developer productivity -- now supports Oracle JDK and any OpenJDK-based JVM (Java Virtual Machine) from any vendor or distribution.

April 30, 2024

F5 announced new security offerings: F5 Distributed Cloud Services Web Application Scanning, BIG-IP Next Web Application Firewall (WAF), and NGINX App Protect for open source deployments.

April 29, 2024

Code Intelligence announced a new feature for CI Sense, a scalable fuzzing platform for continuous testing.

April 29, 2024

WSO2 is adding new capabilities for WSO2 API Manager, WSO2 API Platform for Kubernetes (WSO2 APK), and WSO2 Micro Integrator.