How to Bridge the Gap: The Open Distributed Cloud
March 03, 2022

Roopak Parikh
Platform9

Driven by the demand for delivering applications at the locations of their choice, enterprises are increasingly deploying distributed infrastructure, with 67% of cloud deployments distributed across on-premises, hybrid, and edge clouds. Comparatively, our recent research found that developers are facing new challenges as they expand their use of cloud computing in ways that increasingly span multiple locations.

For example, public clouds are walled gardens, locking customers into a single vendor's ecosystem, while DIY solutions are time-consuming and increasingly complex. The best way for developers to address these concerns is to implement and operate an open distributed platform that offers the best of both worlds, combining the power of the public cloud on the infrastructure of your choice.

Still, there are some considerations to address in adopting an open distributed cloud solution — ranging from deployment delays to an increase in silos and operational complexity. Additionally, building a cloud environment rooted in diverse infrastructure requires expertise in technologies like virtualization, Kubernetes and cloud-native applications.

As you consider your options, there are four main components to a successful, open distributed cloud solution:

1. Recruit and train successfully

Talent-constrained IT teams have long struggled with the complexity of running large-scale private clouds. The ratio of servers managed to an admin or automation architect can be as little as 40:1 in private clouds. In contrast, hyperscale public-cloud providers have invested significantly in automating the management of their environments, which greatly improves their admin efficiency. It is not unheard of for the server/admin ratio in public clouds to be 4000:1 or more. So it's no surprise that as we look at 2022, we see DevOps, cloud-platform engineering, and cloud-native developers as the top hiring priorities for both advanced and early deployed users.

2. Deploy a SaaS control plane that will enable you to build and operate clouds anywhere

With cost optimization, data management, and achieving high availability among top concern for executives, this SaaS control plane provides operational automation for the consumption of infrastructure and is the heart of a distributed cloud service providing benefits such as:

■ Low management overhead using a highly automated hyperscale operational model.

■ Reduced maintenance costs by aggregating all distributed infrastructure behind a single management pane.

■ Rapid and repeatable remote deployments to 100s or 1000s of distributed cloud locations with consistent, template-based configuration and policy control.

■ An operational SLA through automated health monitoring, runbook-driven resolution of common problems, and streamlined upgrades.

3. Implement managed open-source services that use 100% open-source stacks and components to deliver bare-metal, container, virtualization, and supporting platform services

Modern cloud services such as Kubernetes are actually composite services that are themselves highly distributed and therefore will support more complex orchestration capability. Kubernetes is becoming a leading tool for enabling cloud-native transformation, with 85% either using Kubernetes today or planning to deploy Kubernetes in the next six months. However, deploying control plane components such as Kubernetes master node components (API server, etcd) requires addressing the redundancy and high availability capabilities they need.

4. Integrate support for diverse, distributed infrastructure

Deploy out-of-the-box plugins and integrations for public clouds (AWS, GCP, Azure), public-cloud Kubernetes services (EKS, AKS, GKE), and multiple operating systems such as CentOS and Ubuntu.

Monitoring is critical

Concerns of security and operational complexity, including challenges like high availability, observability, and troubleshooting, are felt by 91% of enterprises deploying cloud-native technologies, so it's imperative to ensure that you have a monitoring, diagnostic, and troubleshooting process in place at all times.

Since modern cloud services are highly distributed, it is imperative that you are constantly monitoring performance. Even a small degradation in certain components can lead to a larger, system-wide degradation in time. To simplify troubleshooting (whether automated or human), and to mitigate the likelihood of larger problems, these health probes must be highly granular.

Having good health metrics provides a basis to codify the resolution of common problems via automated runbooks. These runbooks can be built for common problems that occur during normal system operation such as a control plane going offline because of an infrastructure failure. These runbooks are also effective when there are problems in new versions of cloud services or interoperability issues that are found only in the field after deployment at some scale. S

ince the runbook can be implemented without requiring a new version of the cloud service, immediate mitigation can be provided while a bug fix or a new version of the cloud service in question is developed. In this way, customers can be operational despite the complex, ever-evolving nature of modern open-source cloud technologies.

Keep upgrading

The breadth of developers and vendors participating in modern open-source ecosystems means that new versions are constantly being developed both with bug fixes as well as with security and feature enhancements.

The SaaS control plane makes it easy for customers to stay up to date by fully automating the upgrade to a new version of various cloud services. These upgrades are typically offered on a granular basis (for example, upgrading Service A should be independent of upgrading Service B), which makes change control easier for large scale enterprise deployments.

Finally, these upgrades are ideally offered in a self-service manner that enables customers to schedule their own upgrades at a time that is convenient for them and at a scope of their choosing (for example, upgrade the Virginia datacenter at 3 a.m. on Saturday but leave Mumbai untouched for now).

Saas management will be the de facto standard

For the emerging categories of distributed and edge cloud computing, geographic distribution of infrastructure and workloads limit the reach of the public cloud. Similarly, distributed edge environments need to be managed centrally with little to no touch. It is clear that SaaS management will be the de facto standard for distributed cloud management.

Roopak Parikh is Co-Founder and CTO of Platform9
Share this

Industry News

April 17, 2025

GitLab announced the general availability of GitLab Duo with Amazon Q.

April 17, 2025

Perforce Software and Liquibase announced a strategic partnership to enhance secure and compliant database change management for DevOps teams.

April 17, 2025

Spacelift announced the launch of Saturnhead AI — an enterprise-grade AI assistant that slashes DevOps troubleshooting time by transforming complex infrastructure logs into clear, actionable explanations.

April 16, 2025

CodeSecure and FOSSA announced a strategic partnership and native product integration that enables organizations to eliminate security blindspots associated with both third party and open source code.

April 16, 2025

Bauplan, a Python-first serverless data platform that transforms complex infrastructure processes into a few lines of code over data lakes, announced its launch with $7.5 million in seed funding.

April 15, 2025

Perforce Software announced the launch of the Kafka Service Bundle, a new offering that provides enterprises with managed open source Apache Kafka at a fraction of the cost of traditional managed providers.

April 14, 2025

LambdaTest announced the launch of the HyperExecute MCP Server, an enhancement to its AI-native test orchestration platform, HyperExecute.

April 14, 2025

Cloudflare announced Workers VPC and Workers VPC Private Link, new solutions that enable developers to build secure, global cross-cloud applications on Cloudflare Workers.

April 14, 2025

Nutrient announced a significant expansion of its cloud-based services, as well as a series of updates to its SDK products, aimed at enhancing the developer experience by allowing developers to build, scale, and innovate with less friction.

April 10, 2025

Check Point® Software Technologies Ltd.(link is external) announced that its Infinity Platform has been named the top-ranked AI-powered cyber security platform in the 2025 Miercom Assessment.

April 10, 2025

Orca Security announced the Orca Bitbucket App, a cloud-native seamless integration for scanning Bitbucket Repositories.

April 10, 2025

The Live API for Gemini models is now in Preview, enabling developers to start building and testing more robust, scalable applications with significantly higher rate limits.

April 09, 2025

Backslash Security(link is external) announced significant adoption of the Backslash App Graph, the industry’s first dynamic digital twin for application code.

April 09, 2025

SmartBear launched API Hub for Test, a new capability within the company’s API Hub, powered by Swagger.

April 09, 2025

Akamai Technologies introduced App & API Protector Hybrid.