What Comprises the Modern Data Cloud
January 11, 2021

Manav Mital
Cyral

In 1963, a team of talented engineers led by J.C.R. Licklider took on a challenge like no other before: to develop a technology that would allow a computer to be used by two or more persons simultaneously. With the challenge, they took a $2 million research grant, and thus commenced a journey, which paved the way to computer networks, virtualization technologies, and the Internet. But even four decades later, in the early 2000s, the world had still to discover one of the most impactful contributions of Licklider's team — Cloud computing.

At that point, the data still resided in on-premises databases, but the success of early Application Cloud giants like Salesforce, Workday, and Google signaled a shift in the established trend. Enterprise productivity, along with the data that fueled it, started moving to the shared services that lived outside the perimeter of the organization. It took companies like Amazon, Google and Microsoft another decade to reach the next critical milestone — the Infrastructure Cloud, which allowed organizations to not only run applications, but migrate their storage and compute resources to the cloud, recognizing massive gains in performance and efficiency. Despite these gains, organizations quickly discovered that having the means to store and process information wasn't enough to make it readily accessible. Data continued to suffer from the legacy of the on-premises days: endless siloes, inefficiencies, lack of interoperability and, as a result, poor security.

Fortunately, another turn of the decade has marked yet another milestone — a shift in data and cloud technology, which is often referred to as the Data Cloud. For the first time in history, in a matter of years, an entire ecosystem of cloud-based services appeared to store, process and analyze operational, business intelligence and other data. True to the diverse nature of data repositories, this ecosystem spans databases, data warehouses, and data pipelines. The best part? It works.

It may seem like the Data Cloud has appeared out of nowhere, but in reality, it's been a part of several large trends that the industry is going through:

Digital Transformation allows organizations to use digital technologies to create new customer experiences, business processes, or employee management to meet changing business, market and cultural requirements. The Data Cloud enables companies to efficiently capture all their diverse data, experiment with the various use cases, scale to accommodate their need to unlock the value from their data, and optimize the price for performance and value.

Data Democratization makes data available to a wide range of stakeholders in a business. It has been a key tenet of digital transformation, and helps unlock the value locked within proprietary data embedded in various pockets of the organization, which in turn helps improve both their top line and bottom line. Embracing the Data Cloud helps teams pick tools for managing, processing and analyzing data that make most sense to them.

Infrastructure as Code (IaC) allows developers to manage and configure infrastructure through text files in a human and machine-readable format. Easy to scale and prototype, IaC has seen a massive uptake as organizations embrace the cloud. Developers can now easily spin up Data Cloud services for their applications, and services to move data across from them to their data warehouse.

While still early in its growth, the Data Cloud has clearly proven its potential. From the beginning, the allure of the Data Cloud was in its ability to solve the siloing data problem. With its flexible architecture, it has quickly enabled teams to pick the technologies they most needed, providing easy access to the relevant data repositories, and eliminating the consistency and cost problems.

But quickly, other benefits of embracing the Data Cloud also became apparent:

■ The Data Cloud has brought the processing and business intelligence into the cloud, which has enabled virtually unlimited scale and performance, and unlocked many new use cases and opportunities.

■ The Data Cloud has enabled businesses to become data-driven, to make faster, better, decisions, and improve their customer reach, engagement and retention.

■ The Data Cloud has freed up a lot of internal resources normally required for infrastructure procurement, deployment and maintenance, given the as-a-service model of delivery. Those resources can now be used to continuously refine their products, unlock new opportunities and deliver deeper insights to all stakeholders.

Any type of cloud inherently introduces new challenges. Just like the application and infrastructure clouds, the Data Cloud brings with it security issues that have not been seen before. For example, one of the biggest promises of the Data Cloud is its flexibility and support for various applications, data sources, and third-party applications such as BI tools. With such flexibility, controlling access based on IPs becomes an insurmountable task, forcing organizations to find ways to integrate identity into the data source.

Further, governance and compliance becomes impossible to manage as services scale. Enforcing integrations with identity management for all data sources can easily turn into a compliance nightmare, and granting access to new users becomes such a burden that many people cut corners and resort to shared accounts and finding excuses to bypass the compliance requirements.

Finally, distributing data amongst numerous services inside a Data Cloud leads to a distributed governance model, which is a critical component in the cloud. Each tool requires different methods for managing identities and controlling permissions. They also each generate different logs and audit trails, complicating compliance and visibility.

Despite all the risks, it is clear that the promise of the data cloud far outweighs all the challenges. And as more organizations embrace the new ecosystem, more leaders start thinking about making their data not only more accessible, but also more secure. This thinking led to the emergence of a burgeoning cloud security industry in the past, and — if history is any indicator — we might be on the verge of seeing a brand new data cloud security category in the making.

Manav Mital is CEO of Cyral
Share this

Industry News

March 27, 2025

webAI and MacStadium(link is external) announced a strategic partnership that will revolutionize the deployment of large-scale artificial intelligence models using Apple's cutting-edge silicon technology.

March 27, 2025

Development work on the Linux kernel — the core software that underpins the open source Linux operating system — has a new infrastructure partner in Akamai. The company's cloud computing service and content delivery network (CDN) will support kernel.org, the main distribution system for Linux kernel source code and the primary coordination vehicle for its global developer network.

March 27, 2025

Komodor announced a new approach to full-cycle drift management for Kubernetes, with new capabilities to automate the detection, investigation, and remediation of configuration drift—the gradual divergence of Kubernetes clusters from their intended state—helping organizations enforce consistency across large-scale, multi-cluster environments.

March 26, 2025

Red Hat announced the latest updates to Red Hat AI, its portfolio of products and services designed to help accelerate the development and deployment of AI solutions across the hybrid cloud.

March 26, 2025

CloudCasa by Catalogic announced the availability of the latest version of its CloudCasa software.

March 26, 2025

BrowserStack announced the launch of Private Devices, expanding its enterprise portfolio to address the specialized testing needs of organizations with stringent security requirements.

March 25, 2025

Chainguard announced Chainguard Libraries, a catalog of guarded language libraries for Java built securely from source on SLSA L2 infrastructure.

March 25, 2025

Cloudelligent attained Amazon Web Services (AWS) DevOps Competency status.

March 25, 2025

Platform9 formally launched the Platform9 Partner Program.

March 24, 2025

Cosmonic announced the launch of Cosmonic Control, a control plane for managing distributed applications across any cloud, any Kubernetes, any edge, or on premise and self-hosted deployment.

March 20, 2025

Oracle announced the general availability of Oracle Exadata Database Service on Exascale Infrastructure on Oracle Database@Azure(link sends e-mail).

March 20, 2025

Perforce Software announced its acquisition of Snowtrack.

March 19, 2025

Mirantis and Gcore announced an agreement to facilitate the deployment of artificial intelligence (AI) workloads.

March 19, 2025

Amplitude announced the rollout of Session Replay Everywhere.

March 18, 2025

Oracle announced the availability of Java 24, the latest version of the programming language and development platform. Java 24 (Oracle JDK 24) delivers thousands of improvements to help developers maximize productivity and drive innovation. In addition, enhancements to the platform's performance, stability, and security help organizations accelerate their business growth ...