INTRODUCTION

Technology Overview

Rubrik is an intelligent Data Management stack where each layer scales and is independently resistant to failures. Designed to run on-prem or on the cloud, the stack is anchored by Infinity (API and deep app awareness), Cerebro (the “brains”), and Atlas (cloud-scale file system built from scratch

Infinity

The interface between the outer world and Cerebro. APIs execute SLA policies throughout the system and deliver granular control to users.

Cerebro

The brains of Rubrik. Comprised of Blob Engine and Distributed Task Framework. Abstracts a data control plane detached from any underlying infrastructure.

Atlas

Cloud-scale file system designed to be masterless and self-healing. Works with Cerebro to provide instant recovery.

BENEFITS

Mobilize Your Data Anywhere


Together, Rubrik Blob Engine and Distributed Task Framework create a control plane that can orchestrate data from on-prem to cloud. In other words, data can be sent to any destination and quickly retrieved when needed. Unlike legacy solutions, Rubrik has integrated an API-first architecture from Day 1 and consumes the same APIs published and offered to users.
 

Rubrik’s APIs are designed to work in two ways:

White Paper

Intelligent Data Protection with Rubrik

It provides intelligent data management and global search while ensuring cradle-to-grave data lifecycle governance - from the initial data ingest all the way to archiving and ultimately expiring data based on the policy used. This paper will explore Rubrik in action, demonstrating the power of Cerebro. 

img

A CLOSER LOOK: CEREBRO

The Blob Engine

Rubrik Blob Engine is a distributed version control system, detached from any underlying application and infrastructure (e.g., storage, on-prem, cloud).

It can orchestrate data from on-prem to cloud, cloud to cloud, cloud to on-prem. It provides core data management services, including immutability, deduplication, retention, replication, and archival.

The Rubrik Blob Engine is designed to deliver instant access to data to meet today’s demands for recovery, test/dev, and analytics. It dynamically evaluates how to minimize fragmentation and latency within the snapshot to achieve near-zero recovery times, especially for applications that require higher quality of service (e.g., Gold vs. Silver SLA).

To deliver dramatically lower RTOs through “Live Mounts”, Rubrik exploits its distributed system DNA. When a “Live Mount” is initiated, Rubrik issues parallel requests to the cluster nodes and underlying storage to read the distributed data concurrently. Unlike traditional rehydration, Rubrik employs parallel synthesis of the data, accelerating the time in which data can be presented back to the system for recovery or test/dev purposes.

In addition, the Blob Engine maintains a mapping between content ID and an usable representation of the corresponding content until deletion (could be stored in Atlas file system). Everything is stored in an immutable format (immune to ransomware).

By viewing this video, you are providing your express consent that your viewing history has been captured and may be shared with our affiliates or third-party providers that may also combine with other data they collect about you, e.g. your use of their services. We and our third-party providers may use this information to present you with offers, promotions, or other marketing that we think you'll find relevant.

Chris Wahl takes you behind the scenes to see how the brains of Rubrik was built with founding lead engineer Fabiano Bolteho.

A CLOSER LOOK: CEREBRO

The Distributed Task Framework

By viewing this video, you are providing your express consent that your viewing history has been captured and may be shared with our affiliates or third-party providers that may also combine with other data they collect about you, e.g. your use of their services. We and our third-party providers may use this information to present you with offers, promotions, or other marketing that we think you'll find relevant.

Founding engineer Jon Derryberry discusses the benefits of a declarative policy engine (what data should exist and where in the system).

Automate your SLA policies with the Distributed Task Framework by globally assigning and executing tasks across the system in a fault-tolerant and efficient manner. 

It enforces the activities to uphold the assigned SLA policies on a daily and long-term basis. Once a SLA policy is set, it strategizes to meet these set goals for data retention, replication, and archival.

For example, if a user has defined prioritization requirements within the SLA — this database is mission-critical, it will constantly perform data efficiency checks (data consolidation, compression, deduplication) to ensure it recovers quickly.

Hear a Customer's API Story

There are extensive data retention requirements due to the nature of our industry, and Rubrik was attuned to our need for a robust, future-forward solution that reduced data management complexity.

Sean Donaldson
Chief Technology Officer

Resources

Featured Content