VAST DataSpace

Say Goodbye to Data Gravity

Meet the revolutionary VAST DataSpace.​ A continuation of our first-principles thinking, the DataSpace​ provides consistency and fast access to your data, from edge to cloud.​

The VAST DataSpace has broken the fundamental tradeoff between performance and consistency to make it possible to access, transact, and protect data from edge to cloud, across 100’s of locations.

Features

Breaking the tradeoffs between performance and consistency.

VAST had to completely rethink how lock management was done across hybrid environments.  Existing implementations were optimized for read-centric workloads and fall flat with transactional workloads.​

VAST DataSpace decentralizes and moves the lock management down to the element-level (file, object, table) and allows for the lock to be held by the VAST cluster doing the transaction.​

Reads can achieve peak performance while writes maintain consistency because before writes are executed by a VAST cluster, it will ensure that all globally cached copies of that element are removed and any references will be directed to the locking cluster until the transaction is complete.​

Access your data anywhere — without all the tradeoffs.

Snapshot copies of data at the volume level lack data coherency. This can lead to potential data inconsistencies and issues when trying to restore or access specific points in time. On the other hand, centralized lock management with directory-level locks can cause write bottlenecks because all write requests must pass through a central point. With the VAST DataSpace, the system delivers fine-grained de-centralized locks, delivering local write performance. By distributing the lock management across different parts of the system, it improves concurrency and enhances overall system performance, ensuring data integrity and efficiency in handling read and write operations.

Flexible, path-based synchronization.

VAST DataSpace allows sites to subscribe only to what they want, making it a more efficient solution out the gate. Often times there is no need to synchronize the entire dataset across remote sites.

Each site can subscribe to any path and access the data through the protocol of choice: file, object, and table. This allows for flexibility in supporting all the various applications across your organization.

Each path can have its own set of synchronization policies and consistency modes allowing you to support the diverse set of requirements of your organization.

Lazy Mode

VAST DataSpace’s Lazy Mode intelligently syncs data based on access predictions, maximizing bandwidth utilization. It’s storage-efficient but has a higher time-to-first byte.

Scheduled Mode

In Scheduled Mode, VAST DataSpace utilizes custom scripts or popular job schedulers (Run AI, Slurm, Kubernetes) to anticipate data needs, balancing efficiency and performance for remote sites.

Read/Write Paths Consistency Mode

In Read/Write Mode, all sites in the DataSpace will be able to read and write with consistency, while providing a global namespace with both read and write performance you can expect from a local storage system.

images

Global Clones Consistency Mode

Writeable Clones are great for use cases when you want a point-in-time copy of particular path of your namespace, but you don’t want the updates to propagate back to the global namespace.

images

Your Data, Your Choice of Location

VAST Data Platform has been available through manufacturing and systems partners to deploy enterprise-grade infrastructure in your core and edge datacenters, in both ruggedized and non-ruggedized environments.

Now the VAST Data Platform is available through various Cloud Service Providers such as AWS, Azure, and GCP.

With VAST DataSpace, customers are able to seamlessly move their data to where it needs to be without having to use third party software or maintain complex data orchestration environments.

Your data is where it needs to be, when it needs to be.

images
Features

Designed for AI, the enterprise, and beyond

Decentralized Lock Management

Locks are distributed to each node that needs it and are established at a very granular level on elements (files, objects, tables). This provides high transactional performance without having to check with a central repository.

Global Data Reduction

VAST’s Similarity-Based data reduction combines the global nature of deduplication with the fine granularity of compression across your entire global namespace.

Fine-grained Lease Revocation

Before writes are performed, VAST DataSpace will ensure all cached copies of that element are removed and reference the locking site until the write is completed. This ensures high transactional performance while maintaining write consistency.

Massive Scale

VAST clusters can be built to support well over an exabyte of data capacity. Today, several customers run clusters over 100PB in size

Path-based Synchronization

Any number of sites can subscribe to any paths from any other site ensuring data is available where it needs to be, without having to unnecessarily synchronize unnecessary data.

Global Namespace

VAST DataSpace enables any or all of your data to be accessible anywhere you need it to be, without having to use third party software or complex data orchestration logic. Whether it’s in your core or edge datacenters, or in your cloud service provider, VAST will ensure your data will be where it needs to be.

Flexible Sync Policies

VAST DataSpace offers Lazy, Eager, and Scheduled Caching to allow users to optimize for storage efficiency, time-to-first byte, or optimize for scheduled jobs.

Flexible Consistency Modes

Read/Write modes allow for remote sites to simultaneously write to the same namespace while ensuring write consistency. Writeable clones allow for a point-in-time version of a path to be written to on a remote site without propagating those changes back to the global namespace.