VAST Data Platform for Artificial Intelligence

The Data Platform for the AI Era

The VAST Data Platform is the foundation trusted by world-leading AI research teams to deliver the scale, speed, and reliability needed to train neural networks and infer in real time. VAST enables greater statistical accuracy by removing the barriers to training on all of an organization’s data at any scale.

Trusted by the world’s leading artificial intelligence organizations
View All Customers

Designed from the ground up to make AI simple to deploy and manage the VAST Data Platform is a next-generation distributed data store, unifying file, object, and database services into one scalable, affordable all-flash system that simplifies data pipelines.

Overview

The new age of computing requires a new approach to data.

Every industry is at the dawn of a new AI-powered era thanks to the exponential advancements in artificial intelligence. Still, the barrier to entry is high for many IT organizations. New machine learning workloads such as training generative AI models exceed the performance and scale capabilities of traditional enterprise infrastructures.  HPC systems based on parallel file systems provide adequate performance but complexity and lack of enterprise features make them difficult for many IT teams to support. Typically customers deploy a combination of HPC storage for high-performance ephemeral scratch space and lower-cost NAS for long-term data retention. The result is complicated and tedious data pipelines where data must be copied from tier to tier before AI training can even begin. 

Enter the VAST Data Platform, a simple enterprise NAS with the performance and scalability for the most demanding AI applications combined with revolutionary data efficiency technologies that reduce the cost of flash to archive tier economics. When all data is available for high-performance training machine learning workflows are simple and time to insight is reduced. 

VAST NVIDIA Partnership

VAST Data and NVIDIA share a common goal of democratizing the power of artificial intelligence for organizations of all sizes. As the first enterprise NAS solution to achieve SuperPOD certification, the VAST Data Platform makes AI simpler, faster, and easier to manage.

images
NVIDIA + VAST Partnership

As we see AI proliferate to enterprise companies, there’s a need for an AI platform that is truly enterprise-grade. NVIDIA has a great partnership with VAST, and we’re looking forward to working with the team as they’ve taken this bold vision to build beyond storage to deliver a platform that helps to bring structured and unstructured data together in a unified, global namespace.

Manuvir Das
Vice President, Enterprise Computing at NVIDIA
Providing Frictionless Video Communication Tools Worldwide

Discover how the VAST Data Platform enables Zoom to seamlessly scale its operations and provide customers worldwide with frictionless video communication tools.

Key Benefits

Unleashing Exascale Innovation

The VAST Data Platform is unified and built as a singular intelligent system, designed entirely from the ground up to make AI infrastructure simple to deploy and manage on-premises or in the cloud. The VAST Data Platform introduces the VAST DataBase that accelerates query execution for Spark, Trino, and native SQL applications. To simplify data management, the VAST DataBase also powers the VAST Catalog, an automatic and always-in-sync metadata index of every file and object. The VAST DataSpace enables multi-data center and cloud-bursting workflows to support distributed teams and leverage cloud resources.

Coming 2024:  The DataEngine is an intelligent computing environment that customers deploy from edge to cloud. By embedding logic directly into the VAST Data Platform, the system can schedule processing events in real-time, triggered by data activities.

A Unified, Multi-Protocol Platform

A unified multi-protocol platform for unstructured (NFS, SMB, and S3) and structured data (native SQL applications and query engines like Spark and Trino).

AI-Optimized Client Access

With support for RDMA and GPUDirect Storage access, VAST’s NAS experience delivers the performance of a parallel file system without any of the parallel file system complexity.

Multi-Tenant Infrastructure

Dedicate front-end servers and the performance they provide to the most critical projects. VAST's server pooling capability provides dedicated Quality of Service for competing projects.

Simplified Data Pipelines

Eliminate time-consuming data copy workflows with all data available in real-time plus high-performance access via Infiniband and Ethernet.

Built-in Feature Catalog

The VAST DataBase enables a real time data catalog for deep analytic queries on training data and machine learning derived metadata.

Global Namespace

Simplify data access from anywhere with a single global namespace across cloud, edge, and core.

A Unified, Multi-Protocol Platform

A unified multi-protocol platform for unstructured (NFS, SMB, and S3) and structured data (native SQL applications and query engines like Spark and Trino).

AI-Optimized Client Access

With support for RDMA and GPUDirect Storage access, VAST’s NAS experience delivers the performance of a parallel file system without any of the parallel file system complexity.

Multi-Tenant Infrastructure

Dedicate front-end servers and the performance they provide to the most critical projects. VAST's server pooling capability provides dedicated Quality of Service for competing projects.

Simplified Data Pipelines

Eliminate time-consuming data copy workflows with all data available in real-time plus high-performance access via Infiniband and Ethernet.

Built-in Feature Catalog

The VAST DataBase enables a real time data catalog for deep analytic queries on training data and machine learning derived metadata.

Global Namespace

Simplify data access from anywhere with a single global namespace across cloud, edge, and core.

Reference Architecture

HPC performance meets enterprise simplicity.

images