Oracle Launches Zettascale10 Cluster for AI on Cloud Infrastructure

Oracle Launches Zettascale10 Cluster for AI on Cloud Infrastructure

Oracle announced Oracle Cloud Infrastructure (OCI) Zettascale10, the world’s largest and most advanced AI supercomputer in the cloud. Designed to power the next era of large-scale artificial intelligence, OCI Zettascale10 connects hundreds of thousands of NVIDIA GPUs across multiple data centers to form multi-gigawatt clusters, delivering up to an unprecedented 16 zettaFLOPS of peak performance.

OCI Zettascale10 serves as the foundational fabric for the flagship supercluster built in collaboration with OpenAI in Abilene, Texas, as part of the Stargate initiative. Leveraging Oracle’s next-generation Acceleron RoCE networking architecture and NVIDIA’s AI infrastructure, the system achieves breakthrough scalability, ultra-low GPU-to-GPU latency, and industry-leading price-performance-all with the reliability and efficiency demanded by large-scale AI workloads.

Building on the success of Oracle’s first Zettascale cluster introduced in September 2024, OCI Zettascale10 marks a major leap forward in cloud supercomputing. Each cluster is deployed within hyper-dense, gigawatt-scale data center campuses engineered for optimal proximity-within a two-kilometer radius-to ensure best-in-class GPU interconnect performance for training massive AI models.

“With OCI Zettascale10, we’re fusing OCI’s groundbreaking Oracle Acceleron RoCE network architecture with next-generation NVIDIA AI infrastructure to deliver multi-gigawatt AI capacity at unmatched scale,” said Mahesh Thiagarajan, executive vice president, Oracle Cloud Infrastructure. “Customers can build, train, and deploy their largest AI models into production using less power per unit of performance and achieving high reliability. In addition, customers will have the freedom to operate across Oracle’s distributed cloud with strong data and AI sovereignty controls.”

Also Read: Sonata Software and adesso Form Global AI Modernization Alliance

The first deployment of the OCI Zettascale10 network and cluster fabric has already gone live at the Stargate site in Abilene, Texas, where Oracle and OpenAI are jointly scaling next-generation AI infrastructure.

“OCI Zettascale10 network and cluster fabric was developed and deployed first at the flagship Stargate site in Abilene, Texas – our joint supercluster with Oracle,” said Peter Hoeschele, vice president, Infrastructure and Industrial Compute, OpenAI. “The highly scalable custom RoCE design maximizes fabric-wide performance at gigawatt scale while keeping most of the power focused on compute. We’re excited to keep scaling Abilene and the broader Stargate program together.”

Oracle plans to offer multi-gigawatt OCI Zettascale10 deployments to enterprise and hyperscale customers. Initial configurations will feature up to 800,000 NVIDIA GPUs, delivering predictable performance, strong cost efficiency, and extremely high GPU-to-GPU bandwidth enabled by Oracle Acceleron’s ultra-low-latency RoCEv2 networking.

“Oracle and NVIDIA are bringing together OCI’s distributed cloud and our full-stack AI infrastructure to deliver AI at extraordinary scale,” said Ian Buck, vice president of Hyperscale, NVIDIA. “Featuring NVIDIA full-stack AI infrastructure, OCI Zettascale10 provides the compute fabric needed to advance state-of-the-art AI research and help organizations everywhere move from experimentation to industrialized AI.”

With OCI Zettascale10, Oracle reaffirms its commitment to pushing the boundaries of AI performance, scalability, and sustainability, giving customers the infrastructure to build, train, and deploy the next generation of intelligent systems.