We’re excited to announce the general availability of Oracle Cloud Infrastructure (OCI) Supercluster with NVIDIA H200 Tensor Core GPUs. The largest AI supercomputer available in the cloud*, our latest Supercluster scales up to an industry-leading 65,536 GPUs. At maximum scale, it can offer up to 260 ExaFLOPS of peak FP8 performance, more than four times the previous generation.
Each OCI Compute instance within the Supercluster has 76% more high-bandwidth memory capacity and 40% more memory bandwidth than the H100 instance, improving large language model (LLM) inference performance by up to 1.9X. With double the front-end network throughput for data ingestion and retrieval (200 Gbps per instance), data transfer to and from the cluster is also dramatically improved to further accelerate AI model training and deployment.
AI models continue to evolve and become more capable by using over a trillion parameters to improve accuracy, fluency, efficiency, multimodal capability, and other dimensions. These new models require faster GPUs arranged into very large clusters. OCI superclusters offer the following features:
Best of all, pricing remains $10 per GPU per hour, the same as the previous generation instance for NVIDIA H100 GPUs (BM.GPU.H100.8).
The NVIDIA H200 shape boasts the following specifications:
To get access to Oracle Cloud Infrastructure Supercluster with NVIDIA H200 GPUs, contact your Oracle sales team and learn more about AI Infrastructure.
* Scalability for CSP 1: 20,000 NVIDIA H200 GPUs; Scalability for CSP 2 and CSP 3: not available publicly
Sagar Zanwar is a Lead Product Manager specializing in GPU products within the AI Infrastructure Group. With a keen focus on advancing AI infrastructure capabilities through cutting-edge GPU/OCI technology, Sagar plays a pivotal role in driving innovation and delivering high-performance solutions in the AI industry.
Akshai is a Director of Product Marketing for Oracle Cloud Infrastructure (OCI) focused on driving adoption of OCI’s services and solutions. He has over 15 years of experience and is a graduate of UC Berkeley and Georgia Tech.
Previous Post
Next Post