Both unstructured and structured data collected continue to grow. By 2020, we anticipate that the world’s volume of digital data will exceed 44 zettabytes, an astounding number.

The need to drive insights from this growing data has become ever more critical to differentiate in the market, and the need to support compute-intensive workloads continues to rise in order to remain competitive in the market.

Organizations are developing AI models to drive insights and enable the inferences to improve customer experiences. Machine learning models are becoming more and more complex with the increase in the amount of data, and deep learning models continue to gain traction as the need for more precise algorithms to differentiate in the market grows.

Every AI model is different and needs different precision and training levels. Furthermore, High Performance Computing (HPC) continues to grow in importance in many industries to drive these models and compute-intensive workloads.

IBM Cloud offers different performance-level GPU cards

To support customers’ HPC and AI workloads efficiently, IBM Cloud GPU offerings provide 17.3% better performance per dollar compared to AWS (Performance Report). We at IBM realize these varied needs, so we have enabled different performance-level GPU cards with bare metal servers and virtual instances:

  1. NVIDIA Tesla M60: Fundamental enterprise performance for virtualization and professional graphics.
  2. NVIDIA Tesla K80: Reliable enterprise performance for introductory AI computing.
  3. NVIDIA Tesla P100: Essential performance for growing advanced AI and HPC capabilities.
  4. NVIDIA Tesla V100: Maximum performance for progressive deep learning workloads.

Two additional GPU capabilities

IBM Cloud GPU offerings support many industries, such as financial, healthcare, and industrial sectors. As we continue to support the growing needs of these industries, we have updated our current GPU capabilities with two additional capabilities:

  1. 32GB V100 GPU Cards: Now available 32GB V100 GPU Cards for AI and HPC workloads in DAL13, WDC07, LO04, and FRA04. With 640 Tensor Cores, Tesla V100 is the world’s first GPU to break the 100 teraFLOPS (TFLOPS) barrier of deep learning performance. Compared to 16GB V100 GPU Cards, V100 GPU Cards with 32GB configuration take advantage of these high-performing cards to drive down time to train deep learning or machine learning models, whether its floating point precision or fixed point. To order please click here, or to learn more click here.
  2. GPU cards with Virtual Servers in Tokyo, Frankfurt, and Sydney: Now available P100 and V100 GPU Cards in additional data centers in FRA02, SYD04, and TOK02. Customers in these new data centers can now take advantage of multi-tenancy to reduce their cost of using GPU cards with virtual server offerings. Customers can use the GPU cards’ application on-demand without large compute bare metal servers if they need rapid deployment for a PoC and future scaling of their applications. To order please click here, or to learn more click here.

Benefits of GPU cards with bare metal vs. virtual instances

IBM Cloud Bare Metal Servers with GPU: High level of customization to fit high-performance workloads that can offload highly complex compute algorithms to the parallel processing of GPU without the noisy neighbors. Bare metal servers with GPUs can support larger workloads with more users, considerably reducing the cost of running applications. It also helps support use cases that could not be supported in the past as the number of applications for GPU cards continue to rise.

IBM Cloud Virtual Servers: Low provisioning time with pre-configured virtual servers with GPU cards allows customers to deploy high-performance workloads on their virtual machines at a very low price. Customers can run their PoC’s on the virtual machines to run experiments. Once mature, the workloads can be efficiently scaled or burst with hourly or monthly options.

Learn more about IBM Cloud GPU offerings.

More from Announcements

IBM Consulting augments expertise with AWS Competencies: A win-win for clients 

3 min read - In today's dynamic economic landscape, businesses demand continuous innovation and speed of execution. At IBM Consulting®, our unwavering focus on partnerships and shared commitment to delivering enterprise-level solutions to mutual clients have been core to our success.   We are thrilled to announce that IBM® has recently gained five competencies from Amazon Web Services (AWS) in vital domains including Cloud Operations, Internet of Things (IoT), Life Sciences, Mainframe Modernization, and Telecommunications. With these credentials, IBM further establishes its position as a…

Probable Root Cause: Accelerating incident remediation with causal AI 

5 min read - It has been proven time and time again that a business application’s outages are very costly. The estimated cost of an average downtime can run USD 50,000 to 500,000 per hour, and more as businesses are actively moving to digitization. The complexity of applications is growing as well, so Site Reliability Engineers (SREs) require hours—and sometimes days—to identify and resolve problems.   To alleviate this problem, we have introduced the new feature Probable Root Cause as part of Intelligent Incident…

Reflecting on IBM’s legacy of environmental innovation and leadership

4 min read - Upholding a legacy of more than 50 years of environmental responsibility through our company’s actions and commitments, IBM continues to be a leader in driving sustainability for our business, our communities and our clients—including a 34-year history of annual, public environmental reporting, which we continue today. As a hybrid cloud and artificial intelligence (AI) company, we believe that leveraging technology is key to unlocking impact, and it will play a substantial role in how society addresses, adapts to, and overcomes…

IBM Newsletters

Get our newsletters and topic updates that deliver the latest thought leadership and insights on emerging trends.
Subscribe now More newsletters