April 5, 2022 By Prem D'Cruz 3 min read

We are excited to announce the release of Slurm on IBM Cloud.

This solution helps you set up an end-to-end high performance computing (HPC) system by using automated scripts from the public Git repository.

The Slurm Workload Manager software delivers powerful enterprise-class management for running compute-intensive and data-intensive distributed applications. The software is an open-source, fault-tolerant and highly scalable cluster management and job scheduling offering. It accelerates dozens of parallel applications for faster results and better utilization of all available resources. With Slurm Workload Manager, you can improve IT performance, reduce infrastructure costs and quickly meet business demands.

IBM already offers IBM Spectrum LSF Suites and IBM Spectrum Symphony on the Cloud, and Slurm Workload Manager will be the third scheduler.

Some of the key capabilities that Slurm Workload Manager offers include the following:

  • Allocate access to compute node resources for users to perform work.
  • Provide the framework for starting, executing and monitoring work on a set of allocated nodes.
  • Arbitrate contention for resources by managing a queue of pending work.
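To make the three capabilities above concrete, here is a minimal toy sketch in Python. It is not Slurm's actual algorithm or API: the `Job` class, the `schedule` function and the job names are hypothetical, and the policy is a strict first-in, first-out queue, whereas a real Slurm deployment also applies job priorities and backfill scheduling. The sketch only shows the idea of allocating free nodes to queued work and leaving the rest pending.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    nodes: int  # number of compute nodes the job requests (hypothetical field)


def schedule(jobs, total_nodes):
    """Strict FIFO: start jobs from the head of the queue while enough nodes are free."""
    pending = deque(jobs)
    free = total_nodes
    running = []
    # Stop at the first job that does not fit; jobs behind it keep waiting.
    while pending and pending[0].nodes <= free:
        job = pending.popleft()
        free -= job.nodes
        running.append(job.name)
    return running, [j.name for j in pending], free


running, pending, free = schedule(
    [Job("sim-a", 2), Job("sim-b", 3), Job("sim-c", 1)], total_nodes=4
)
print("running:", running)
print("pending:", pending)
print("free nodes:", free)
```

In this toy run, the second job needs more nodes than remain free, so it and the job behind it stay in the pending queue until resources are released. Letting a smaller job start ahead of a blocked one is the kind of refinement that a production scheduler adds on top of this basic queue.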

Get started with Slurm on IBM Cloud.

IBM delivers HPC value and experience

Fifty-five percent of the United States GDP of around $10 trillion is touched by high performance computing (HPC), in areas including industrial design, weather prediction, genomic research, vehicle crash simulation and drug discovery. Every industry (automotive, aerospace, electronics, financial sector, oil and gas, energy and utilities, life sciences and more) is running these compute-intensive workloads to optimize designs or predict business outcomes.

Other patterns that lend themselves well to HPC are serverless computing, analytics, big data, Hadoop and machine learning. At IBM, we have been using HPC on Cloud for semiconductor design and have scaled to 29,000 vCPUs with a 5X linear improvement. Understanding the nature of the workload (be it high throughput or parallel) is key, and IBM has been working with clients on HPC algorithm development and architecture design for the past 25 years to improve infrastructure utilization.

Considerations for HPC on the Cloud

A cloud vendor that provides an integrated solution out of the box (with compute instances, workload schedulers, storage management and high-speed data transfer) will be able to help solve your HPC problems. Buying these products à la carte from different vendors considerably increases deployment and support risk. Customers are looking for one-stop shopping, consolidated billing and a single point of support. The process should be fully automated: you input the appropriate configuration parameters, and the clusters are provisioned and all required software is installed automatically. This is a huge differentiator over the current way of doing things, and the setup can be completed in hours rather than days, dramatically improving time to market.

You may also want to operate in hybrid mode, which means running static or steady-state jobs on-premises and dynamic or burst jobs on the cloud. Any offering must support this with full automation. The cloud offering should charge you only for the capacity you use so that it is a true utility-based model. It should also support worldwide multi-zone regions, the highest level of encryption and security, disaster recovery and high availability capabilities.

The cloud provides instantaneous capacity to satisfy HPC peak loads and eliminates lengthy wait times, so you can perform multiple iterations of your simulations to achieve the best possible results.

Learn more

We encourage you to bring your applications to us, and we will guide you on the best approach for success.
