We are excited to announce the release of Slurm on IBM Cloud.

This solution helps you set up an end-to-end high performance computing (HPC) system by using automated scripts from the public Git repository.

The Slurm Workload Manager software delivers powerful enterprise-class management for running compute-intensive and data-intensive distributed applications. It is an open-source, fault-tolerant and highly scalable cluster management and job scheduling system. It accelerates dozens of parallel applications for faster results and better utilization of all available resources. With Slurm Workload Manager, you can improve IT performance, reduce infrastructure costs and quickly meet business demands.

IBM already offers IBM Spectrum LSF Suites and IBM Spectrum Symphony on the Cloud, and Slurm Workload Manager will be the third scheduler.

Some of the key capabilities that Slurm Workload Manager offers include the following:

  • Allocate access to compute node resources for users to perform work.
  • Provide the framework for starting, executing and monitoring work on a set of allocated nodes.
  • Arbitrate contention for resources by managing a queue of pending work.
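
As a rough illustration of how these capabilities look to a user, the following minimal Python sketch renders a Slurm batch script from a small job description. The `#SBATCH` directives and the `srun` launcher are standard Slurm; the `JobSpec` class, the `render_batch_script` function and the `./my_mpi_app` command are hypothetical names used only for this example and are not part of the IBM Cloud offering.

```python
from dataclasses import dataclass


@dataclass
class JobSpec:
    """Hypothetical container for the resources a job requests."""
    name: str
    nodes: int
    tasks_per_node: int
    time_limit: str  # Slurm time format, for example "00:30:00"
    command: str     # placeholder for the application to launch


def render_batch_script(spec: JobSpec) -> str:
    """Render a batch script that uses standard sbatch directives."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={spec.name}",
        f"#SBATCH --nodes={spec.nodes}",                     # nodes to allocate
        f"#SBATCH --ntasks-per-node={spec.tasks_per_node}",  # tasks on each node
        f"#SBATCH --time={spec.time_limit}",                 # wall-clock limit
        "",
        f"srun {spec.command}",  # launch the tasks on the allocated nodes
    ]
    return "\n".join(lines) + "\n"


# Assumed example values; adjust them to your own cluster and application.
spec = JobSpec("demo", 2, 4, "00:30:00", "./my_mpi_app")
script = render_batch_script(spec)
print(script)
```

On a cluster where Slurm is installed, a script like this would typically be saved to a file and submitted with `sbatch`. The job then waits in the queue until the requested nodes are free, and `squeue` shows its state.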

Get started with Slurm on IBM Cloud.

IBM delivers HPC value and experience

Fifty-five percent of the United States GDP of around $10 trillion is touched by high performance computing (HPC), including industrial design, weather prediction, genomic research, vehicle crash simulation and drug discovery. Every industry (automotive, aerospace, electronics, financial services, oil and gas, energy and utilities, life sciences and more) is running these compute-intensive workloads to optimize designs or predict business outcomes.

Other patterns that lend themselves well to HPC are serverless computing, analytics, big data, Hadoop and machine learning. At IBM, we have been using HPC on Cloud for semiconductor design and have scaled to 29,000 vCPUs with a 5X linear improvement. Understanding the nature of the workload (be it high throughput or parallel) is key, and IBM has been working with clients on HPC algorithm development and architecture design for the past 25 years to improve infrastructure utilization.

Considerations for HPC on the Cloud

A cloud vendor that provides an integrated solution out of the box, with compute instances, workload schedulers, storage management and high-speed data transfer, will be able to help solve your HPC problems. Buying these products à la carte from different vendors considerably increases deployment and support risk. Customers are looking for one-stop shopping, consolidated billing and a single point of support. The process should be fully automated: you supply the appropriate configuration parameters, and the clusters are provisioned and all required software is installed automatically. This is a huge differentiator over the current way of doing things, and the setup can be completed in hours rather than days, dramatically improving time to market.

You may also want to operate in hybrid mode, which means running static or steady-state jobs on-premises and dynamic or burst jobs on the cloud. Any offering must support this with full automation. The cloud offering should charge you only for the capacity you use so that it is a true utility-based model. It should also support worldwide multi-zone regions, the highest level of encryption and security, disaster recovery and high availability capabilities.

The cloud provides instantaneous capacity to satisfy HPC peak loads and eliminates lengthy wait times, so you can perform multiple iterations of your simulations to achieve the best possible results.

Learn more

We encourage you to bring your applications to us, and we will guide you on the best approach for success.

