How IBM Spectrum Scale resembles my toddler twins

Share this post:

I’m a storage consultant in IBM Systems Lab Services and a working mother of two-and-a-half-year-old twins who were affectionately known as Flash and Elastic during my pregnancy because of the way they moved in my belly.

I usually spend my days onsite with IBM clients implementing and managing IBM Spectrum Scale clusters, but I also like to think that I have my own two-node cluster at home. In fact, I’ve realized that IBM Spectrum Scale and my twins have quite a few common characteristics.

Just to make sure everyone’s on the same page: IBM Spectrum Scale is a scalable, high-performance data and file management solution (based upon IBM General Parallel File System, or GPFS, technology) that’s being used across industries worldwide. IBM Spectrum Scale provides world-class storage management with extreme scalability, flash-accelerated performance and automatic policy-based storage tiering from flash through disk to tape and even to the cloud. It can reduce storage costs up to 90 percent while improving security and management efficiency in cloud, big data and analytics environments.

Big data processing

IBM Spectrum Scale provides simplified data management and integrated information lifecycle tools capable of managing petabytes of data and billions of files, in order to help organizations manage the growing cost of dealing with ever-increasing amounts of data.

Here’s where the similarities to my twins begin. For example, my kids are also capable of efficiently managing petabytes of data; they learn new things every day and have seemingly endless storage space.

Information lifecycle management

With IBM Spectrum Scale, using the automatic policy-based storage tiering for information lifecycle management (ILM), a storage administrator can set policies and rules to place data in the correct storage pool as well as migrate data between storage pools based on business needs and rules. With ILM, only the data that really needs high performance can land in the flash disks, while the remaining data can stay in a less expensive storage disk or even move to an external pool like tape. This can be accomplished by defining policies; for example, it could migrate files that haven’t been accessed for more than six months to second-tier storage or tape, or even delete them if no longer required for the business. Or, it could avoid writing a particular type of data to a file system, such as putting no MP3 files on a corporate file server.

At home, I also manage my twins with ILM. From Monday to Friday, I migrate the kids from bed to daycare. If their shoes or clothes don’t fit after three months, I migrate those to an external pool, which is my friend who has triplets (yes, life can be harder). ILM also helps me define rules to manage which cartoons they can watch. And if a toy hasn’t been accessed for more than six months, it gets deleted from home, by which I mean donated.

Easy management

The IBM Spectrum Scale management GUI provides an easy way to configure and manage various features that are available with the system, so it simplifies data management of petabytes of storage and all nodes from a cluster in a centralized way.

My two-node cluster is also easy to manage… well actually, not so much, so I’ll skip making a comparison on this one.

Active-active clustering

Another thought is the cluster itself. With Spectrum Scale, it’s a real active-active cluster. All data access is done in parallel through the nodes. If a node goes offline, access to the data is still available and functions are undertaken by the other nodes.

The same thing happens at home if I issue the “mmshutdown” command to one of my nodes, by which I mean put one of my kids to sleep. The awake node will remain active, playing and making all the mess that would’ve been done if both nodes were up.


And last but not least, as the name says, IBM Spectrum Scale can scale. It supports up to 256 file systems of 18 PB each in a single cluster, and it can have up to 16,384 nodes connected per cluster.

When it comes to my home though, two kids are enough for me and my husband to manage, and we are not looking to scale up!

To learn more about IBM Spectrum Scale, check out the website. And if you’re looking for proven experience on designing and building storage and softwaredefined infrastructure systems, reach out to IBM Systems Lab Services Storage and Software Defined Infrastructure. We can help you efficiently capture, deliver, manage and protect your data with superior performance. Email us to learn more.

More Services stories

IBM is recognized as a “Leader” in 2019 Gartner Magic Quadrant for Data Center Backup and Recovery Solutions

Cloud object storage, Data security, Storage

IBM has been recognized as a Leader in the 2019 Gartner Magic Quadrant for Data Center Backup and Recovery Solutions. This is the eighth report in a row, and 13 out of 14 years lifetime, IBM has been recognized as a Leader in the Gartner Magic Quadrant for Data Center Backup and Recovery Solutions for both its completeness of more

Address customer data privacy and protection concerns with encryption everywhere

Data centers, Data security, Mainframes

Data privacy and protection remain top boardroom and C-suite issues. The data breach threat still looms large: 59 percent of businesses experienced a data breach caused by a vendor or third party in 2018[1]. As organizations migrate workloads to hybrid multicloud environments, they must ensure that the data within these environments is effectively protected. Consumers more

On your journey to hybrid multicloud, you’ll encounter new storage challenges

Hybrid cloud, Multicloud, Storage

This is part three in a 6-part series on storage for hybrid multicloud. In Part 1 I set the stage – getting us all on the same page with the terminology of hybrid multicloud. In Part 2 we explored what’s behind this rapid industry shift toward hybrid multicloud infrastructure. In Part 3 we’re going to more