What is hyperscale?

12 March 2024

Authors

Ian Smalley

Senior Editorial Strategist

What is hyperscale?

Hyperscale is a distributed computing environment and architecture that is designed to provide extreme scalability to accommodate workloads of massive scale. The related term “hyperscaler” refers to hyperscale data centers, which are significantly larger than traditional on-premises data centers.

As is obvious to anyone working in IT, there are regular jobs and then there are projects whose order of magnitude exceeds normal needs. These supersized cases require extra handling and an expanded sense of proportion. In short, they need the special abilities enabled by hyperscale computing.

Hyperscale computing is the counterpart of using standard enterprise data centers. In hyperscale computing, companies build or help create large and almost infinitely scalable databases.

3D design of balls rolling on a track

The latest AI News + Insights 


Discover expertly curated insights and news on AI, cloud and more in the weekly Think Newsletter. 

From data centers to hyperscalers

Hyperscale technologies represent a sea change in how the daily flow and volume of data generated by businesses can be processed.

It may be difficult to remember, but there was a time when a company’s entire data center might have been run from a lone server within an office cabinet.

Then came hypervisors, whose use as abstraction layers let apps in virtual machines (VMs) be relocated from one physical hardware installation to another—a key moment in the rise of hyperscale data centers.

In most cases, the original, on-premises data centers of old simply cannot handle the volume of data now generated, especially that created by hyperscaled applications.

Hyperscale data centers (also called hyperscalers) occupy considerably larger physical space than traditional on-premises data centers, which have tended to be sized somewhere in the 10,000-square-foot range.

According to the IDC definition of a hyperscaler, to be considered a true hyperscaler, a company must use 5,000 servers or more and devote at least 10,000 square feet to the operation.

Hyperscale facilities are often multiples beyond that—with building sizes often realizing dimensions approaching 60,000 square feet—roughly the size of a regulation US football field.

And while that may be a typical hyperscale size, it’s not nearly the largest example of storage options. That distinction belongs to the big China Telecom data center located in Horinger, Hohhot, within China’s Inner Mongolia region. This facility, which cost USD 3 billion to construct, covers a staggering 10.7 million square feet and uses 150 megawatts of power. (To visualize a facility of this enormity, try to imagine the combined space of roughly 165 adjoined football fields.)

High Performance Computing

4th Gen Intel® Xeon® Scalable processors on IBM Cloud

Learn about Intel and IBM Cloud's commitment to the next generation of microarchitecture for the cloud industry.

How do hyperscalers work?

Hyperscalers, an outgrowth of hyperscale computing, are hyperscale data centers primarily used to deliver and manage mega-sized applications.

Hyperscalers are largely cut from the same cloth as traditional data centers—but just geared to a much larger scale than typical on-premises data centers. They do this by building and running an enormous hardware and software infrastructure in the hyperscaler facilities. Millions of servers distributed across many data centers provide seemingly endless storage and computing resources.

Since data traffic can fluctuate wildly—especially when running huge applications—hyperscalers accommodate that traffic and stabilize hyperscale operations when increased demand is exhibited within a computing environment. Hyperscalers do this by essentially serving as a form of load balancer, juggling tasks and re-directing computing resources as needed.

Benefits of hyperscale

Some companies aren’t really poised to take advantage of hyperscale technology, due to its cost of entrance and other associated costs. However, for most businesses, the positive aspects far outweigh those costs.

Managed data centers

Virtualization enables cloud computing, and CSPs host hyperscale data centers to accommodate the many uses of cloud computing and the data it generates. Meanwhile, the user is freed up from the many details and occasional operational headaches of running a data center onsite, and instead interacts with needed cloud resources through application programming interfaces (APIs).

Engineered for big jobs

Hyperscale infrastructure, which underpins hyperscale computing, fosters both high performance and redundancy, making hyperscale projects the perfect choice for cloud computing activities and big data processing.

Efficiency

Because it’s built expressly to efficiently handle hyperscale purposes, hyperscale infrastructure is able to increase the cost-effectiveness of operation, even when handling mammoth workloads.

Enhanced networking

It does little good to build what is essentially a server farm unless the servers there have strong, lightning-quick connectivity (with low latency), so those servers can communicate effectively with each other. Hyperscalers provide that.

Leading hyperscale providers

The hyperscale market is currently dominated by a “big five” of public cloud providers, with each of these cloud service providers (CSP) possessing their own strengths.

Amazon Web Services (AWS)

As of Q1 2023, AWS achieved a 32% market share, making it the largest provider of hyperscale cloud services. Its cloud-themed products focus on aspects like storage options, computing power, automation, databases and data analytics.

Microsoft Azure

Many companies are already consumers of one Microsoft product or another, so those businesses may have an established comfort level with the company and its offerings. That includes Microsoft’s enterprise software, which integrates nicely with Microsoft Azure, its hyperscale product. (Market share: 23%)

Google Cloud Platform (GCP)

Google rose to power and prominence through its mastery of data handling and businesses still seek out GCP (about 10% of the market), especially if those companies are eager to engage in advanced analytics and extend their footprint into artificial intelligence (AI).

IBM Cloud

With long-established technology bona fides, IBM Cloud leverages the company’s expertise in many areas, like AI and dealing with enterprise data centers. IBM Cloud delivers comprehensive services for Infrastructure-as-a-Service (IaaS), Platform-as-a-Service (PaaS) and Software-as-a-Service (SaaS).

Oracle Cloud Infrastructure (OCI)

OCI’s advantages include ease of migrating critical enterprise workloads and the ability to construct cloud-native apps. Another selling point for OCI has been its aggressively low pricing, with Oracle claiming to offer the same basic services as AWS at a fraction of the cost.

How do the “Big Five” differ?

The quick overview summary suggests that AWS excels because of its global access and advanced scalability. GCP is attractive to businesses who need top-level data management and want to make forays into capabilities like machine learning, and Microsoft’s Azure provides smooth product integration and enhanced security.

Beyond the top three competitors, IBM Cloud is sparking considerable interest with its current work with AI, which informs its hyperscale offerings. And Oracle’s been serving OCI up as a platform built to save money, and host cloud-native applications.

As is evident, each of these providers differs significantly in their approach and their particular areas of expertise, but they do share similarities. For example, the top three of these providers now offer cloud-native services supporting zero-trust network access (ZTNA) protocols that are designed to offer an alternative to VPNs, which can experience security vulnerabilities.

The economics of hyperscale

Different companies approach how they negotiate their hyperscale needs with different strategies. 

Not all businesses can afford the substantial financial investments needed to set up elaborate hyperscalers or wish to make such a financial commitment. Remember, we’re not simply talking about construction costs. There are massive equipment purchases to be considered as well.

It’s also been observed that some hyperscale facilities use more power than small cities. So, electricity costs and environmental concerns usually must also be factored into the company’s strategy about hyperscaler use.

As an alternative to purchasing equipment, some companies still engage in the practice of colocation, in which servers or other computing equipment are instead rented.

Hyperscalers have also made an impact when it comes to Internet of Things (IoT) devices and how they’re managed. The fact that hyperscale clouds are being used in conjunction with pre-existing equipment (in many instances) helps drive down the price tag of infrastructure investments for the IoT ecosystem—making it a better value for companies.

Related solutions
IBM Flashsystem

Optimize your HPC environment with IBM FlashSystem. Handle massive amounts of data efficiently with scalable, high-performance storage designed for compute-heavy workloads.

Explore FlashSystem
IBM Power Servers

IBM Power is a family of servers that are based on IBM Power processors and are capable of running IBM AIX, IBM i and Linux.

Explore Power
IBM Spectrum LSF Suites

IBM Spectrum LSF Suites is a workload management platform and job scheduler for distributed high performance computing (HPC).

Explore Spectrum LSF Suites
Take the next step

Power your most demanding AI and compute-intensive workloads with IBM’s HPC solutions. Leverage hybrid cloud flexibility and cutting-edge infrastructure to accelerate your path to innovation.

Explore HPC solutions Learn about hybrid cloud for AI