The popularity and wide use of weather forecasts has been largely attributable to the dramatic improvement in forecast accuracy. Such improvements have been quantified in recent research showing that modern 5-day weather forecasts are as accurate as 1-day forecasts in 1980. Disease forecasts are not nearly as accurate as modern weather forecasts, as documented in ongoing evaluations of COVID-19 forecast models. So, what can we learn from weather forecasting that might help us develop more robust disease forecasting and outbreak predictions?

Dr. Dylan George, head of CDCs Center for Forecasting and Outbreak Analytics (CFA) describes how disease forecasting can follow the lead of weather forecasting:

“We use weather forecasts to pre-position resources for hurricanes and to determine if we need an umbrella on a rainy day. We can use disease forecasts to determine how much vaccine we need to manufacture or if we should wear a mask that day to go out. Better data and better analytics will definitely generate better responses to health emergencies.”

As the leading provider of weather data and analytics, we at IBM believe Dr. George offers a compelling vision. 

More data sources lead to greater accuracy

An explosion in the volume and variety of weather data has enabled dramatic improvements in forecast accuracy. Whereas fifty years ago, weather data was mostly confined to temperature, barometric and other readings taken at scattered weather stations, weather station data today is augmented with data from a growing network of satellites, remote sensors, radar stations, weather balloons and other sources. 

Today, disease surveillance data is still largely confined to case reports from health clinics and hospitals, although the variety and volume of data has been growing. Syndromic and wastewater surveillance data are adding to traditional case reporting as a means to monitor community infection. And non-traditional data sources (like internet search trends and social media user surveys) offer the potential to obtain more real-time and hyperlocal information. 

To make progress toward better disease forecasting, the volume and variety of disease surveillance data will need to continue growing. Public health investments need to focus on seeding and growing these new data sources for disease surveillance. And following the experience in weather forecasting, additional investment will be needed to harmonize these disparate data sources into a unified spacio-temporal view of community infection.

Learn more about how data strategies deliver insights to the public

Innovative modeling enables advanced disease surveillance

Advances in weather modeling and simulation—enabled by breakthroughs in machine learning and exponential growth in computing power—have been a key factor enabling improved weather forecasting.  In the 1970s, weather forecasts mostly relied on numerical weather prediction methods. These days, methods are augmented with machine learning algorithms that enable accurate prediction of storm events and paths. For example, the Weather Company generates the most accurate publicly available weather forecasts, leveraging the IBM GRAF machine learning algorithms for weather prediction.

Today, disease forecasting largely relies on long-standing SIR-based—Susceptible, Infectious, Recovered—epidemiological models, although recent COVID-19 modeling has begun to incorporate more advanced machine learning algorithms, with improvements in forecast accuracy. Recent developments like the CDC’s Epidemic Prediction Initiative show promise, and the CDC CFA is investing in continued innovation to improve disease forecasting in the United States.

Continued progress in developing innovative modeling techniques will be important for achieving the vision of robust disease forecasting and outbreak predictions. Public health authorities, university researchers and private corporations can productively partner to help advance the application of advanced analytics to disease surveillance. IBM’s engagement with the Rhode Island Department of Health is a good example of what can be accomplished through public-private collaboration. IBM collaborated with RIDOH and Brown University epidemiologists to develop smart ensembles of multiple COVID-19 models for more accurate pandemic forecasts, providing 95% accuracy in forecasting the large omicron outbreak in January 2022. Our collaboration continues today with the application of machine learning to infer community infection from syndromic surveillance and wastewater surveillance data.

Modern platforms will deliver data and insights to the public

As more data and better modeling dramatically improved the accuracy of weather forecasting, a robust technology infrastructure emerged to enable high speed data processing, modeling updates and easy access to actionable insights. While weather forecasts used to be largely distributed daily through newspapers, radio and television, they’re now available on demand through the internet and mobile applications, and updated multiple times per day as conditions evolve. The ubiquity of this information enables people throughout the world to adjust plans and behaviors to minimize weather-related property damage and fatalities.

Disease forecasts, however, are not readily available to the public, as COVID-19 forecasts are only accessible on the internet to those who know where to find them. We can see the beginnings of a modern data and analytics platform to support disease surveillance, enabling automated data processing and modeling. But much progress is still needed in the public dissemination of actionable insights. One can imagine a future where infectious disease warnings are as readily available as hazardous weather warnings, enabling people to adjust plans and behaviors to minimize morbidity and mortality related to infectious disease.

To achieve that future, public health authorities need to invest in modern platforms to process data, generate actionable insights and disseminate those insights to the public. The CDC’s Data Modernization Initiative and associated grant funding to states and localities is a good start. Such funding enables public-private collaboration to jumpstart public health data modernization. A good example of a successful public-private partnership is IBM’s collaboration with Canadian and other public health authorities to develop and deploy a modern public health data platform.   

Research shows that more accurate weather forecasting has saved lives and generated economic benefits exceeding required investments. Similar investments to improve the accuracy and availability of disease forecasts would also save lives and significantly reduce the economic burden of unmitigated infectious disease outbreaks.

Connect with IBM experts to unlock your data’s potential Learn more about driving data democratization with modern architecture


More from Business transformation

Unleashing the power of Presto: The Uber case study

7 min read - The magic behind Uber's data-driven success Uber, the ride-hailing giant, is a household name worldwide. We all recognize it as the platform that connects riders with drivers for hassle-free transportation. But what most people don't realize is that behind the scenes, Uber is not just a transportation service; it's a data and analytics powerhouse. Every day, millions of riders use the Uber app, unwittingly contributing to a complex web of data-driven decisions. This blog takes you on a journey into…

“Teams will get smarter and faster”: A conversation with Eli Manning

3 min read - For the last three years, IBM has worked with two-time champion Eli Manning to help spread the word about our partnership with ESPN. The nature of that partnership is pretty technical, involving powerful AI models—built with watsonx—that analyze massive data sets to generate insights that help ESPN Fantasy Football team owners manage their teams. Eli has not only helped us promote awareness of these insights, but also to unpack the technology behind them, making it understandable and accessible to millions.…

Generative AI as a catalyst for change in the telecommunications industry

4 min read - Generative artificial intelligence (AI) burst into the mainstream in 2023, lighting a fire under businesses to integrate enterprise-grade versions into their processes. By 2024, 60% of C-suite executives are planning to pilot or operate generative AI in some way, indicating that generative AI's public-facing platforms have awakened the world to its groundbreaking capabilities For Communications Service Providers (CSPs) and Network Equipment Providers (NEPs), in particular, generative AI holds tremendous potential to help improve all manner of operations and customer engagement.…

iFoodDS and IBM forge new path to food safety with IBM Food Trust™

4 min read - Picture this: You're at your local supermarket, eagerly exploring the fresh produce section. You carefully select a carton of ripe, juicy fresh-cut strawberries, envisioning them as the star ingredient in your weekend's mouthwatering desserts. You're all set to enjoy a delightful culinary adventure. But as you savor your first bite of a luscious strawberry shortcake, you receive a notification on your smartphone. It's breaking news: a food recall alert! Panic ensues as you wonder if those very strawberries are part…