Table of contents

Overview of IBM Cloud Pak for Data

IBM® Cloud Pak for Data is a cloud-native solution that enables you to put your data to work quickly and efficiently.

Your enterprise has data. Lots of data. You need to use your data to generate meaningful insights that can help you avoid problems and reach your goals.

But your data is useless if you can't trust it or access it. Cloud Pak for Data lets you do both by enabling you to connect to your data, govern it, find it, and use it for analysis. Cloud Pak for Data also enables all of your data users to collaborate from a single, unified interface that supports a large number of services that are designed to work together.

Cloud Pak for Data fosters productivity by enabling users to find existing data or to request access to data. With modern tools that facilitate analytics and remove barriers to collaboration, users can spend less time finding data and more time using it effectively.

And with Cloud Pak for Data, your IT department doesn't need to deploy multiple applications on disparate systems and then try to figure out how to get them to connect.

Run anywhere

Cloud Pak for Data can run on your Red Hat OpenShift cluster, whether it's behind your firewall or on the cloud.
In the cloud
If you have an OpenShift deployment on IBM Cloud, AWS, Microsoft Azure, or Google Cloud, you can deploy Cloud Pak for Data on your cluster.
On premises
Prefer to keep your deployment behind a firewall? You can run Cloud Pak for Data on your private, on-premises cluster.

If most of your enterprise data lives behind your firewall, it makes sense to put the applications that access your data behind your firewall to prevent accidentally sharing your data.

Connect to data anywhere

Regardless of where you deploy Cloud Pak for Data, you can connect to your data no matter where it lives.
  • Private cluster accessing data on the cloud? You're covered.
  • Running in an air-gapped environment? As long as you can connect to your data sources, that works.
  • Running on IBM Cloud and accessing data in your on-premises database? Not a problem.

Ready for AI

To be competitive and successful, your enterprise must leverage the power of artificial intelligence.

Cloud Pak for Data helps you climb the AI ladder by providing a suite of services that support you in your journey to AI.

Collect
Cloud Pak for Data helps you connect to your data, no matter where it lives. Cloud Pak for Data includes a Connections page that lists connections that can be used by multiple services. Some services support additional data sources that you can connect to from the service. The platform makes it simple to access your data.
Organize
The Watson™ Knowledge Catalog service helps you organize your data through data classification and governance. With the Watson Knowledge Catalog service, you can develop an information architecture that is on-point and ready to keep up with the scale of your data.
Analyze
Cloud Pak for Data also includes numerous analytics services that can help you generate scalable insight on demand. For example, with Cloud Pak for Data you can use:
  • Analytics Dashboards, which enables you to create stunning dashboards to quickly visualize data
  • Streams, which enables you to build solutions that drive real-time decisions by combining streaming and stored data with analytics
  • SPSS® Modeler (premium service), which enables you to create flows to prepare and blend data, build and manage models, and visualize the results
Infuse
With Cloud Pak for Data you can make AI a part of your standard operating procedure. Whether you want to build smarter apps with premium Watson services, deploy machine learning models into production at scale with Watson Machine Learning, or infuse your AI with trust and transparency with Watson OpenScale, which enables you to understand how your AI models make decisions and to detect and mitigate bias.

There are many more services that you can install on Cloud Pak for Data. For a complete list, Services in the catalog.

With Cloud Pak for Data, raw data becomes trusted data that you can analyze to gain insights and maximize business outcomes.

Support for your data lifecycle

Your data isn't static. Your machine learning models shouldn't be static either. As data is added to your on-premises and cloud data sources, you need to continually test and tune your machine learning models to ensure that they give you valuable insight. But you need to make sure that you're working with high-quality data, which is where the data governance and data integration and preparation services that you can install on Cloud Pak for Data come in.

You know the old adage: Garbage in, garbage out. If your data is poor, your results aren't meaningful. By bringing data stewards and data engineers together with your data scientists, you can ensure that your data is ready for analysis.

Additionally, you can ensure that any analytics assets that your data scientists create, such as models, notebooks, and Shiny apps are included in a data catalog so that they can be governed and maintained like any other data assets in your enterprise.

With Cloud Pak for Data, you can continuously discover new, valuable insights as data is added to your ecosystem.

Modern and modular

Cloud Pak for Data provides a modern data and analytics architecture that is elastic, scalable, and reliable. The end-to-end platform means that you can spend less time managing your data and more time using it to grow your business.

You can choose which services you install on top of Cloud Pak for Data, so that you can use your resources wisely. Whether you want to modernize your data landscape, generate real-time insights to drive business transformations, or deliver exceptional, AI-augmented customer experiences, Cloud Pak for Data has a solution that can propel your business forward.

If you want to become a data-driven enterprise, Cloud Pak for Data should be at the center of your data and analytics ecosystem.

Choose the right edition for your needs

There are two editions of Cloud Pak for Data that you can choose from:
  • Enterprise Edition
  • Cloud Native Edition

Both editions include the same features; however, Cloud Native Edition places limits on the number of virtual processor cores (VPCs) that you can have in your cluster. For specific information on the limits, contact IBM Sales.