What's new in IBM Cloud Pak for Data?
See what new features and improvements are available in the latest release of IBM® Cloud Pak for Data.
December 2020 update of Version 3.0.1
Service | What's new |
---|---|
|
|
September 2020 update of Version 3.0.1
Red Hat® announced that Red Hat OpenShift® Container Platform Version 4.3 will be out of support on 22 October 2020.
Starting 1 September 2020, Cloud Pak for Data is introducing support for Red Hat OpenShift Container Platform Version 4.5 and deprecating support for Red Hat OpenShift Container Platform Version 4.3.
If you already installed Cloud Pak for Data on Red Hat OpenShift Container Platform Version 4.3, work with your IBM Support representative to migrate to Red Hat OpenShift Container Platform Version 4.5.
If you are ready to install Cloud Pak for Data Version 3.0.1 on Red Hat OpenShift Container Platform Version 4.5, ensure that you install the appropriate versions of Cloud Pak for Data services on your cluster. In addition, if you are using Portworx storage, ensure that you install Portworx Version 2.5.5. For details, see Setting up Portworx storage.
Service | Minimum version required for 4.5 support | Notes |
---|---|---|
Cloud Pak for Data control plane | 3.0.1 | The previously released version works on OpenShift 4.5. For details, see Installing IBM Cloud Pak for Data. |
Analytics Engine Powered by Apache Spark | cpd-3.0.1-spark-patch-3 | Install the patch after you install the 3.0.1 version of the service. For details, see https://www.ibm.com/support/pages/node/5693756. |
Cognos® Analytics | 3.2.2 | A new version of the service was released. If you have a previous version of the service installed, you must upgrade to the 3.2.2 version of the service before you upgrade your OpenShift installation. For links to installation and upgrade instructions, see Cognos Analytics. |
Cognos Dashboards | 3.0.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Cognos Dashboards. |
Data Refinery | See Watson Knowledge Catalog or Watson Studio. | Data Refinery is installed when you install either Watson Knowledge Catalog or Watson Studio. |
Data Virtualization | 1.4.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Data Virtualization. |
DataStage® | 11.7.1.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see DataStage. |
Db2® | 3.0.2 | A new version of the service was released. If you have a previous version of the service installed, you must upgrade to the 3.0.2 version of the service before you upgrade your OpenShift installation. For links to installation and upgrade instructions, see Db2. |
Db2 Big SQL | 6.0.0 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Db2 Big SQL. |
Db2 Data Gate | 1.1.0 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Db2 Data Gate. |
Db2 Event Store | 3.0.2 | A new version of the service was released. If you have a previous version of the service installed, you must upgrade to the 3.0.2 version of the service before you upgrade your OpenShift installation. For links to installation and upgrade instructions, see Db2 Event Store. |
Db2 for z/OS® Connector | 3.2.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Db2 for z/OS Connector. |
Db2 Warehouse | 3.0.2 | A new version of the service was released. If you have a previous version of the service installed, you must upgrade to the 3.0.2 version of the service before you upgrade your OpenShift installation. For links to installation and upgrade instructions, see Db2 Warehouse. |
Decision Optimization | 3.0.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Decision Optimization. |
Execution Engine for Apache Hadoop | 3.0.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Execution Engine for Apache Hadoop. |
Financial Crimes Insight® | 6.5.1 | If you are upgrading from OpenShift 4.3 to OpenShift 4.5, and Financial Crimes Insight is already installed, you can use your existing installation of Financial Crimes Insight. If you are installing Financial Crimes Insight on OpenShift 4.5, you must use the updated images. For details, see Financial Crimes Insight platform installation steps. |
Financial Services Workbench | 2.2.0 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see:
|
Guardium® External S-TAP® | 11.2.0 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Guardium External S-TAP. |
Jupyter Notebooks with Python for GPU | cpd-3.0.1-runtime-addon-gpupy36-patch-2 | Install the patch after you install the 3.0.1 version of the service. For details, see https://www.ibm.com/support/pages/node/5693522. |
Jupyter Notebooks with R 3.6 | cpd-3.0.1-runtime-addon-r36-patch-2 | Install the patch after you install the 3.0.1 version of the service. For details, see https://www.ibm.com/support/pages/node/5693516. |
Master Data Connect | 1.0.0.0 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Master Data Connect. |
MongoDB | 3.0.2 | A new version of the service was released. If you have a previous version of the service installed, you must upgrade to the 3.0.2 version of the service before you upgrade your OpenShift installation. For links to installation and upgrade instructions, see MongoDB. |
Open Source Management | 1.1.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Open Source Management. |
Planning Analytics | 3.0.1 | A new version of the service was released. If you have a previous version of the service installed, you must upgrade to the 3.0.1 version of the service before you upgrade your OpenShift installation. For links to installation and upgrade instructions, see Planning Analytics. |
Regulatory Accelerator | 3.0.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Regulatory Accelerator. |
RStudio® Server with R 3.6 | 3.0.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see RStudio Server with R 3.6. |
SPSS® Modeler | 3.0.2 | A new version of the service was released. If you have a previous version of the service installed, you must upgrade to the 3.0.2 version of the service before you upgrade your OpenShift installation. For links to installation and upgrade instructions, see SPSS Modeler. |
Streams | 5.3.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Streams. |
Streams Flows | 3.0.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see:
|
Virtual Data Pipeline | 8.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see:
|
Watson AIOps | 2.0.0 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson AIOps. |
Watson Assistant | 1.4.2 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Assistant. |
Watson Assistant for Voice Interaction | 1.0.6 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Assistant for Voice Interaction. |
Watson Discovery | 2.1.3 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Discovery. |
Watson Knowledge Catalog | 3.2.0 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Knowledge Catalog. |
Watson Knowledge Studio | 1.1.2 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Knowledge Studio. |
Watson Language Translator | 1.1.2 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Language Translator. |
Watson Machine Learning |
|
Install the appropriate patch for your environment after you install the 3.0.1 version of the service. For details, see https://www.ibm.com/support/pages/node/5693732. |
Watson OpenScale | 3.0.1 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson OpenScale. |
Watson Speech to Text | 1.1.4 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Speech to Text. |
Watson Studio | cpd-3.0.1-wsl-patch-2 | Install the patch after you install the 3.0.1 version of the service. For details, see https://www.ibm.com/support/pages/node/5693750. |
Watson Text to Speech | 1.1.4 | The previously released version works on OpenShift 4.5. For links to installation and upgrade instructions, see Watson Text to Speech. |
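The table above can be turned into a simple pre-upgrade check. The following is a minimal sketch, not an IBM tool: it assumes you have already collected the installed service versions yourself (the `installed` dictionary is hypothetical input), and it covers only the rows where a new dotted version was released. Patch-based rows such as Watson Studio are left out because they are identified by patch name and not by a comparable version number.

```python
from typing import Dict, List, Tuple

# Minimum service versions for OpenShift 4.5, copied from the table above.
# Only rows where "a new version of the service was released" are included.
MINIMUM_FOR_OCP_45: Dict[str, str] = {
    "Cognos Analytics": "3.2.2",
    "Db2": "3.0.2",
    "Db2 Event Store": "3.0.2",
    "Db2 Warehouse": "3.0.2",
    "MongoDB": "3.0.2",
    "Planning Analytics": "3.0.1",
    "SPSS Modeler": "3.0.2",
}


def parse_version(version: str) -> Tuple[int, ...]:
    """Convert a dotted version such as '3.0.2' into a comparable tuple."""
    return tuple(int(part) for part in version.split("."))


def services_needing_upgrade(installed: Dict[str, str]) -> List[str]:
    """Return installed services that are below the 4.5 minimum, sorted by name."""
    needing = []
    for service, minimum in MINIMUM_FOR_OCP_45.items():
        current = installed.get(service)
        if current is None:
            continue  # service is not installed, so nothing to upgrade
        if parse_version(current) < parse_version(minimum):
            needing.append(service)
    return sorted(needing)


if __name__ == "__main__":
    # Hypothetical inventory; replace with the versions on your own cluster.
    installed = {"Db2": "3.0.1", "Cognos Analytics": "3.2.2", "MongoDB": "3.0.1"}
    print(services_needing_upgrade(installed))
```

Any service that this check reports must be upgraded before you upgrade your OpenShift installation, as the Notes column states.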
What's new in Version 3.0.1
IBM Cloud Pak for Data Version 3.0.1 introduces support for Red Hat OpenShift Container Platform Version 4.5 and Red Hat OpenShift Container Storage so that you can deploy your applications on secure and scalable resources. The release also includes enhanced auditing capabilities, several new services, and numerous updates to existing services.
Platform enhancements
The following table lists the new features that were introduced in Cloud Pak for Data Version 3.0.1:
What's new | What does it mean for me? |
---|---|
A more useful home page | The Cloud Pak for Data home page has been completely redesigned:
|
New format for custom KPI cards | The custom cards REST API has been updated to provide new and updated templates for custom key performance indicators. The cards blend seamlessly with the new home page design and offer more ways to display your data. For details, see Creating custom cards. |
New look and feel | The Cloud Pak for Data platform has adopted Carbon, IBM's open source design system for digital products and experiences. Carbon makes it easier for you to know exactly how the web client will behave. With Carbon, you can spend less time learning how to use the platform and more time putting the services to use to accelerate your business. |
Customized for your company | Make Cloud Pak for Data a part of your business. You can add your own branding to the Cloud Pak for Data web client by:
For details, see Customizing the branding of the web client. |
Access to even more data | Augment your analytics and AI with external data sets. Cloud Pak for Data includes access to numerous data sets that can help you address common business problems. For example, there are data sets that help you use weather data to improve flight safety and data sets that help you analyze videos to determine whether employees are performing tasks correctly. Many of the data sets are included with your purchase of Cloud Pak for Data. However, some data sets are separately priced. You can use the external data to expand the capabilities of your models and applications. For details, see External data sets. |
Support for more types of storage | Cloud Pak for Data introduces support for the following types of storage:
For information on the types of storage that are supported with Cloud Pak for Data, see Storage considerations. In addition, many services include support for additional storage types. For information about which storage types are supported for each service, see System requirements for services. |
More ways to audit your system | Cloud Pak for Data offers additional mechanisms for auditing your environment. In addition to the existing IBM Guardium integration, which enables you to audit sensitive data on remote databases, you can also:
For details, see Auditing Cloud Pak for Data. |
Integration with IBM Cloud Platform Common Services | IBM Cloud Platform Common Services are optional, foundational services that can be shared by multiple products that are installed on the same Red Hat OpenShift cluster. Cloud Pak for Data Version 3.0.1 includes support for the following common services:
For details, see Integrating with Cloud Platform Common Services. |
More precise administrative permissions | Cloud Pak for Data includes additional administrative permissions that enable you to give more specific access to administrative users. You can now give users one or more of the following permissions:
If you give a user all of these permissions, it is equivalent to giving the user the Administer platform permission. For details, see Permissions. |
Backup and restore utility | Ensuring that your IBM® Cloud Pak for Data system is prepared for loss of data or unplanned downtime is one of the most important steps you can take. The cpdbr utility enables you to back up and restore persistent volumes that are associated with Cloud Pak for Data:
For details, see Backing up and restoring your project. |
Translated interfaces | Many of the services that are included with Cloud Pak for Data are now available in the following languages:
Several services are also available in Russian and Korean. For details, see Language support. |
Smarter global search | When you perform a search in the global search bar, you now see machine-learning infused search suggestions and search results that are based on relevancy. |
Simplified packaging |
|
Service enhancements
The following table lists the new features that are introduced for existing services in Cloud Pak for Data Version 3.0.1:
What's new | What does it mean for me? |
---|---|
Data Refinery |
|
Data Virtualization |
|
DataStage |
|
Decision Optimization |
|
Db2 |
|
Db2 Warehouse |
|
Execution Engine for Apache Hadoop |
|
Financial Crimes Insight | The latest version of the Financial Crimes Insight service includes:
|
Open Source Management |
For details, see Open Source Management. |
RStudio Server with R 3.6 |
|
SPSS Modeler |
|
Streams |
|
Watson AIOps AI Manager |
Watson AIOps 2.0.0 is now composed of the following components: AI Manager, Metric Manager, Event Manager, and Topology. AI Manager for Watson AIOps 2.0.0 now supports application groups. Application groups are a means for isolating streams of data from one another. You can now group multiple SRE groups alongside multiple internal clients (for example, several business units) all within a single cluster that runs Watson AIOps 2.0. Not only does it simplify your IT footprint, but it also significantly reduces your IT costs because you need to provision only a single instance to share across many application groups.
|
Watson Knowledge Catalog |
|
Watson Machine Learning |
|
Watson OpenScale |
|
Watson Studio |
Many of the services that supplement Watson Studio also have new features. For details, see the rows for those services. |
New services
The following table lists the new services that are introduced in Cloud Pak for Data Version 3.0.1:
Category | Service | Pricing | What does it mean for me? |
---|---|---|---|
Analytics | Planning Analytics | Separately priced | Easily create more accurate plans, budgets, and forecasts using data from across your business. A good plan starts with good data. Ensure that your plans are based on data from across your business with IBM Planning Analytics powered by TM1. Planning Analytics is an AI-infused solution that pulls data from multiple sources and automates the creation of plans, budgets, and forecasts. Planning Analytics integrates with Microsoft Excel so that you can continue to use a familiar interface while moving beyond the traditional limits of a spreadsheet. Infuse your spreadsheets with more analytical power to build sophisticated, multidimensional models that help you create more reliable plans and forecasts. The Planning Analytics service includes:
Learn more
|
Data governance | Guardium External S-TAP | Included with Cloud Pak for Data |
IBM Guardium External S-TAP is a component of Guardium that works with databases that are hosted on Cloud Pak for Data. The service provides compliance monitoring and data security. You can install and configure the External S-TAP service in high-availability mode to intercept TCP/IP traffic (plain-text or encrypted) between Cloud Pak for Data users and database services. The intercepted traffic is sent to the Guardium collector for parsing, policy enforcement, logging, and reporting. To use the External S-TAP service, you must be entitled to use IBM Guardium Data Protection. Learn more
|
Data governance | Master Data Connect | Included with Cloud Pak for Data |
Power your business applications with trusted master data. Master data is the high-value, core information that supports critical business processes across your enterprise. Master data is at the heart of every business transaction, application, and decision. IBM InfoSphere Master Data Management (InfoSphere MDM) acts as a central repository to manage, store, and maintain master data across your organization.
The Master Data Connect service uses a RESTful API to provide geographically distributed users with fast, scalable, and concurrent access to your organization's most trusted master data from IBM InfoSphere MDM. By making your trusted master data available to business applications, you can capitalize on the benefits that master data brings. Users and systems can use the Master Data Connect API to access and search master data, enabling your mobile and online applications to access trusted master data in a timely and efficient way. For example, you can improve your sales processes by using Master Data Connect as a real-time provider of master data for Salesforce.com. Learn more
|
Data source | Db2 Big SQL | Included with Cloud Pak for Data |
Use standard SQL to query your data on Hadoop or object stores with Db2 Big SQL. Db2 Big SQL is an advanced query service that makes it easy to analyze data in object stores or Hadoop using ANSI SQL that is optimized for advanced analytics in big data environments. Powerful Db2 open source technologies are the prime driver for machine learning, interactive, ad hoc, and batch analytics use cases on open source file formats stored on Hadoop and object stores. The Db2 Big SQL service offers the following features:
With data sizes ranging from gigabytes to petabytes, business analysts or data scientists run interactive queries to explore and understand data before building models or charts. With its robust scalability and performance, Db2 Big SQL empowers users and applications to unlock insights from data with the analytics tools of your choice, while achieving high concurrency for business intelligence workloads by running complex queries more efficiently. Learn more
|
Data source | Db2 Data Gate | Included with Cloud Pak for Data | Extract, load, and synchronize mission-critical Db2 for z/OS data for high volume transactional or analytic applications. The service propagates your Db2 for z/OS data to a Db2 Warehouse or Db2 database on Cloud Pak for Data. Through its high throughput and low latency synchronization technology, the service provides:
Learn more
|
Data source | Virtual Data Pipeline | Separately priced | Access all the data you need for analytics and application testing without impacting production databases. Your production databases are critical for running your business, so you don’t want to overload them with too many requests. At the same time, your users need access to that data to drive business results. With IBM InfoSphere Virtual Data Pipeline, your users can instantly provision virtual database copies that they can use to work with near real-time data for:
Each virtual database copy can be refreshed to any point in time in a matter of minutes and can be masked to protect sensitive data. In addition, virtual database copies use almost no storage, so you save on storage costs. Give your users access to production data without impacting priority workloads or compromising data security and privacy. Get started with the Virtual Data Pipeline service to accelerate your analytics and modernize your applications. Learn more
|
Developer tools | Anaconda Repository with IBM Cloud Pak® for Data | Separately priced | Control and administer the software packages that data scientists can use in Jupyter notebooks and JupyterLab in Watson Studio analytics projects. Data scientists in analytics projects can create custom environment definitions that include the conda channels and packages from the repository and then use those environments to run Jupyter notebooks and scripts. With the Anaconda Repository with IBM Cloud Pak for Data service, you can access more than 7,500 open-source packages (Conda-Forge, CRAN, PyPI) from your central enterprise repository and add your own proprietary packages. Get Conda package updates in real time, as they are released. Block, exclude, and include packages according to your enterprise standards. Control which packages your team can download and who can access them. Keep vulnerabilities and unreliable software out of your data science and machine learning pipeline and manage dependent packages. Learn more
|
Industry accelerators
Industry accelerators are available from the Cloud Pak for Data community. Each industry accelerator includes a set of artifacts that help you address common business issues. The following accelerators were recently released for Cloud Pak for Data:
What's new | What does it mean for me? |
---|---|
Demand Planning | Manage thermal systems to produce accurate energy volumes based on anticipated demand and energy generation. For details, see Demand planning on the Cloud Pak for Data Community. |
Manufacturing Analytics with Weather (using SPSS and Cognos) | Use machine-learning models and The Weather Company data to understand the impact that weather has on failure rate. Identify actions that you can take to save time and money. For details, see Manufacturing Analytics with Weather on the Cloud Pak for Data Community. |
Retail Predictive Analytics with Weather (using SPSS and Cognos) | Use machine-learning models and The Weather Company data to understand how a retail inventory manager, marketer and retail sales planner can quickly determine the optimal combination of store, product, and weather conditions to maximize revenue uplift, know what to keep in inventory, where to send a marketing offer, or provide a future financial outlook. For details, see Retail Predictive Analytics with Weather (using SPSS and Cognos) on the Cloud Pak for Data Community. |
Sales Prediction using The Weather Company Data | Use machine-learning models and The Weather Company data to predict how weather conditions impact business performance, such as prospective sales. For details, see Sales Prediction using The Weather Company Data on the Cloud Pak for Data Community. |
Telco Churn | Predict a given customer's propensity to cancel their membership or subscription and recommend promotions and offers that may help retain the customer. For details, see Telco churn on the Cloud Pak for Data Community. |
Utilities Customer Attrition Prediction | Discover why your customers are leaving. For details, see Utilities Customer Attrition Prediction on the Cloud Pak for Data Community. |
Utilities Customer Micro Segmentation | Divide a company's customers into small groups based on their lifestyle and engagement behaviors. For details, see Utilities Customer Micro Segmentation on the Cloud Pak for Data Community. |
Utilities Demand Response Program Propensity | Identify which customers should be targeted for enrollment in the Demand Response Program. For details, see Utilities Demand Response Program Propensity on the Cloud Pak for Data Community. |
Utilities Payment Risk Prediction | Identify which customers are most likely to miss their payment this billing cycle. For details, see Utilities Payment Risk Prediction on the Cloud Pak for Data Community. |
Offering packages
Cloud Pak for Data Version 3.0.1 introduces new offering packages, each of which includes the Cloud Pak for Data entitlements that are required to run the service. The following packages are available in this release:
What's new | What does it mean for me? |
---|---|
IBM Cloud Pak for Data Planning Analytics | The Planning Analytics service is included in the IBM Cloud Pak for Data Planning Analytics bundle. For more information, see the description of the new Planning Analytics service. |
IBM Cloud Pak for Data Virtual Data Pipeline | The Virtual Data Pipeline service is included in the IBM Cloud Pak for Data Virtual Data Pipeline bundle. For more information, see the description of the new Virtual Data Pipeline service. |
Installation enhancements
What's new | What does it mean for me? |
---|---|
Support for Red Hat OpenShift Version 4.3 | Cloud Pak for Data Version 3.0.1 can run on either:
For more information about Red Hat OpenShift Container Platform, see System requirements for IBM Cloud Pak for Data. If you are currently running Cloud Pak for Data Version 2.5 on Red Hat OpenShift Container Platform Version 3.11 and want to migrate to Cloud Pak for Data Version 3.0.1 on Red Hat OpenShift Container Platform Version 4.3, see Migrating Cloud Pak for Data data from Red Hat OpenShift Version 3.11 to Version 4.5. |
Support for IBM POWER hardware | You can install the Cloud Pak for Data control plane and some services on IBM POWER hardware. For a list of the services that you can install on IBM POWER hardware, see hardware requirements in the System requirements for services topic. For more information about planning your installation, see:
|
Upgrade | You can run the cpd command-line interface to upgrade the Cloud Pak for Data control plane and many of the services that support the cpd command-line interface.
|
Deprecated features
What's changed | What does it mean for me? |
---|---|
Monthly virtual core usage and targets | You can no longer use the Manage platform page in the web client to track the number of virtual cores that you use each month. This feature was deprecated because it did not give an accurate count of the number of virtual cores used each month. Because this feature was deprecated, the related feature that allowed you to set your target usage was also deprecated. |
Object Storage Open Stack Swift (Infrastructure) connections |
Impacted services:
You can no longer create connections to Object Storage Open Stack Swift (Infrastructure) from Watson Studio or Watson Knowledge Catalog. This type of connection is deprecated in Cloud Pak for Data Version 3.0.1. If your project contains a connection to Object Storage Open Stack Swift (Infrastructure), the connection will no longer work. |
Python and SPSS operators |
Impacted services:
The Python and SPSS operators are no longer supported in Streams Flows. The WML Deployment operator replaces both of these operators. For details on fixing flows that contain these operators, see Troubleshooting a streams flow. |