What's new and changed in Data Virtualization

The Data Virtualization release and subsequent refreshes can include new features, bug fixes, and security updates. Refreshes appear in reverse chronological order, and only the refreshes that contain updates for Data Virtualization are shown.

You can see a list of the new features for the platform and all of the services at What's new in IBM Cloud Pak for Data?

Installing or upgrading Data Virtualization

Ready to install or upgrade Data Virtualization?

Related documentation:

Initial release of Cloud Pak for Data Version 3.5

A new version of Data Virtualization was released as part of Cloud Pak for Data Version 3.5.

Assembly version: 1.5.0

This release includes the following changes:

New features

Improve query performance by using cache recommendations

If your queries take a long time to run but your data doesn't change constantly, you can cache the results of queries to make your queries more performant. Data Virtualization analyzes your queries and provides cache recommendations to improve query performance.

For details, see Cache recommendations.

Optimize query performance by using distributed processing

Data Virtualization can determine the optimal number of worker nodes required to process a query. The number of worker nodes is determined based on the number of data sources connected to the service, available service resources, and the estimated size of the query result.

Manage your virtual data by using Data Virtualization APIs

With the Data Virtualization REST API, you can manage your virtual data, data sources, and user roles. Additionally, you can use the API to virtualize and publish data to the catalog.

For details, see Data Virtualization REST API.

Governance and security enhancements for virtual objects

When Watson™ Knowledge Catalog is installed, you can use policies and data protection rules from Watson Knowledge Catalog to govern your virtual data. Data asset owners are now exempt from data protection rules and policy enforcement in Data Virtualization.

You can also publish your virtual objects to the catalog more easily and efficiently. For example, when you create your virtual objects by using the Data Virtualization user interface, your virtual objects are published automatically to the default catalog in Watson Knowledge Catalog.

Optionally, you can now publish your virtual objects by using the Data Virtualization REST APIs.

For details, see Governing virtual data.

Support for single sign-on and JWT authentication

You can now authenticate to Data Virtualization by using the same credentials you use for the Cloud Pak for Data platform. Additionally, Data Virtualization now supports authentication by using a JSON Web Token (JWT).

For details, see User credentials and authentication methods.

Support for additional data sources

You can now connect to the following data sources:

Greenplum
Salesforce.com
SAP OData

For details, see Adding data sources.

Scale your deployment

You can use the cpd-cli scale command to adjust the number of worker nodes that the Data Virtualization service is running on. When you scale the service it up, it makes the service highly available and increases the processing capacity.

For details, see Provisioning Data Virtualization.

Monitor the service by using Db2® Data Management Console

You can use the integrated monitoring dashboard to ensure that the Data Virtualization service is working correctly. The monitoring dashboard is powered by Db2 Data Management Console. Additionally, the monitoring dashboard provides useful information about databases connected to Data Virtualization.

For details, see Monitoring Data Virtualization.