Integrating InfoSphere QualityStage with InfoSphere MDM

InfoSphere® MDM can be configured to use IBM® InfoSphere QualityStage®'s standardization and matching capabilities.

InfoSphere QualityStage is a comprehensive development environment for building applications to re-engineer data. It provides a set of integrated modules for accomplishing data re-engineering tasks such as Conditioning (Standardization), Matching, Searching, and others. InfoSphere MDM can be configured to use InfoSphere QualityStage's standardization and matching capabilities.

For more details about using InfoSphere QualityStage for party matching, see Suspected duplicate identification using QualityStage.

For more details about using InfoSphere QualityStage for standardization, see Using InfoSphere QualityStage for standardization.

Definitions, acronyms, and abbreviations used when discussing QualityStage integration

The following terms are used when discussing QualityStage integration:
QualityStage Designer
Provides a client interface for defining and customizing data re-engineering jobs. It runs on a Windows workstation.
QualityStage Server
Accesses the source data, defined by the Designer, and processes them into the target re-engineered data. It can run on Windows or UNIX.
QualityStage Stage
A data re-engineering operation such as Investigate, Standardize, Match, and Survive. InfoSphere MDM only uses Standardize and Match stages. Clients can use out of the box stages or create their own stage and attach available rule sets to them.
QualityStage Job
Incorporates a number of data re-engineering stages. QualityStage uses jobs to process data, creating various intermediate and final stages of re-engineered data. The processing criteria are determined by rule sets that you specify for a job. Once Jobs are built using Designer they need to be deployed on the QualityStage server.
WebSphere® Information Services Director (WISD)
Acts as a layer between the IBM InfoSphere Information Server client and the InfoSphere Information Server server. It provides a unified mechanism for publishing and managing shared Service Oriented Architecture (SOA) services across data quality, data transformation, and federation functions, which allows information specialists to easily deploy services for any information integration task and consistently manage them. WISD enables developers to take data integration logic built using InfoSphere Information Server and publish it as an "always on" service.

InfoSphere MDM distribution components for QualityStage

The following components are required for integrating QualityStage with InfoSphere MDM. You can find these components under \<MDM_INSTALL_DIR>\Integrations\QualityStage:
Table 1. Distribution components
Component Name Description
MDMQS.dsx DataStage/QualityStage job export. Contains source code to be imported into your environment through the DataStage/QualityStage Designer Client.
MDMQS_ISDProject.xml (for the RIM interface) WISD project export. Contains service definitions to be imported into your environment through the InfoSphere Information Server Console.
MDMQSWS.xml (for the Web Services interface) WISD project export. Contains service definitions to be imported into your environment through the InfoSphere Information Server Console.
MDMQSJP.dsx DataStage/QualityStage job export for standardization of Japanese names and addresses.
MDMQSJP_ISDProject.xml (for the RIM interface) WISD project export of MDMQSJP. Contains service definitions to be imported into your environment through the InfoSphere Information Server Console.
*.csv Test data used to test the batch version of the jobs.

Where to find more QualityStage information

For more information, see the IBM InfoSphere Information Server – Bookshelf documentation, including:
  • IBM InfoSphere QualityStage User's Guide
  • IBM InfoSphere Information Server Planning, Installation, and Configuration Guide