You use rules to evaluate or validate specific conditions associated with your data
sources. You can add quality rules and data rules to data in your workspaces by binding rule
definitions to columns.
Before you begin
You must create rule definitions and add them to your workspace in the
InfoSphere® Information Analyzer workbench
before you can view and use them in the thin client.
About this task
A quality rule or data rule is generated when a rule definition is bound to columns in a data
set. The table below summarizes the major differences between the two rule types. For additional
information, see
Rules in InfoSphere Information Analyzer thin client.
Table 1. Quality rules and data rules in Information Analyzer thin client
Data rules |
- Must be given a name
- Allow for user-defined output content and tables
- Can be run and managed independently
- Can be created and managed in the Information Analyzer workbench or thin client
|
Quality rules |
- Do not require a name
- Do not allow for user-defined output content or tables; output is always displayed in the data
quality analysis results for a data set
- Always impact the data quality score for a data set
- Can only be created in the thin client
|
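The differences in Table 1 can also be read as a small data model. The following Python sketch is illustrative only: the class, field, and function names are hypothetical and do not correspond to any InfoSphere Information Analyzer API. Where Table 1 does not state a property for a rule type, the sketch records it as None rather than guessing.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class RuleTypeTraits:
    """Hypothetical summary of one rule type, transcribed from Table 1."""
    name_required: bool
    user_defined_output: bool
    # None means Table 1 makes no statement for this rule type.
    always_affects_quality_score: Optional[bool]
    created_in: Tuple[str, ...]


DATA_RULE = RuleTypeTraits(
    name_required=True,
    user_defined_output=True,
    always_affects_quality_score=None,
    created_in=("workbench", "thin client"),
)

QUALITY_RULE = RuleTypeTraits(
    name_required=False,
    user_defined_output=False,
    always_affects_quality_score=True,
    created_in=("thin client",),
)


def name_is_acceptable(traits: RuleTypeTraits, name: Optional[str]) -> bool:
    """Return True if `name` satisfies the rule type's naming requirement."""
    return bool(name) or not traits.name_required
```

For example, name_is_acceptable(DATA_RULE, None) returns False because a data rule must be given a name, while the same call with QUALITY_RULE returns True.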
Procedure
- From the data set level analysis screen, click Add rule on the
toolbar. The Add a rule dialog opens.
- To indicate the type of rule that you want to add, select quality rule
or data rule from the drop-down list at the top of the dialog.
- You can choose to begin by selecting either columns or a rule definition.
Option |
Description |
Select the column or columns you want to work with |
- Search for the column or columns you want by using the keyword search or
sorting by the attributes in the table.
- Select the check boxes next to the columns you want to use.
Note: If you select more than one column, you can select only a single-variable rule
definition from the list of available definitions. You cannot bind more than one column at a time to
a variable in a multi-variable rule definition.
|
Select the rule definition you want to work with |
- Search for the rule definition you want by using the keyword search or sorting by the attributes
in the table.
- Click on the rule definition you want to use. Underneath the rule definition table, you will see
the rule expression and a list of the variables that you need to bind.
Note: If you select a rule definition with multiple variables, you will only be able to bind a
single column to each variable. You cannot bind more than one column at a time to a variable in a
multi-variable rule definition.
|
- To bind a column or columns to a variable in the rule definition, either drag and drop the
column(s) to the appropriate variable, or click the bind icon in the cell.
- To bind literal values to a variable in the rule definition:
- Click the pencil icon.
- Select the data type you want to use.
- Add the value manually.
- To bind global variable values to a variable in the rule definition:
- Click the Global variables tab.
- Either drag and drop the global variable(s) to the appropriate variable for the data rule, or
click the bind icon in the cell.
Note: Global variables can only be created in the InfoSphere Information Analyzer workbench. In
the InfoSphere Information Analyzer thin
client, you can only see global variables that represent columns from the data set that you are
currently working with. You cannot bind columns from more than one data set.
- After all of your variables are bound to a data element, click
Next.
- If you are creating a data rule, follow the steps to define details and output for your data
rule and click Next.
- From the Review and test screen, review the details of the rule or rules
you created.
- Click the Test icon in the grid to view sample output and confirm that
the results appear as expected.
- Click Done to save the rule. The new data rule or quality rule appears
in the Rules section of the data set analysis screen.
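The binding constraints described in the notes for this procedure can be sketched in code. The following Python example is a minimal model, not product code: RuleDefinition, bind, and the sample definition and column names are all hypothetical. It also assumes that selecting several columns for a single-variable rule definition yields one rule per column, which is how this sketch interprets "the rule or rules you created".

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class RuleDefinition:
    """Hypothetical stand-in for a rule definition and its variables."""
    name: str
    expression: str
    variables: Tuple[str, ...]


def bind(definition: RuleDefinition,
         bindings: Dict[str, List[str]]) -> List[Dict[str, str]]:
    """Return one variable-to-column mapping per rule that would be created.

    `bindings` maps each variable name to the list of selected column names.
    """
    # Every variable must be bound before the rule can be created.
    missing = [v for v in definition.variables if not bindings.get(v)]
    if missing:
        raise ValueError(f"unbound variables: {missing}")

    if len(definition.variables) > 1:
        # Multi-variable definition: exactly one column per variable.
        for var in definition.variables:
            if len(bindings[var]) != 1:
                raise ValueError(f"variable {var!r} accepts only a single column")
        return [{var: bindings[var][0] for var in definition.variables}]

    # Single-variable definition: several columns may be selected.
    # Assumption: each selected column produces its own rule.
    (var,) = definition.variables
    return [{var: column} for column in bindings[var]]


# Hypothetical definitions and columns, for illustration only.
not_null = RuleDefinition("NotNull", "value exists", ("value",))
ordered = RuleDefinition("Ordered", "start <= end", ("start", "end"))

single_var_rules = bind(not_null, {"value": ["CUSTOMER_ID", "EMAIL"]})
multi_var_rules = bind(ordered, {"start": ["OPEN_DATE"], "end": ["CLOSE_DATE"]})
```

In this model, binding two columns to the single variable of NotNull produces two mappings, whereas passing two columns for either variable of Ordered raises a ValueError, mirroring the restriction on multi-variable rule definitions.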