Adding rules in InfoSphere Information Analyzer thin client

You use rules to evaluate or validate specific conditions associated with your data sources. You can add quality rules and data rules to data in your workspaces by binding rule definitions to columns.

Before you begin

You must create rule definitions and add them to your workspace in InfoSphere® Information Analyzer workbench before you can view and use them in the thin client.

About this task

A quality rule or data rule is generated when a rule definition is bound to columns in a data set. The table below summarizes the major differences between the two rule types. For additional information, see Rules in InfoSphere Information Analyzer thin client.
Table 1. Quality rules and data rules in Information Analyzer thin client
Data rules
  • Must be given a name
  • Allow for user-defined output content and tables
  • Can be run and managed independently
  • Can be created and managed in the Information Analyzer workbench or thin client
Quality rules
  • Do not require a name
  • Do not allow for user-defined output content or tables; output is always displayed in the data quality analysis results for a data set
  • Always impact the data quality score for a data set
  • Can only be created in the thin client

Procedure

  1. From the data set level analysis screen, click Add rule on the toolbar. The Add a rule dialog opens.
  2. To indicate the type of rule you want to add, select quality rule or data rule from the drop-down list at the top of the dialog.
  3. Begin by selecting either columns or a rule definition:
    Select the column or columns you want to work with:
      1. Search for the columns you want by using the keyword search or by sorting on the attributes in the table.
      2. Select the check boxes next to the columns you want to use.
      Note: If you select more than one column, you can choose only a single-variable rule definition from the list of available definitions. You cannot bind more than one column at a time to a variable in a multi-variable rule definition.
    Select the rule definition you want to work with:
      1. Search for the rule definition you want by using the keyword search or by sorting on the attributes in the table.
      2. Click the rule definition you want to use. Below the rule definition table, the rule expression and a list of the variables that you need to bind are displayed.
      Note: If you select a rule definition with multiple variables, you can bind only a single column to each variable. You cannot bind more than one column at a time to a variable in a multi-variable rule definition.
  4. To bind a column or columns to a variable in the rule definition, either drag and drop the column(s) to the appropriate variable, or click the bind icon in the cell.
  5. To bind literal values to a variable in the rule definition:
    1. Click the pencil icon.
    2. Select the data type you want to use.
    3. Add the value manually.
  6. To bind global variable values to a variable in the rule definition:
    1. Click the Global variables tab.
    2. Either drag and drop the global variable(s) to the appropriate variable for the data rule, or click the bind icon in the cell.
    Note: Global variables can only be created in the InfoSphere Information Analyzer workbench. In the InfoSphere Information Analyzer thin client, you can only see global variables that represent columns from the data set that you are currently working with. You cannot bind columns from more than one data set.
  7. After every variable is bound to a data element, click Next.
  8. If you are creating a data rule, follow the steps to define details and output for your data rule and click Next.
  9. From the Review and test screen, review the details of the rule or rules you created.
  10. Click the Test icon in the grid to view sample output and confirm that the results appear as expected.
  11. Click Done to save the rule. You will be able to see the new data rule or quality rule in the Rules section of the data set analysis screen.
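
The binding restrictions described in the notes for step 3 can be sketched as a small check. This is an illustrative model only, not part of InfoSphere Information Analyzer: the RuleDefinition class, the check_bindings function, and the sample names are all hypothetical, and for simplicity the sketch treats every bound data element as a column name.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RuleDefinition:
    """Hypothetical stand-in for a rule definition: a name plus its variables."""
    name: str
    variables: List[str]


def check_bindings(definition: RuleDefinition,
                   bindings: Dict[str, List[str]]) -> List[str]:
    """Return the problems found in the proposed bindings (empty if none).

    bindings maps each variable name to the columns bound to it.
    Assumed constraints, taken from the notes in step 3:
      - every variable must be bound before you can continue;
      - in a multi-variable definition, each variable takes one column only.
    """
    problems = []
    multi_variable = len(definition.variables) > 1
    for variable in definition.variables:
        columns = bindings.get(variable, [])
        if not columns:
            problems.append(f"{variable}: not bound")
        elif multi_variable and len(columns) > 1:
            problems.append(
                f"{variable}: only one column allowed in a "
                "multi-variable rule definition"
            )
    return problems


# A single-variable definition accepts several columns at once.
not_null = RuleDefinition("not_null", ["value"])
print(check_bindings(not_null, {"value": ["CUST_ID", "ORDER_ID"]}))

# A multi-variable definition accepts exactly one column per variable.
a_before_b = RuleDefinition("a_before_b", ["start", "end"])
print(check_bindings(a_before_b, {"start": ["OPEN_DT", "SHIP_DT"]}))
```

In this model the first call reports no problems, while the second reports both the extra column bound to start and the unbound end variable, which corresponds to the cases where the dialog would not let you proceed.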