Domain & Completeness tab - Range

Use this tab to set a minimum value and a maximum value for the data values that you are validating. All data values that fall outside of this range are set to invalid.

You can set a range to determine the validity of the data values only for columns with a data class of Quantity or Date.

Outliers
Specifies the number of maximum and minimum values that appear in the frequency distribution table. For example, the default value of 10 specifies that the 10 maximum values and the 10 minimum values display in the frequency distribution table. If you modify the outliers value, click Reload to update the frequency distribution table.
Minimum
Shows the minimum value that you specified as valid.
Maximum
Shows the maximum value that you specified as valid.
Date Format
For data values with a data class of Date, select a valid date format from the menu.
Reload
Click to reload the frequency distribution table if you modified the outliers value.
Reviewed
Select to specify that the analysis results have been reviewed.
Rebuild Inferences
After you mark invalid values in the Domain & Completeness tab, select Valid Values to rebuild column analysis inferences using only the valid values in the column.
Select All Values to rebuild column analysis inferences using all of the values in the column.

Frequency Distribution table

Shows details for all of the data values in the selected column. Specify whether a data value is the maximum or minimum value for the range. All values that fall outside of that range are set to invalid.

Outliers
Shows where the value is in the set outliers, for example, low or high.
Data Value
Shows the data field.
Count
Shows how often this data value appears in the column.
Percent
Shows what percent of the total records this data value represents.
Status
Shows whether the value is valid, invalid, or default.
Min/Max
Click to specify that the data value is the maximum or minimum value in the range.
Show Quintiles
Click to view an analysis of all the data values to help you determine the appropriate minimum and maximum values.
Drill Down
Click to view the detailed information of all instances of the data value in the selected column.
Delete
Click to remove the selected data value from the frequency distribution. You can only delete a data value that has a frequency count of 0.
New Value
Click to add a new data value to the frequency distribution. You might want to add a data value to the frequency distribution if the maximum or minimum value that you want to set in the range does not exist in the frequency distribution.

Completeness Summary

Shows an overview of the count and percentage of complete and incomplete data values in the selected column. Additionally, this object list provides details for each data value that has been marked incomplete.

Distinct Values
Shows a count of the distinct data values that are marked as complete and incomplete and the percent that those values represent in the total count of distinct values in the column.
Records
Shows a count of the total data values that are marked as complete and incomplete and the percent that those values represent in the total count of data values in the column.
Incomplete Values table
Data Value
Shows the actual data value from the source.
Count
Shows the number of instances of this data value in the column.
Percent
Shows the percent of the total records this data value represents.

Validity Summary

Shows an overview of the count and percentage of valid and invalid data values in the selected column. Additionally, this object list provides details for each data value that has been marked invalid.

Distinct Values
Shows a count of the distinct data values that are marked as valid and invalid and the percent that those data values represent in the total count of distinct data values in the column.
Records
Shows a count of the total data values that are marked as valid and invalid and the percent that those data values represent in the total count of data values in the column.
Invalid Values table
Data Value
Shows the actual data value from the source.
Count
Shows the number of instances of this data value in the column.
Percent
Shows the percent of the total records this data value represents.