IBM Support

Increase Data Quality Dimensions export from default 500 in InfoSphere Information Analyzer

How To


Summary

When performing Data Quality Dimensions export, the default value is 500 records.


If you need to increase what is exported you need to increase the parameter "maxNbOfColumnValuesToCapturePerProblem" and re-run Data Quality Analysis so those values are captured in IADB and then re-run the export.

Steps

1.Login to services tier and run the following command :

/opt/IBM/InformationServer/ASBServer/bin/IAAdmin.sh -user admin -password admin -url https://serviceshostname:port -getDataQualityConfig -projectName Your_Project_Name > myConfig.json

Review the contents of myConfig.json file.

2.Edit myConfig.json to add the parameter "maxNbOfColumnValuesToCapturePerProblem"

You need to add the following string :

"additionalProperties":{
        "maxNbOfColumnValuesToCapturePerProblem":2000
    },

Please note 2000 is an example, you can set it to the number that you wish to export.
The modified "myconfig.json" should look like below :

{
        "useAutomaticConfiguration":true,
        "dataQualityDimensions":[
                {
                        "ignored":true,
                        "dataQualityProblemType":"SuspectCorrelatedValue",
                        "name":"Suspect values in correlated columns",
                        "description":"Suspect values in correlated columns"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"DataClassViolation",
                        "name":"Data class violations",
                        "description":"Data class violations"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"MissingValue",
                        "name":"Missing values",
                        "description":"Missing values"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"SuspectValues",
                        "name":"Suspect values",
                        "description":"Suspect values"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"DataTypeViolation",
                        "name":"Data type violations",
                        "description":"Data type violations"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"FormatViolation",
                        "name":"Format violations",
                        "description":"Format violations"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"UniquenessViolation",
                        "name":"Duplicated values",
                        "description":"Duplicated values"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"CaseViolation",
                        "name":"Inconsistent capitalization",
                        "description":"Inconsistent capitalization"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"ValueRangeViolation",
                        "name":"Values out of range",
                        "description":"Values out of range"
                },
                {
                        "ignored":false,
                        "dataQualityProblemType":"NonStandardMissingValue",
                        "name":"Inconsistent representation of missing values",
                        "description":"Inconsistent representation of missing values"
                }
        ],
        "subscribers":[ ],
        "isInherited":false,
        "additionalProperties":{
                "maxNbOfColumnValuesToCapturePerProblem":2000
        },
        "useOnlySpecifiedDimensions":true,
        "qualityThreshold":0.8
}

3.Now update the configuration using the modified json, using the following command :

/opt/IBM/InformationServer/ASBServer/bin/IAAdmin.sh -user admin -password admin -url https://serviceshostname:port -setDataQualityConfig -projectName Your_Project_Name -content myConfig.json

4.Most Important - Now you need to re-run DataQuality Analysis and then export.

Document Location

Worldwide

[{"Line of Business":{"code":"","label":""},"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Product":{"code":"SSZJLG","label":"InfoSphere Information Analyzer"},"ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Version(s)"}]

Product Synonym

IA

Document Information

Modified date:
16 February 2021

UID

ibm16415035