Classifying an asset according to its data
Use data classification to identify database columns and data file fields according to their type.
Before you begin
- To create, edit, delete, and assign data classes to assets, you must have the Information Governance Catalog Information Asset Author role or higher.
- To browse and to query data classes, you must have the Information Governance Catalog User role or higher.
About this task
IBM® InfoSphere® Information Analyzer analyzes data and detects its data classification. In InfoSphere Information Analyzer and IBM InfoSphere Information Governance Catalog, you can manually assign a data classification to an asset. Only one data class can be assigned to an asset.
When you edit a data class, you can assign labels, stewards, terms, and information governance rules. In addition, you can define other attributes of the data class and classify assets according to the data class.
Detected classifications from InfoSphere Information Analyzer can be viewed, but not removed, by InfoSphere Information Governance Catalog.
A classification that was selected in either InfoSphere Information Analyzer or InfoSphere Information Governance Catalog, can be cleared and removed from InfoSphere Information Governance Catalog.
Procedure
Example: Classifying a database column
- Data class type is regex
- Data type is string
- Threshold is 90
- Minimum data length is 9
- Maximum data length is 9
- Regular expression is ^[0-9]{9}$
You run column analysis in InfoSphere Information Analyzer. If 90% of the values in the database column match the format correctly, you can classify the database column as being a column of national identity numbers.