Data survivorship feature

The Data survivorship feature introduces a set of rules that help identify the value that is fittest to survive among similar values received from multiple sources.

Fix Pack 12

Overview

The simplest approach to survivorship is to select the record that fits a data quality rule. These rules are often based on lineage (source systems are prioritized), completeness, or data quality dimensions such as validity.

Data survivorship rules decide which value for each attribute is retained in the Golden copy.

The Data survivorship feature supports items only.

The following important terms are used in the Data survivorship context.

Golden copy: A single source of truth for each entry in a master data management system.
Trusted Source rule: A rule that decides which value is the fittest to survive by trusting certain sources more than others.
Data survivorship Business rule (or Business rule): A rule that defines the attribute or group of attributes that decide which values are retained for the Golden copy.
Fallback rule: A rule that decides the winner value for an attribute in case of a tie. A Fallback rule can be either a Business rule or a specified source.
Source rule: A rule that decides the value depending on the selected source.
Winner value: If a Data survivorship attribute has duplicate values, the winner value is decided by the Source and Rule that are selected in the Spec console page.

If the Survivorship rule and Fallback rule are not specified, the winner value is decided by the priority of the sources (the order in which you add the selected sources).
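The default behavior described above can be illustrated with a minimal Python sketch. It is not product code: the function name pick_by_source_priority, the source names, and the choice to skip a source that supplied no value are all assumptions made for illustration.

```python
from typing import Dict, List, Optional


def pick_by_source_priority(
    values_by_source: Dict[str, Optional[str]],
    source_order: List[str],
) -> Optional[str]:
    """Return the value from the first source in source_order that supplied one.

    source_order stands in for the order in which the sources were added.
    Skipping sources with no value is an assumption of this sketch.
    """
    for source in source_order:
        value = values_by_source.get(source)
        if value is not None:
            return value
    return None


# Hypothetical sources and values for a single attribute.
color_values = {"CRM": None, "ERP": "Blue", "WEB": "Navy"}
winner = pick_by_source_priority(color_values, ["CRM", "ERP", "WEB"])
```

In this sketch, CRM has the highest priority but supplied no value, so the value from ERP, the next source in the order, is selected.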

Data survivorship rules

The Data survivorship rules (also called Business rules) define the attribute or group of attributes that decide which values are retained for the Golden copy. The following business rules are available.

Average rule: The average of the values is considered as the final value. Supported only by Number, Number enumeration, and Integer type attributes.
Concatenate rule: The list of all values from all the sources, separated by a vertical bar (|), is considered as the final value.
Frequency rule: The value that is provided by the largest number of sources is considered as the final value.
Last modified rule: The value of the attribute that has the latest timestamp is considered as the final value.
Maximum value rule: The highest value is considered as the final value.
Minimum value rule: The lowest value is considered as the final value.
Oldest value rule: The value of the attribute that has the oldest timestamp is considered as the final value.
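
To make the rule descriptions concrete, the following Python sketch applies each rule to the same set of candidate values. It is an illustration only, not the product implementation: the Candidate class, the function names, and the sample sources are hypothetical, and tie handling (which the product delegates to the Fallback rule) is not modeled.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime
from statistics import mean
from typing import List


@dataclass
class Candidate:
    """One value for an attribute, as received from one source (hypothetical shape)."""
    source: str
    value: int
    modified: datetime


def average_rule(cands: List[Candidate]) -> float:
    # Numeric attributes only.
    return mean(c.value for c in cands)


def concatenate_rule(cands: List[Candidate]) -> str:
    # All values, in source order, joined by a vertical bar.
    return "|".join(str(c.value) for c in cands)


def frequency_rule(cands: List[Candidate]) -> int:
    # Value supplied by the most sources; ties are not resolved here.
    return Counter(c.value for c in cands).most_common(1)[0][0]


def last_modified_rule(cands: List[Candidate]) -> int:
    return max(cands, key=lambda c: c.modified).value


def oldest_value_rule(cands: List[Candidate]) -> int:
    return min(cands, key=lambda c: c.modified).value


def maximum_value_rule(cands: List[Candidate]) -> int:
    return max(c.value for c in cands)


def minimum_value_rule(cands: List[Candidate]) -> int:
    return min(c.value for c in cands)


# Hypothetical input: three sources report a value for the same attribute.
candidates = [
    Candidate("CRM", 10, datetime(2024, 1, 1)),
    Candidate("ERP", 14, datetime(2024, 3, 1)),
    Candidate("WEB", 10, datetime(2024, 2, 1)),
]
```

With this input, the Frequency rule and the Oldest value rule both select 10, while the Last modified rule and the Maximum value rule both select 14, which shows how the choice of rule changes the value that reaches the Golden copy.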