Data survivorship feature
The Data survivorship feature introduces a set of rules that help in identifying a
fittest to survive
value among the similar values received from the multiple sources.
Overview
Selecting a record that fits a data quality rule is the simplest approach. These rules are often based on either lineage (source systems are prioritized), completeness, or data quality dimensions like validity.
Data survivorship rules decide which value for each attribute is retained in the Golden copy.
Data survivorship feature supports items
only.
Term | Description |
---|---|
Golden copy | A single source of truthfor each entry in a master data management system. |
Trusted Source rule | A rule that decides which value is the fittest to survive, by trusting certain sources more than the other sources. |
Data survivorship Business rule or Business rule | This rule defines the attributes or group of attributes that decide the values to be retained for the Golden copy. |
Fallback rule | This rule decides the winner value for an attribute in case of a tie. A Fallback rule can either be a Business rule or a specified source. |
Source rule | This rule decides the value depending on the selected source. |
Winner value | The winner valueif a Data survivorship attribute has duplicate values is decided depending on the Source and Rule that is selected in the Spec console page. If the Survivorship
rule and Fallback rule is not specified, the |
Data survivorship rules
Rule | Description |
---|---|
Average rule | The average of values is considered as the final value. Only supported by Number, Number enumeration, and Integer type attribute. |
Concatenate rule | List of all values from all the sources separated by a
Vertical bar(|) is considered as the final value. |
Frequency rule | Similar value that are provided by maximum number of sources is considered as the final value. |
Last modified rule | The value of the attribute that has the latest timestamp is considered as the final value. |
Maximum value rule | The highest value is considered as the final value. |
Minimum value rule | The lowest value is considered as the final value. |
Oldest value rule | The value of the attribute that has the oldest timestamp is considered as the final value. |