Examples of name similarity
This topic presents examples of similar resource names that might be discovered by using the default name similarity settings.
By default, name similarity is configured with the following default settings. For more
information on these configuration parameters, see Configuring name similarity.
Parameter | Description | Values |
---|---|---|
name_similarity_default_threshold |
String comparison threshold. For example, a similarity threshold value of 0.9 means that
strings must match by at least that value to be considered similar. For more information about the
threshold value, see Similarity
threshold value.
|
0.9 |
name_similarity_default_lead_restriction |
Lead restriction. Number of characters at the beginning of the string that must be identical. | 1 |
name_similarity_default_tail_restriction |
Tail restriction. Number of characters at the end of the string that must be identical. | 0 |
Based on these settings, the following event snippets present an example for resource name
similarity analysis. Note that the resource name is stored in the resource column.
NODE
is the default resource column, but can be changed for the pattern event
type.1 NODE SUMMARY ALERTKEY Similar?
2 acme.env1.base.adm_chk_probeCheck System Alert SEV2 ABC adm_probe No
3 cnz.env2.base.adm_chk_reports System Alert SEV2 ABC adm_report No
4 abc.lyf.base.logs1 System Alert SEV2 DEF logs1 No
5 abc.gbs.stato.dotnetcore System Alert SEV2 ABC runtime_down No
6 caripa.env1.stato.dotnetcore System Alert SEV2 GHI runtime_down Yes
7 caripa.env1.stato.TNT System Alert SEV2 GHI runtime_down Yes
8 caripa.env1.stato.TNT System Alert SEV2 GHI runtime_down Yes
9 emperor.env3.stato.pythonRuntime System Alert SEV2 ABC runtime_down No
10 abc.env5.base.bash.total_cpu_noncore System Alert SEV2 DEF bash_cpu_noncore No
11 abc.cio.base.total_cpu_noncore System Alert SEV2 ABC bash_cpu_noncore No
12 banca.env1.base.bosh.jobstate.console System Alert SEV2 GHI job_fail No
As
a result of this similarity analysis, the resource names in the NODE
column for the
events listed in rows 6, 7, and 8 are considered similar. The reasons for this include the following:- All of the resource names other than those in the
NODE
column of rows 2, 6, 7, and 8, start with a letter other than c, hence they are rejected automatically, because the lead restriction is set to 1 character. - The resource name in the
NODE
column of row 2 fails the similarity threshold of 0.9 because it is very different to the resource names in rows 6, 7, and 8. - The tail restriction is set to 0, so this allows the resource name in row 6 to pass overall similarity, even though the final letters of its resource name are different to the final letters of the resource names in rows 7 and 8.