CXNM (standardization function)

Used for business name standardization, the CXNM function provides two different kinds of anonymous values: single-token and whole-value.

Single-token anonymous values are removed on a per-word basis whereas whole-value anonymous values cause the entire name to be treated as anonymous (missing). Whole-value anonymous values must be enclosed in parentheses to distinguish them from single-token anonymous values. Single-token anonymous values do not have enclosing parentheses.

The CXNM function has a length limitation of 512 characters. If the value of the length of the CXNM field is over 512 characters, the operational server truncates the input value to the first 512 characters of the value.

Output Type
any value
Fldargs
onmlast
MinFldArgs, MaxFldArgs
1
Number of standard roles
1
Dvdargs
maxwords (an integer value)
CMAP strcode
CMAP
  1. The function first checks to see if there is any data in the input to be processed.
  2. If a CMAP table is specified in the mpi_dvdxstd table, then the CMAP conversion is done first.
  3. The lowercase characters are converted to uppercase. Any uppercase characters and digits are left as they are. Any ampersand, single quotation, or back quotation is deleted and the characters are joined (for example, AT & T becomes ATT).
    Note: Only alphanumeric characters can go through CXNM standardization function.
  4. Single characters are combined together by deleting the space between them. The final space is then deleted from the buffer (for example, I B M is converted to IBM).
  5. If there is an ANON table that is specified in mpi_dvdxstd, then the input is checked to see if the whole value is anonymous. A value that is enclosed in parenthesis is considered a whole value ANON.
  6. The input is then separated into words. The words are added until the word limit is reached. This value can be passed through the dvdargs property. The maximum value is 16. If the dvdargs is not specified, a default of 4 is used.
  7. If there is an ANON table that is specified in mpi_dvdxstd, each word is checked to see if it is anonymous.
  8. If the EQUI table is specified in mpi_dvdxstd and if the word is found in mpi_strequi, the word is replaced with the first entry in the EQUI set.
Whole-value example:

If UNKNOWN was specified as anonymous, the name UNKNOWN would correctly become anonymous, and the name WORLD OF THE UNKNOWN would be standardized as WORLD OF THE. However, if you specify (UNKNOWN) as a whole-value ANON and INC as a single-token ANON, the name WORLD OF THE UNKNOWN INC would correctly standardize to WORLD OF THE UNKNOWN. Also, the value UNKNOWN would correctly become anonymous.