Measures for Frequency Count Data (CLUSTER command)
For frequency count data, use any one of the following keywords on MEASURE:
CHISQ. Based on the chi-square test of equality for two sets of frequencies. The magnitude of this dissimilarity measure depends on the total frequencies of the two cases or variables whose dissimilarity is computed. Expected values are from the model of independence of cases or variables x and y.
PH2. Phi-square between sets of frequencies. This is the CHISQ measure normalized by the square root of the combined frequency. Therefore, its value does not depend on the total frequencies of the two cases or variables whose dissimilarity is computed.
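
The relationship between the two keywords can be illustrated with a short Python sketch. This is not the procedure's own code; it assumes that the CHISQ measure is the square root of the Pearson chi-square statistic for the 2 x k table formed by the two frequency sets, with expected values taken from the independence model, and that PH2 divides that value by the square root of the combined frequency. The function names chisq_measure and ph2_measure are hypothetical.

```python
import math


def chisq_measure(x, y):
    """Chi-square dissimilarity between two sets of frequencies.

    Assumption: square root of the Pearson chi-square statistic for the
    2 x k table whose rows are x and y, with expected values from the
    independence model (row total * column total / grand total).
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    total_x, total_y = sum(x), sum(y)
    if total_x <= 0 or total_y <= 0:
        raise ValueError("each set of frequencies must have a positive total")
    n = total_x + total_y

    stat = 0.0
    for xi, yi in zip(x, y):
        column_total = xi + yi
        if column_total == 0:
            continue  # assumption: an empty category contributes nothing
        expected_x = total_x * column_total / n
        expected_y = total_y * column_total / n
        stat += (xi - expected_x) ** 2 / expected_x
        stat += (yi - expected_y) ** 2 / expected_y
    return math.sqrt(stat)


def ph2_measure(x, y):
    """Phi-square: the chi-square measure divided by sqrt(combined frequency)."""
    n = sum(x) + sum(y)
    return chisq_measure(x, y) / math.sqrt(n)


x = [10, 20, 30]
y = [30, 20, 10]
x10 = [10 * v for v in x]  # same proportions, ten times the total frequency
y10 = [10 * v for v in y]

print(chisq_measure(x, y), chisq_measure(x10, y10))  # CHISQ grows with the totals
print(ph2_measure(x, y), ph2_measure(x10, y10))      # PH2 is unchanged
```

Under these assumptions, multiplying both frequency sets by the same factor increases the CHISQ value but leaves the PH2 value unchanged, which is the property described for PH2 above.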