Measures for Frequency Count Data (CLUSTER command)

For frequency count data, use any one of the following keywords on MEASURE:

CHISQ. Based on the chi-square test of equality for two sets of frequencies. The magnitude of this dissimilarity measure depends on the total frequencies of the two cases or variables whose dissimilarity is computed. Expected values are from the model of independence of cases or variables x and y.

PH2. Phi-square between sets of frequencies. This is the CHISQ measure normalized by the square root of the combined frequency. Therefore, its value does not depend on the total frequencies of the two cases or variables whose dissimilarity is computed.