IBM Support

Cluster of binary data returns "proximity matrix contains too many missing distances".

Troubleshooting


Problem

I am running a hierarchical clustering analysis of binary (0,1) variables. I am using the SPSS Cluster procedure and specifying the Jaccard proximity measure for binary data and using the 1 value to represent the presence of each attribute. I have no missing values in the cluster variables, yet my Cluster run returns the warning: "The proximity matrix contains too many missing distances. CLUSTER procedure cannot continue. This command is not executed." Does this warning reflect a problem with the size of the data set, either in terms of the number of cases or variables being analyzed? Are there other proximity measures that could be employed in this situation?

[{"Product":{"code":"SSLVMB","label":"IBM SPSS Statistics"},"Business Unit":{"code":"BU048","label":"IBM Software"},"Component":"Not Applicable","Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"Not Applicable","Edition":"","Line of Business":{"code":"LOB76","label":"Data Platform"}}]

Log InLog in to view more of this document

This document has the abstract of a technical article that is available to authorized users once you have logged on. Please use Log in button above to access the full document. After log in, if you do not have the right authorization for this document, there will be instructions on what to do next.

Historical Number

76503

Document Information

More support for:
IBM SPSS Statistics

Software version:
Not Applicable

Document number:
421319

Modified date:
16 April 2020

UID

swg21480995

Manage My Notification Subscriptions