Exploring Clusters

After you build clusters, you can see a set of results in the Clusters pane. For each cluster, the following information is available in the table:

  • Cluster. This is the name of the cluster. Clusters are named after the concept with the highest number of internal links.
  • Concepts. This is the number of concepts in the cluster. See the topic Cluster Definitions for more information.
  • Internal. This is the number of internal links in the cluster. Internal links are links between concept pairs within a cluster.
  • External. This is the number of external links in the cluster. External links are links between concept pairs when one concept is in one cluster and the other concept is in another cluster.
  • Sat. If a symbol is present, this indicates that this cluster could have been larger but one or more limits would have been exceeded, and therefore, the clustering process ended for that cluster and is considered to be saturated. At the end of the clustering process, saturated clusters are presented before unsaturated ones and therefore, many of the resulting clusters will be saturated. In order to see more unsaturated clusters, you can change the Maximum number of clusters to create setting to a value greater than the number of saturated clusters or decrease the Minimum link value. See the topic Building Clusters for more information.
  • Threshold. For all of the cooccurring concept pairs in the cluster, this is the lowest similarity link value of all in the cluster. See the topic Calculating Similarity Link Values for more information. A cluster with a high threshold value signifies that the concepts in that cluster have a higher overall similarity and are more closely related than those in a cluster whose threshold value is lower.

To learn more about a given cluster, you can select it and the visualization pane on the right will show two graphs to help you explore the cluster(s). See the topic Cluster Graphs for more information. You can also cut and paste the contents of the table into another application.

Whenever the extraction results no longer match the resources, this pane becomes yellow as does the Extraction Results pane. You can reextract to get the latest extraction results and the yellow coloring will disappear. However, each time an extraction is performed, the Clusters pane is cleared and you will have to rebuild your clusters. Likewise clusters are not saved from one session to another.