Background of decision trees

The concept of a decision tree is to select splits that decrease the impurity of class distribution in the resulting subsets of instances. At the same time, the domination of one or more classes over the other classes is increased.

Note: This feature is available starting from Db2® version 11.5.4.
This way, you can find a subset that contains only instances of one class after a few splits.

For decision trees, binary trees are preferred because they can be generalized better than trees with high-cardinality attributes.

Creating and using decision tree models comprises the following algorithmic subtasks for decision trees:

  • Growing
  • Pruning
  • Prediction