Apriori Node Expert Options
For those with detailed knowledge of Apriori's operation, the following expert options allow you to fine-tune the induction process. To access expert options, set the Mode to Expert on the Expert tab.
Evaluation measure. Apriori supports five methods of evaluating potential rules.
- Rule Confidence. The default method uses the confidence (or accuracy) of the rule to evaluate rules. For this measure, the Evaluation measure lower bound is disabled, since it is redundant with the Minimum rule confidence option on the Model tab. See the topic Apriori Node Model Options for more information.
- Confidence Difference. (Also called absolute confidence difference to prior.) This evaluation measure is the absolute difference between the rule's confidence and its prior confidence. This option prevents bias where the outcomes are not evenly distributed. This helps prevent "obvious" rules from being kept. Set the evaluation measure lower bound to the minimum difference in confidence for which you want rules to be kept.
- Confidence Ratio. (Also called difference of confidence quotient to 1.) This evaluation measure is the ratio of rule confidence to prior confidence (or, if the ratio is greater than one, its reciprocal) subtracted from 1. Like Confidence Difference, this method takes uneven distributions into account. It is especially good at finding rules that predict rare events. Set the evaluation measure lower bound to the difference for which you want rules to be kept.
-
Information Difference. (Also called information difference to prior.) This measure is
based on the information gain measure. If the probability of a particular consequent is
considered as a logical value (a bit), then the information gain is the proportion of that
bit that can be determined, based on the antecedents. The information difference is the difference
between the information gain, given the antecedents, and the information gain, given only the prior
confidence of the consequent. An important feature of this method is that it takes support into
account so that rules that cover more records are preferred for a given level of confidence. Set the
evaluation measure lower bound to the information difference for which you want rules to be kept.
-
Normalized Chi-square. (Also called normalized chi-squared measure.) This measure is a
statistical index of association between antecedents and consequents. The measure is normalized to
take values between 0 and 1. This measure is even more strongly dependent on support than the
information difference measure. Set the evaluation measure lower bound to the information difference
for which you want rules to be kept.
Allow rules without antecedents. Select to allow rules that include only the consequent (item
or item set). This is useful when you are interested in determining common items or item sets. For
example, cannedveg
is a single-item rule without an antecedent that indicates
purchasing cannedveg is a common occurrence in the data. In some cases, you may want to
include such rules if you are simply interested in the most confident predictions. This option is
off by default. By convention, antecedent support for rules without antecedents is expressed as
100%, and rule support will be the same as confidence.