AnswerTree 3.1/3.0 has implemented consistent rules for handling ties between predictors when determining splits. In the past, AnswerTree did not always follow consistent rules about what to do in case of a tie. The most obvious result of these rules is that some trees developed in previous versions of AnswerTree will look different if built in AnswerTree 3.1/3.0.
Because these tie-breaking rules affect more extreme situations and are often arbitrary decisions, the results in any version are equally valid. For example, in C&RT;, the order that the predictor was added to the model-an arbitrary rule-determines the split. The difference is that now you can understand and explain how a split is made in these situations. It is also worth noting that in these situations it may be worth using AnswerTree's interactive tree building features to choose the predictor, since you may be more interested in including one attribute instead of another equally significant predictor.
The algorithms use different rules to determine which predictor to use. For example, with a continuous target, CHAID will choose the predictor whose split gives the most significantly different groups of the target. For a categorical target, CHAID selects the predictor whose measure of independence with the target is smallest. (See Appendix E in the AnswerTree User's Guide for details.) If there is a "tie" between two predictors, then AnswerTree needs to choose one predictor over another. In 3.1/3.0 the rules are always consistent:
For CHAID & Exhaustive CHAID:
For QUEST:
C&RT; has not changed. The rules are as follows:
Read about other features added to AnswerTree 3.1/3.0.
Predictive Analytics
can make your organization
more
successful
Key Products and Services
Resources
Buy
Demos and Downloads