Chapter 06.05: Computational Aspects of Finding Splits

In this section, we explain the computational aspects of the node-splitting procedure, especially for nominal features. In addition, we illustrate how to deal with missing values.

Lecture video

Lecture slides

Quiz

--- shuffle_questions: false --- ## Which statements are true? - [x] To find optimal splits, one iterates over all features, and for each feature over all possible split points. - [ ] To find optimal splits, we use the one that splits the data approximately in half in each step. - [x] To find optimal splits, we evaluate the possible splits only on the data that ended up in the parent node we are trying to split. - [x] The optimal split results in the lowest sum of empirical risks in the child nodes. - [ ] Monotone transformations of several features will change the structure of the tree. - [x] The CART algorithm cannot go on training if every node contains exactly one observation.