Chapter 22.05: Data Leakage

We explain the concept of train-test leakage, why it leads to overoptimistic performance estimates, and how to avoid it in preprocessing pipelines.

Lecture slides