What is data leakage in machine learning?
Data leakage happens when training uses information that would not be available at real prediction time.
- Inflates evaluation scores
- Common in preprocessing mistakes
- Think carefully about time and target leakage
What is data leakage in machine learning?