What is data leakage in machine learning?

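Data leakage happens when a model is trained or tuned on information that would not be available at real prediction time, such as the target itself, future observations, or statistics computed from the test set.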

Machine Learning | Medium | Theory

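  • Inflates evaluation scores, so the model looks better in validation than it performs in production
  • Often caused by preprocessing mistakes, e.g. fitting a scaler or imputer on the full dataset before splitting
  • Watch for temporal leakage (training on data from after the prediction point) and target leakage (features derived from the label)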
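
The preprocessing case can be illustrated with a minimal sketch. It assumes a hypothetical toy feature ordered by time, with the last two rows held out as the test set; `fit_scaler` and `transform` are illustrative helper names, not part of any library. The point is only that scaling statistics should come from the training split alone.

```python
import statistics

def fit_scaler(values):
    """Return (mean, std) computed from the given values only."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values) or 1.0  # fall back to 1.0 for constant input
    return mean, std

def transform(values, mean, std):
    """Standardize values with previously fitted statistics."""
    return [(v - mean) / std for v in values]

# Hypothetical toy feature, ordered by time; the last two rows are the test set.
feature = [1.0, 2.0, 3.0, 4.0, 100.0, 120.0]
train, test = feature[:4], feature[4:]

# Leaky: the statistics are computed on train and test together,
# so information about the test rows reaches the training features.
leaky_mean, leaky_std = fit_scaler(feature)

# Leak-free: the statistics come from the training split only
# and are then reused unchanged on the test split.
clean_mean, clean_std = fit_scaler(train)
train_scaled = transform(train, clean_mean, clean_std)
test_scaled = transform(test, clean_mean, clean_std)
```

Splitting by position rather than at random also keeps later rows out of the training set, which is one simple guard against temporal leakage when the data is ordered by time.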