Data Leakage
/ˈdeɪtə ˈliːkɪdʒ/
Definition
Using information in model training that would not be available at prediction time, causing inflated performance metrics.
Example in context
"We included the total at checkout in training — it's derived from items bought, so it leaks the target label."
Related terms
Practice this term
Master Data Leakage in context by working through exercises in the Data Science & ML module. You'll see the term used in real engineering scenarios with multiple-choice, fill-in-the-blank, and matching drills.