A data scientist is building a model to predict whether a customer will cancel their subscription next month. The dataset includes a column cancellation_date that is only populated after a cancellation actually occurs. What problem does including this column in training introduce?
More Exploratory Data Analysis Questions
48 questions
Full AWS Certified Machine Learning - Specialty Practice Test
All topics covered
All AWS Certified Machine Learning - Specialty Questions
Browse by topic
Related Questions
A machine learning model requires numeric input features. A dataset contains a categorical feature n...
A dataset contains features with very different scales — age (0 to 100) and annual salary (0 to 200,...
A dataset for ML training has a numeric feature with 8% missing values that are Missing Completely a...
A binary classification model for fraud detection is trained on a dataset where 99% of records are n...
A dataset for ML training contains 500 features, many of which are correlated. Training is slow and ...
Educational Content — CertQnA practice questions are written against official exam objectives, covering the same domains tested on the real exam. All content is original and independent — not actual exam questions, not affiliated with any certification vendor. Learn more about our content policy
Discussion
Be the first to share your understanding of this concept
Sign in to join the discussion