Topic 1 Question 46
An online reseller has a large, multi-column dataset with one column missing 30% of its data. A Machine Learning Specialist believes that certain columns in the dataset could be used to reconstruct the missing data. Which reconstruction approach should the Specialist use to preserve the integrity of the dataset?
Listwise deletion
Last observation carried forward
Multiple imputation
Mean substitution
ユーザの投票
コメント(8)
C looks correct since multiple imputation can be performed based on the related variable as given in the question
👍 26rsimham2021/09/30Multiple Imputation by Chained Equations or MICE, as per udemy this is always the best answer of all
👍 7harmanbirstudy2021/11/01A common strategy used to impute missing values is to replace missing values with the mean or median value. It is important to understand your data before choosing a strategy for replacing missing values. https://docs.aws.amazon.com/machine-learning/latest/dg/feature-processing.html
👍 2dhs2272021/10/06
シャッフルモード