Topic 1 Question 363
A data scientist uses Amazon SageMaker Data Wrangler to analyze and visualize data. The data scientist wants to refine a training dataset by selecting predictor variables that are strongly predictive of the target variable. The target variable correlates with other predictor variables.
The data scientist wants to understand the variance in the data along various directions in the feature space.
Which solution will meet these requirements?
A. Use the SageMaker Data Wrangler multicollinearity measurement features with a variance inflation factor (VIF) score. Use the VIF score as a measurement of how closely the variables are related to each other.
B. Use the SageMaker Data Wrangler Data Quality and Insights Report quick model visualization to estimate the expected quality of a model that is trained on the data.
C. Use the SageMaker Data Wrangler multicollinearity measurement features with the principal component analysis (PCA) algorithm to provide a feature space that includes all of the predictor variables.
D. Use the SageMaker Data Wrangler Data Quality and Insights Report feature to review features by their predictive power.
Comments (2)
- Selected answer: C
This approach allows the data scientist to refine the training dataset by selecting the most predictive variables and understanding the variance in the data effectively.
👍 1 MultiCloudIronMan 2024/10/30
https://aws.amazon.com/blogs/machine-learning/detect-multicollinearity-target-leakage-and-feature-correlation-with-amazon-sagemaker-data-wrangler/
Not sure about A or C, but I think it is C.
👍 1 spinatram 2024/11/02
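For anyone weighing the VIF option against the PCA option, here is a rough sketch of what the two measurements compute. It uses plain NumPy on synthetic data and is not Data Wrangler's own implementation. The function names `vif_scores` and `pca_explained_variance` and the toy columns `a`, `b`, `c` are made up for illustration. VIF gives one score per feature, showing how well the other features explain it. PCA reports the variance along each principal direction of the feature space, which is what the question asks about.

```python
import numpy as np


def vif_scores(X):
    """Return one VIF per column: 1 / (1 - R^2), where R^2 comes from
    regressing that column on all the other columns (with an intercept)."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    scores = []
    for j in range(p):
        y = X[:, j]
        others = np.delete(X, j, axis=1)
        A = np.column_stack([np.ones(n), others])  # intercept + other features
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)
        resid = y - A @ coef
        ss_res = resid @ resid
        ss_tot = ((y - y.mean()) ** 2).sum()
        r2 = 1.0 - ss_res / ss_tot
        # Guard against division by zero for perfectly collinear columns.
        scores.append(1.0 / max(1.0 - r2, 1e-12))
    return np.array(scores)


def pca_explained_variance(X):
    """Return (variance per principal direction, share of total variance),
    sorted from the largest direction to the smallest."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                   # center each feature
    s = np.linalg.svd(Xc, compute_uv=False)   # singular values, descending
    variances = s ** 2 / (X.shape[0] - 1)
    return variances, variances / variances.sum()


# Synthetic data (an assumption for the demo): c is almost a copy of a.
rng = np.random.default_rng(0)
a = rng.normal(size=500)
b = rng.normal(size=500)
c = a + 0.05 * rng.normal(size=500)
X = np.column_stack([a, b, c])

vif = vif_scores(X)
variances, ratios = pca_explained_variance(X)

print("VIF per feature:", np.round(vif, 2))
print("Variance per principal direction:", np.round(variances, 4))
print("Share of total variance:", np.round(ratios, 4))
```

In this toy data the near-duplicate columns `a` and `c` get large VIF scores while `b` stays close to 1, and PCA shows almost no variance along the last direction. VIF describes relationships between individual features, whereas the PCA output describes variance along directions in the feature space, which is consistent with the commenters picking C.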