Examtopics

AWS Certified Machine Learning - Specialty
  • Topic 1 Question 287

    A data engineer is evaluating customer data in Amazon SageMaker Data Wrangler. The data engineer will use the customer data to create a new model to predict customer behavior.

    The engineer needs to increase the model performance by checking for multicollinearity in the dataset.

    Which steps can the data engineer take to accomplish this with the LEAST operational effort?

    2 つ選択
    • Use SageMaker Data Wrangler to refit and transform the dataset by applying one-hot encoding to category-based variables.

    • Use SageMaker Data Wrangler diagnostic visualization. Use principal components analysis (PCA) and singular value decomposition (SVD) to calculate singular values.

    • Use the SageMaker Data Wrangler Quick Model visualization to quickly evaluate the dataset and to produce importance scores for each feature.

    • Use the SageMaker Data Wrangler Min Max Scaler transform to normalize the data.

    • Use SageMaker Data Wrangler diagnostic visualization. Use least absolute shrinkage and selection operator (LASSO) to plot coefficient values from a LASSO model that is trained on the dataset.


    シャッフルモード