Examtopics

AWS Certified Machine Learning - Specialty
  • Topic 1 Question 266

    A company hosts a machine learning (ML) dataset repository on Amazon S3. A data scientist is preparing the repository to train a model. The data scientist needs to redact personally identifiable information (PII) from the dataset.

    Which solution will meet these requirements with the LEAST development effort?

    • Use Amazon SageMaker Data Wrangler with a custom transformation to identify and redact the PII.

    • Create a custom AWS Lambda function to read the files, identify the PII, and redact the PII.

    • Use AWS Glue DataBrew to identify and redact the PII.

    • Use an AWS Glue development endpoint to implement the PII redaction from within a notebook.

