Examtopics

AWS Certified Machine Learning Engineer - Associate
  • Topic 1 Question 20

    A company has a large, unstructured dataset. The dataset includes many duplicate records across several key attributes. Which solution on AWS will detect duplicates in the dataset with the LEAST code development?

    • Use Amazon Mechanical Turk jobs to detect duplicates.

    • Use Amazon QuickSight ML Insights to build a custom deduplication model.

    • Use Amazon SageMaker Data Wrangler to pre-process and detect duplicates.

    • Use the AWS Glue FindMatches transform to detect duplicates.


    シャッフルモード