Examtopics

AWS Certified Machine Learning - Specialty
  • Topic 1 Question 235

    A company is training machine learning (ML) models on Amazon SageMaker by using 200 TB of data that is stored in Amazon S3 buckets. The training data consists of individual files that are each larger than 200 MB in size. The company needs a data access solution that offers the shortest processing time and the least amount of setup.

    Which solution will meet these requirements?

    • Use File mode in SageMaker to copy the dataset from the S3 buckets to the ML instance storage.

    • Create an Amazon FSx for Lustre file system. Link the file system to the S3 buckets.

    • Create an Amazon Elastic File System (Amazon EFS) file system. Mount the file system to the training instances.

    • Use FastFile mode in SageMaker to stream the files on demand from the S3 buckets.


    シャッフルモード