Examtopics

AWS Certified Machine Learning Engineer - Associate
  • Topic 1 Question 40

    A company is planning to create several ML prediction models. The training data is stored in Amazon S3. The entire dataset is more than 5 ТВ in size and consists of CSV, JSON, Apache Parquet, and simple text files. The data must be processed in several consecutive steps. The steps include complex manipulations that can take hours to finish running. Some of the processing involves natural language processing (NLP) transformations. The entire process must be automated. Which solution will meet these requirements?

    • Process data at each step by using Amazon SageMaker Data Wrangler. Automate the process by using Data Wrangler jobs.

    • Use Amazon SageMaker notebooks for each data processing step. Automate the process by using Amazon EventBridge.

    • Process data at each step by using AWS Lambda functions. Automate the process by using AWS Step Functions and Amazon EventBridge.

    • Use Amazon SageMaker Pipelines to create a pipeline of data processing steps. Automate the pipeline by using Amazon EventBridge.


    シャッフルモード