Topic 1 Question 22
A company maintains multiple extract, transform, and load (ETL) workflows that ingest data from the company's operational databases into an Amazon S3 based data lake. The ETL workflows use AWS Glue and Amazon EMR to process data. The company wants to improve the existing architecture to provide automated orchestration and to require minimal manual effort. Which solution will meet these requirements with the LEAST operational overhead?
AWS Glue workflows
AWS Step Functions tasks
AWS Lambda functions
Amazon Managed Workflows for Apache Airflow (Amazon MWAA) workflows
ユーザの投票
コメント(17)
- 正解だと思う選択肢: B
Glue Workflow only orchestrate crawlers and glue jobs
👍 14valuedate2024/05/22 - 正解だと思う選択肢: B
For me it's B because I did not found a possibility how Glue can trigger/orchestrate EMR processes OOTB. But with StepFunction there is a way: https://aws.amazon.com/blogs/big-data/orchestrate-amazon-emr-serverless-jobs-with-aws-step-functions/
👍 7DevoteamAnalytix2024/05/03 - 正解だと思う選択肢: A
Since it seems to me that this pipeline is complex, with multiple workflows, I would go for Glue workflows.
👍 6lucas_rfsb2024/03/31
シャッフルモード