Topic 1 Question 17
2 つ選択A data engineer is building a data pipeline on AWS by using AWS Glue extract, transform, and load (ETL) jobs. The data engineer needs to process data from Amazon RDS and MongoDB, perform transformations, and load the transformed data into Amazon Redshift for analytics. The data updates must occur every hour. Which combination of tasks will meet these requirements with the LEAST operational overhead?
Configure AWS Glue triggers to run the ETL jobs every hour.
Use AWS Glue DataBrew to clean and prepare the data for analytics.
Use AWS Lambda functions to schedule and run the ETL jobs every hour.
Use AWS Glue connections to establish connectivity between the data sources and Amazon Redshift.
Use the Redshift Data API to load transformed data into Amazon Redshift.
ユーザの投票
コメント(11)
- 正解だと思う選択肢: AD
AWS Glue triggers provide a simple and integrated way to schedule ETL jobs. By configuring these triggers to run hourly, the data engineer can ensure that the data processing and updates occur as required without the need for external scheduling tools or custom scripts. This approach is directly integrated with AWS Glue, reducing the complexity and operational overhead. AWS Glue supports connections to various data sources, including Amazon RDS and MongoDB. By using AWS Glue connections, the data engineer can easily configure and manage the connectivity between these data sources and Amazon Redshift. This method leverages AWS Glue’s built-in capabilities for data source integration, thus minimizing operational complexity and ensuring a seamless data flow from the sources to the destination (Amazon Redshift).
👍 6rralucard_2024/01/21 - 正解だと思う選択肢: AD
A. Configure AWS Glue triggers to run the ETL jobs every hour. Reduced Code Complexity: Glue triggers eliminate the need to write custom code for scheduling ETL jobs. This simplifies the pipeline and reduces maintenance overhead. Scalability and Integration: Glue triggers work seamlessly with Glue ETL jobs, ensuring efficient scheduling and execution within the Glue ecosystem. D. Use AWS Glue connections to establish connectivity between the data sources and Amazon Redshift. Pre-Built Connectors: Glue connections offer pre-built connectors for various data sources like RDS and Redshift. This eliminates the need for manual configuration and simplifies data source access within the ETL jobs. Centralized Management: Glue connections are centrally managed within the Glue service, streamlining connection management and reducing operational overhead.
👍 5pypelyncar2024/06/08 - 正解だと思う選択肢: AD
A - this is obvious and D -https://docs.aws.amazon.com/glue/latest/dg/console-connections.html
👍 4GiorgioGss2024/03/07
シャッフルモード