Topic 1 Question 263
A solutions architect needs to review the design of an Amazon EMR cluster that is using the EMR File System (EMRFS). The cluster performs tasks that are critical to business needs. The cluster is running Amazon EC2 On-Demand Instances at all times for all task, primary, and core nodes. The EMR tasks run each morning, starting at 1:00 AM. and take 6 hours to finish running. The amount of time to complete the processing is not a priority because the data is not referenced until late in the day.
The solutions architect must review the architecture and suggest a solution to minimize the compute costs.
Which solution should the solutions architect recommend to meet these requirements?
Launch all task, primary, and core nodes on Spot Instances in an instance fleet. Terminate the cluster, including all instances, when the processing is completed.
Launch the primary and core nodes on On-Demand Instances. Launch the task nodes on Spot Instances in an instance fleet. Terminate the cluster, including all instances, when the processing is completed. Purchase Compute Savings Plans to cover the On-Demand Instance usage.
Continue to launch all nodes on On-Demand Instances. Terminate the cluster, including all instances, when the processing is completed. Purchase Compute Savings Plans to cover the On-Demand Instance usage.
Launch the primary and core nodes on On-Demand Instances. Launch the task nodes on Spot Instances in an instance fleet. Terminate only the task node instances when the processing is completed. Purchase Compute Savings Plans to cover the On-Demand Instance usage.
ユーザの投票
コメント(17)
- 正解だと思う選択肢: D
The problem statement says: "The EMR tasks run each morning, starting at 1:00 AM. and take 6 hours to finish running. The amount of time to complete the processing is not a priority because the data is not referenced until late in the day."
So later in the day, clients will be using the cluster to read data. Therefore my understanding is that core and primary nodes need to be available, but the task nodes can be terminated once the tasks have finished their daily run.
👍 11aviathor2023/08/28 - 正解だと思う選択肢: D
Correct Answer is D. In B it has no sense to temrinate primary instance if we have already purchase a saving plan.
👍 6javitech832023/06/30 - 正解だと思う選択肢: B
B is the best answer: It provides a balanced approach by using Spot Instances for task nodes to reduce costs and On-Demand Instances for primary and core nodes to ensure cluster stability. Terminating the cluster after processing and purchasing Compute Savings Plans for the On-Demand usage further optimizes costs while maintaining the reliability needed for critical business tasks.
The data can also be accessed via S3 if the cluster is not running, so it's ok to terminate it once the processing completes.
👍 4heatblur2023/11/28
シャッフルモード