Topic 1 Question 38
An ML engineer needs to use an Amazon EMR cluster to process large volumes of data in batches. Any data loss is unacceptable. Which instance purchasing option will meet these requirements MOST cost-effectively?
Run the primary node, core nodes, and task nodes on On-Demand Instances.
Run the primary node, core nodes, and task nodes on Spot Instances.
Run the primary node on an On-Demand Instance. Run the core nodes and task nodes on Spot Instances.
Run the primary node and core nodes on On-Demand Instances. Run the task nodes on Spot Instances.
ユーザの投票
コメント(2)
- 正解だと思う選択肢: D
https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-plan-instances-guidelines.html#emr-plan-spot-instances "The task nodes process data but do not hold persistent data in HDFS. If they terminate because the Spot price has risen above your maximum Spot price, no data is lost"
👍 2GiorgioGss2024/11/27 - 正解だと思う選択肢: D
Acceptable data loss: Spot can be used but you can't change an instance purchasing option while a cluster is running. To change from On-Demand to Spot Instances or vice versa, for the primary and core nodes, you must terminate the cluster and launch a new one. For task nodes, you can launch a new task instance group or instance fleet, and remove the old one.
👍 1Saransundar2024/12/04
シャッフルモード