Topic 1 Question 256
You work for an online grocery store. You recently developed a custom ML model that recommends a recipe when a user arrives at the website. You chose the machine type on the Vertex AI endpoint to optimize costs by using the queries per second (QPS) that the model can serve, and you deployed it on a single machine with 8 vCPUs and no accelerators.
A holiday season is approaching and you anticipate four times more traffic during this time than the typical daily traffic. You need to ensure that the model can scale efficiently to the increased demand. What should you do?
- Maintain the same machine type on the endpoint.
- Set up a monitoring job and an alert for CPU usage.
- If you receive an alert, add a compute node to the endpoint.
- Change the machine type on the endpoint to have 32 vCPUs.
- Set up a monitoring job and an alert for CPU usage.
- If you receive an alert, scale the vCPUs further as needed.
- Maintain the same machine type on the endpoint Configure the endpoint to enable autoscaling based on vCPU usage.
- Set up a monitoring job and an alert for CPU usage.
- If you receive an alert, investigate the cause.
- Change the machine type on the endpoint to have a GPU. Configure the endpoint to enable autoscaling based on the GPU usage.
- Set up a monitoring job and an alert for GPU usage.
- If you receive an alert, investigate the cause.
ユーザの投票
コメント(2)
- 正解だと思う選択肢: B
Voting for B as it's the only option to autoscale even though the cost will go up. All other options include manual intervention.
👍 1kalle_balle2024/01/09 - 正解だと思う選択肢: C
Cost Optimization: It starts with the current machine type, avoiding unnecessary upfront costs, and scales only when needed. Autoscaling: It automatically adjusts compute resources based on vCPU usage, ensuring the endpoint can handle traffic spikes without manual intervention. Monitoring and Alerting: It provides visibility into resource usage and triggers alerts for potential issues, enabling proactive actions. Investigation: It encourages investigation of alerts to identify any underlying problems beyond expected traffic growth, ensuring overall system health.
👍 1pikachu0072024/01/13
シャッフルモード