Examtopics

Professional Machine Learning Engineer

186
188

Topic 1 Question 187
You recently deployed a scikit-learn model to a Vertex AI endpoint. You are now testing the model on live production traffic. While monitoring the endpoint, you discover twice as many requests per hour than expected throughout the day. You want the endpoint to efficiently scale when the demand increases in the future to prevent users from experiencing high latency. What should you do?
- Deploy two models to the same endpoint, and distribute requests among them evenly
- Configure an appropriate minReplicaCount value based on expected baseline traffic
- Set the target utilization percentage in the autoscailngMetricSpecs configuration to a higher value
- Change the model’s machine type to one that utilizes GPUs
ユーザの投票
コメント(1)
- 正解だと思う選択肢: C
  c as it is dynamic
  
  👍 1
  pikachu0072024/01/12
シャッフルモード

186
188