Examtopics

Professional Machine Learning Engineer
  • Topic 1 Question 218

    You have built a custom model that performs several memory-intensive preprocessing tasks before it makes a prediction. You deployed the model to a Vertex AI endpoint, and validated that results were received in a reasonable amount of time. After routing user traffic to the endpoint, you discover that the endpoint does not autoscale as expected when receiving multiple requests. What should you do?

    • Use a machine type with more memory

    • Decrease the number of workers per machine

    • Increase the CPU utilization target in the autoscaling configurations.

    • Decrease the CPU utilization target in the autoscaling configurations


    シャッフルモード