Topic 1 Question 170
You need to deploy a scikit-learn classification model to production. The model must be able to serve requests 24/7, and you expect millions of requests per second to the production application from 8 am to 7 pm. You need to minimize the cost of deployment. What should you do?
A. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 1.
B. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 100.
C. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 1.
D. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 100.
Comments (3)
- Selected answer: B
B. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 100. A maximum of 100 replicas lets the endpoint scale out to handle the high request volume expected during peak hours. A higher maximum can raise costs at peak, but the endpoint autoscales between its minimum and maximum replica counts based on load, so it scales back in outside peak hours and you pay for the extra replicas only while they are needed.
👍 1 pikachu007 2024/01/10
- Selected answer: B
scikit-learn doesn't support GPUs: https://scikit-learn.org/stable/faq.html#will-you-add-gpu-support
👍 1 BlehMaks 2024/01/12
B. We don't need a GPU for scikit-learn.
👍 1 36bdc1e 2024/01/13
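To make the reasoning in the comments concrete, here is a minimal sketch of what option B could look like with the Vertex AI Python SDK (`google-cloud-aiplatform`), plus a rough replica-hour comparison. The machine type, the helper names, and the assumption that the endpoint runs at 100 replicas for the whole peak window and 1 replica otherwise are all illustrative and not taken from the question, so read the numbers as an upper-bound illustration rather than a cost estimate.

```python
# Sketch only: helper names and the machine type are hypothetical choices.
PEAK_START_HOUR = 8   # 8 am, from the question
PEAK_END_HOUR = 19    # 7 pm, from the question


def deploy_kwargs(max_replicas: int = 100) -> dict:
    """Keyword arguments for Model.deploy() matching option B (CPU only)."""
    return {
        "machine_type": "n1-standard-4",  # assumed CPU machine type
        "min_replica_count": 1,           # keeps the endpoint serving 24/7
        "max_replica_count": max_replicas,
        # No accelerator_type / accelerator_count: scikit-learn runs on CPU.
    }


def daily_replica_hours(peak_replicas: int, off_peak_replicas: int) -> int:
    """Replica-hours per day if the replica count only changes at peak boundaries."""
    peak_hours = PEAK_END_HOUR - PEAK_START_HOUR  # 11 hours
    return peak_replicas * peak_hours + off_peak_replicas * (24 - peak_hours)


def deploy_model(model_resource_name: str):
    """Deploy an already-uploaded model; needs GCP credentials and a project."""
    # Imported here so the arithmetic above runs without the SDK installed.
    from google.cloud import aiplatform

    model = aiplatform.Model(model_resource_name)
    return model.deploy(**deploy_kwargs())


if __name__ == "__main__":
    print("autoscaled:", daily_replica_hours(100, 1))
    print("always 100:", daily_replica_hours(100, 100))
```

Under these assumptions, autoscaling between 1 and 100 replicas uses at most 100 × 11 + 1 × 13 replica-hours per day, compared with 100 × 24 for a fleet pinned at 100 replicas, which is why a high max replica count does not by itself mean a high bill.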