Examtopics

Professional Machine Learning Engineer
  • Topic 1 Question 280

    You are using Kubeflow Pipelines to develop an end-to-end PyTorch-based MLOps pipeline. The pipeline reads data from BigQuery, processes the data, conducts feature engineering, model training, model evaluation, and deploys the model as a binary file to Cloud Storage. You are writing code for several different versions of the feature engineering and model training steps, and running each new version in Vertex AI Pipelines. Each pipeline run is taking over an hour to complete. You want to speed up the pipeline execution to reduce your development time, and you want to avoid additional costs. What should you do?

    • Comment out the part of the pipeline that you are not currently updating.

    • Enable caching in all the steps of the Kubeflow pipeline.

    • Delegate feature engineering to BigQuery and remove it from the pipeline.

    • Add a GPU to the model training step.


    シャッフルモード