Examtopics

Professional Data Engineer
  • Topic 1 Question 138

    You have several Spark jobs that run on a Cloud Dataproc cluster on a schedule. Some of the jobs run in sequence, and some of the jobs run concurrently. You need to automate this process. What should you do?

    • Create a Cloud Dataproc Workflow Template

    • Create an initialization action to execute the jobs

    • Create a Directed Acyclic Graph in Cloud Composer

    • Create a Bash script that uses the Cloud SDK to create a cluster, execute jobs, and then tear down the cluster


    シャッフルモード