ExamTopics

Professional Data Engineer
  • Topic 1 Question 286

    You have thousands of Apache Spark jobs running in your on-premises Apache Hadoop cluster. You want to migrate the jobs to Google Cloud. You want to use managed services to run your jobs instead of maintaining a long-lived Hadoop cluster yourself. You have a tight timeline and want to keep code changes to a minimum. What should you do?

    • Move your data to BigQuery. Convert your Spark scripts to a SQL-based processing approach.

    • Rewrite your jobs in Apache Beam. Run your jobs in Dataflow.

    • Copy your data to Compute Engine disks. Manage and run your jobs directly on those instances.

    • Move your data to Cloud Storage. Run your jobs on Dataproc.
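
    Of the options above, the Dataproc route is the one that satisfies both constraints: Dataproc is a managed Spark/Hadoop service, clusters can be ephemeral (created per job and deleted afterward, so no long-lived cluster to maintain), and because Dataproc ships with the Cloud Storage connector, existing Spark code typically runs unchanged apart from swapping hdfs:// paths for gs:// paths. A minimal PySpark sketch of that change follows; the bucket, paths, and column name are hypothetical placeholders, not part of the original question.

        # Minimal PySpark sketch of a lift-and-shift to Dataproc.
        # The only code change for a typical job is pointing I/O at
        # Cloud Storage (gs://) instead of HDFS (hdfs://).
        # Bucket, paths, and column name are hypothetical placeholders.
        from pyspark.sql import SparkSession

        spark = SparkSession.builder.appName("migrated-job").getOrCreate()

        # Before (on-premises): spark.read.parquet("hdfs:///data/events/")
        # After (Dataproc + Cloud Storage connector):
        events = spark.read.parquet("gs://example-bucket/data/events/")

        # The transformation logic itself is untouched by the migration.
        daily_counts = events.groupBy("event_date").count()

        daily_counts.write.mode("overwrite").parquet(
            "gs://example-bucket/output/daily_counts/")

        spark.stop()

    A job like this can then be submitted to the cluster with gcloud dataproc jobs submit pyspark, which is why this option is the best fit for a tight timeline with minimal code changes; the other options all require rewriting the jobs (SQL or Apache Beam) or self-managing infrastructure on Compute Engine.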

