Topic 1 Question 18
Your company is forecasting a sharp increase in the number and size of Apache Spark and Hadoop jobs being run on your local datacenter. You want to utilize the cloud to help you scale this upcoming demand with the least amount of operations work and code change. Which product should you use?
Google Cloud Dataflow
Google Cloud Dataproc
Google Compute Engine
Google Kubernetes Engine
解説
Google Cloud Dataproc is a fast, easy-to-use, low-cost and fully managed service that lets you run the Apache Spark and Apache Hadoop ecosystem on Google Cloud Platform. Cloud Dataproc provisions big or small clusters rapidly, supports many popular job types, and is integrated with other Google Cloud Platform services, such as Google Cloud Storage and Stackdriver Logging, thus helping you reduce TCO. Reference: https://cloud.google.com/dataproc/docs/resources/faq
ユーザの投票
コメント(17)
"B. Google Cloud Dataproc" is the answer
👍 18AWS562020/01/11Dataproc is a managed Spark and Hadoop service that lets you take advantage of open source data tools for batch processing, querying, streaming, and machine learning. Dataproc automation helps you create clusters quickly, manage them easily, and save money by turning clusters off when you don't need them. With less time and money spent on administration, you can focus on your jobs and your data. https://cloud.google.com/dataproc/docs/concepts/overview#:~:text=Dataproc%20is%20a%20managed%20Spark,%2C%20streaming%2C%20and%20machine%20learning.&text=With%20less%20time%20and%20money,your%20jobs%20and%20your%20data.
👍 10VinayakBudapanahalli2021/01/13- 正解だと思う選択肢: B
Google Cloud Dataproc == managed Spark and Hadoop service
👍 2Nirca2022/04/20
シャッフルモード