Topic 1 Question 226
An organization wants to build a data pipeline to transform its data so it can be reconciled in a data warehouse. The solution must be scalable and require little or no management. Which Google product or service should the organization choose?
A. Cloud Bigtable
B. Cloud Storage
C. Pub/Sub
D. Dataflow
Comments (2)
D: Google Cloud Dataflow is a fully managed service for processing large amounts of data in both batch and streaming modes. It is especially suited to data integration and ETL (extract, transform, load) tasks, which makes it a good fit for preparing data for a data warehouse. It is built on Apache Beam, which provides a unified programming model for defining data processing pipelines. Dataflow scales by adjusting resource allocation dynamically to match the workload, and because it is a managed service, the overhead of managing server infrastructure is minimal.
👍 3 Vivek007 2024/10/29
Selected answer: D
D. Dataflow
Dataflow provides unified stream and batch data processing at scale. Use it to create data pipelines that read from one or more sources, transform the data, and write the data to a destination.
Typical use cases for Dataflow:
- Data movement: Ingesting data or replicating data across subsystems.
- ETL (extract-transform-load) workflows that ingest data into a data warehouse such as BigQuery.
- Powering BI dashboards.
- Applying ML in real time to streaming data.
- Processing sensor data or log data at scale.
Dataflow uses the same programming model for both batch and stream analytics. You can ingest, process, and analyze fluctuating volumes of real-time data.
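To make the read, transform, write shape concrete, here is a minimal sketch of an Apache Beam pipeline in Python, the programming model Dataflow runs. The record fields (`customer_id`, `amount`), the helper names `parse_record` and `normalize`, and the `reconciled` output prefix are all hypothetical, chosen only for illustration. As written it would run locally on Beam's default runner and needs the `apache-beam` package installed.

```python
import json


def parse_record(line):
    """Extract step: parse one JSON line, returning None if it is malformed."""
    try:
        return json.loads(line)
    except json.JSONDecodeError:
        return None


def normalize(record):
    """Transform step: clean the (hypothetical) fields so sources can be reconciled."""
    return {
        "customer_id": str(record["customer_id"]).strip().lower(),
        "amount_cents": round(float(record["amount"]) * 100),
    }


def run(lines, output_prefix="reconciled"):
    # Imported here so the helpers above remain usable without Beam installed.
    import apache_beam as beam

    # With no options given, Beam uses its local runner.
    with beam.Pipeline() as pipeline:
        (
            pipeline
            | "Read" >> beam.Create(lines)  # stand-in for a real source
            | "Parse" >> beam.Map(parse_record)
            | "DropMalformed" >> beam.Filter(lambda r: r is not None)
            | "Normalize" >> beam.Map(normalize)
            | "Format" >> beam.Map(json.dumps)
            | "Write" >> beam.io.WriteToText(output_prefix)  # stand-in for a warehouse sink
        )
```

Calling `run(['{"customer_id": " AB12 ", "amount": "12.50"}'])` would write the cleaned records to text files starting with `reconciled`. To run the same pipeline on Dataflow instead of locally, one would pass pipeline options selecting the Dataflow runner along with a project, region, and temporary storage location; those values depend on your environment and are not shown here.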
👍 1 joshnort 2025/01/14