ExamTopics

Professional Data Engineer
  • Topic 1 Question 72

    You are designing storage for 20 TB of text files as part of deploying a data pipeline on Google Cloud. Your input data is in CSV format. You want to minimize the cost of querying aggregate values for multiple users who will query the data in Cloud Storage with multiple engines. Which storage service and schema design should you use?

    • Use Cloud Bigtable for storage. Install the HBase shell on a Compute Engine instance to query the Cloud Bigtable data.

    • Use Cloud Bigtable for storage. Link the data as permanent tables in BigQuery for querying.

    • Use Cloud Storage for storage. Link the data as permanent tables in BigQuery for querying.

    • Use Cloud Storage for storage. Link the data as temporary tables in BigQuery for querying.

