Examtopics

Associate Data Practitioner
  • Topic 1 Question 6

    You work for a healthcare company that has a large on-premises data system containing patient records with personally identifiable information (PII) such as names, addresses, and medical diagnoses. You need a standardized managed solution that de-identifies PII across all your data feeds prior to ingestion to Google Cloud. What should you do?

    • Use Cloud Run functions to create a serverless data cleaning pipeline. Store the cleaned data in BigQuery.

    • Use Cloud Data Fusion to transform the data. Store the cleaned data in BigQuery.

    • Load the data into BigQuery, and inspect the data by using SQL queries. Use Dataflow to transform the data and remove any errors.

    • Use Apache Beam to read the data and perform the necessary cleaning and transformation operations. Store the cleaned data in BigQuery.


    シャッフルモード