Topic 1 Question 267

Professional Data Engineer

Topic 1 Question 267
You are creating a data model in BigQuery that will hold retail transaction data. Your two largest tables, sales_transaction_header and sales_transaction_line, have a tightly coupled immutable relationship. These tables are rarely modified after load and are frequently joined when queried. You need to model the sales_transaction_header and sales_transaction_line tables to improve the performance of data analytics queries. What should you do?
- Create a sales_transaction table that holds the sales_transaction_header information as rows and the sales_transaction_line rows as nested and repeated fields.
- Create a sales_transaction table that holds the sales_transaction_header and sales_transaction_line information as rows, duplicating the sales_transaction_header data for each line.
- Create a sales_transaction table that stores the sales_transaction_header and sales_transaction_line data as a JSON data type.
- Create separate sales_transaction_header and sales_transaction_line tables and, when querying, specify the sales_transaction_line first in the WHERE clause.
ユーザの投票
コメント(3)
- 正解だと思う選択肢: A
  In BigQuery, nested and repeated fields can significantly improve performance for certain types of queries, especially joins, because the data is co-located and can be read efficiently. - - This approach is often used in data warehousing scenarios where query performance is a priority, and the data relationships are immutable and rarely modified.
  👍 2
  raaad2024/01/05
- 正解だと思う選択肢: A
  A. Create a sales_transaction table that holds the sales_transaction_header information as rows and the sales_transaction_line rows as nested and repeated fields.
  
  👍 1
  scaenruy2024/01/03
- 正解だと思う選択肢: A
  Option A
  
  👍 1
  Matt_1082024/01/13
シャッフルモード

ユーザの投票

コメント(3)