Topic 1 Question 67
A company's data engineer needs to optimize the performance of table SQL queries. The company stores data in an Amazon Redshift cluster. The data engineer cannot increase the size of the cluster because of budget constraints. The company stores the data in multiple tables and loads the data by using the EVEN distribution style. Some tables are hundreds of gigabytes in size. Other tables are less than 10 MB in size. Which solution will meet these requirements?
A. Keep using the EVEN distribution style for all tables. Specify primary and foreign keys for all tables.
B. Use the ALL distribution style for large tables. Specify primary and foreign keys for all tables.
C. Use the ALL distribution style for rarely updated small tables. Specify primary and foreign keys for all tables.
D. Specify a combination of distribution, sort, and partition keys for all tables.
Comments (4)
- Selected answer: C
Use the ALL distribution style for rarely updated small tables. This speeds up joins that involve the small tables and is a common best practice in Redshift data warehousing. For the larger tables, keeping the EVEN distribution style or switching to a KEY-based distribution (if there are common join columns) could be more appropriate.
👍 8 rralucard_ 2024/02/02
- Selected answer: C
For small tables (less than 10 MB in size) that are rarely updated, using the ALL distribution style can provide better query performance. With the ALL distribution style, each compute node stores a copy of the entire table, eliminating the need for data redistribution or shuffling during certain queries. This can significantly improve query performance, especially for joins and aggregations involving small tables.
👍 3 pypelyncar 2024/06/11
Why not D?
👍 3 jk15997 2024/11/11
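The rule of thumb argued in the comments above (ALL for small, rarely updated tables; KEY or EVEN for the large ones) could be sketched roughly as follows. This is only an illustration, not an official sizing rule: the 10 MB cutoff is taken from the question text, and the table and column names (`dim_country`, `fact_sales`, `customer_id`, `staging_events`) are hypothetical. The helper only builds `ALTER TABLE ... ALTER DISTSTYLE` statements as strings; it does not connect to a cluster.

```python
from typing import Optional

# Assumed cutoff for a "small" table; the question only says "less than 10 MB".
SMALL_TABLE_MB = 10


def choose_diststyle(size_mb: float, rarely_updated: bool,
                     join_column: Optional[str] = None) -> str:
    """Pick a Redshift distribution clause using the heuristic from the comments."""
    # Small, rarely updated tables: copy the whole table to every node so
    # joins against it need no redistribution.
    if size_mb < SMALL_TABLE_MB and rarely_updated:
        return "DISTSTYLE ALL"
    # Larger tables with a common join column: co-locate matching rows.
    if join_column:
        return f"DISTSTYLE KEY DISTKEY {join_column}"
    # Otherwise keep spreading rows round-robin across slices.
    return "DISTSTYLE EVEN"


def alter_statement(table: str, size_mb: float, rarely_updated: bool,
                    join_column: Optional[str] = None) -> str:
    """Build the ALTER TABLE statement that applies the chosen style."""
    clause = choose_diststyle(size_mb, rarely_updated, join_column)
    return f"ALTER TABLE {table} ALTER {clause};"


if __name__ == "__main__":
    # Hypothetical tables: one tiny lookup table, two large fact-style tables.
    print(alter_statement("dim_country", 4, True))
    print(alter_statement("fact_sales", 300_000, False, "customer_id"))
    print(alter_statement("staging_events", 200_000, False))
```

Keeping the decision in one small function makes the trade-off explicit: ALL multiplies storage and load cost by the number of nodes, which is why it is reserved here for tables that are both small and rarely updated.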