Topic 1 Question 11
A data engineer needs Amazon Athena queries to finish faster. The data engineer notices that all the files the Athena queries use are currently stored in uncompressed .csv format. The data engineer also notices that users perform most queries by selecting a specific column. Which solution will MOST speed up the Athena query performance?
A. Change the data format from .csv to JSON format. Apply Snappy compression.
B. Compress the .csv files by using Snappy compression.
C. Change the data format from .csv to Apache Parquet. Apply Snappy compression.
D. Compress the .csv files by using gzip compression.
Comments (8)
- Selected answer: C
If the exam only had these kinds of questions, everyone would be blessed.
👍 11 milofficial 2024/01/18
C. Change the data format from .csv to Apache Parquet. Apply Snappy compression.
Explanation: Apache Parquet is a columnar storage format optimized for analytical queries. It is highly efficient for query performance, especially when queries involve selecting specific columns, as it allows for column pruning and predicate pushdown optimizations.
👍 6 TonyStark0122 2024/02/01
- Selected answer: C
👍 2 wa212 2024/04/08
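
To make the column-pruning argument from the explanation above concrete, here is a minimal sketch in plain Python. It is a toy model, not Parquet and not Athena: the sample data, the column names (`order_id`, `customer`, `amount`), and the helper functions are all hypothetical. It only illustrates why a layout that stores each column contiguously lets a query that selects one column read far fewer bytes than a row-oriented .csv file.

```python
import csv
import io

# Hypothetical sample data; column names are made up for illustration.
rows = [
    {"order_id": str(i), "customer": f"customer-{i}", "amount": str(i * 10)}
    for i in range(1000)
]


def to_row_layout(records):
    """Serialize as CSV: values of all columns are interleaved row by row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(records[0].keys()))
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue().encode("utf-8")


def to_column_layout(records):
    """Store each column as its own contiguous chunk (toy stand-in for a columnar file)."""
    return {
        name: "\n".join(r[name] for r in records).encode("utf-8")
        for name in records[0].keys()
    }


def select_from_rows(blob, column):
    """Selecting one column from CSV still means parsing the whole file."""
    reader = csv.DictReader(io.StringIO(blob.decode("utf-8")))
    return [r[column] for r in reader], len(blob)


def select_from_columns(chunks, column):
    """Selecting one column from the columnar layout touches only that chunk."""
    chunk = chunks[column]
    return chunk.decode("utf-8").split("\n"), len(chunk)


csv_blob = to_row_layout(rows)
col_chunks = to_column_layout(rows)

csv_values, csv_bytes = select_from_rows(csv_blob, "amount")
col_values, col_bytes = select_from_columns(col_chunks, "amount")

print(f"row layout bytes scanned:    {csv_bytes}")
print(f"column layout bytes scanned: {col_bytes}")
```

Both reads return the same values, but the columnar read scans only the `amount` chunk. Real Parquet files add per-column encoding and compression (for example Snappy) on top of this layout, which reduces the scanned bytes further; those parts are not modeled here.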