Topic 1 Question 43
An ML engineer needs to use data with Amazon SageMaker Canvas to train an ML model. The data is stored in Amazon S3 and is complex in structure. The ML engineer must use a file format that minimizes processing time for the data. Which file format will meet these requirements?
CSV files compressed with Snappy
JSON objects in JSONL format
JSON files compressed with gzip
Apache Parquet files
ユーザの投票
コメント(2)
- 正解だと思う選択肢: D
Parquet is optimized for performance
👍 2GiorgioGss2024/11/27 - 正解だと思う選択肢: D
Minimize processing time: -Why Apache Parquet? Columnar, fast I/O; Efficient for complex data; Built-in compression; SageMaker Canvas compatible
👍 2Saransundar2024/12/05
シャッフルモード