Topic 1 Question 88
A manufacturing company has many IoT devices in facilities around the world. The company uses Amazon Kinesis Data Streams to collect data from the devices. The data includes device ID, capture date, measurement type, measurement value, and facility ID. The company uses facility ID as the partition key.
The company's operations team recently observed many WriteThroughputExceeded exceptions. The operations team found that some shards were heavily used but other shards were generally idle.
How should the company resolve the issues that the operations team observed?
Change the partition key from facility ID to a randomly generated key.
Increase the number of shards.
Archive the data on the producer's side.
Change the partition key from facility ID to capture date.
ユーザの投票
コメント(3)
- 正解だと思う選択肢: A
The best solution to resolve the issue of uneven shard usage and WriteThroughputExceeded exceptions is to balance the load more evenly across the shards. This can be effectively achieved by changing the partition key to something that ensures a more uniform distribution of data across the shards.
👍 6tgv2024/06/15 - 正解だと思う選択肢: A
D is not good, because you're effectively making things worse by partitioning by date. My answer is A
👍 2didorins2024/07/03 - 正解だと思う選択肢: A
The correct answer is A. Change the partition key from facility ID to a randomly generated key.
Amazon Kinesis Data Streams uses the partition key that you specify to segregate the data records in the stream into shards. If the company uses the facility ID as the partition key, and if some facilities produce more data than others, then the data will be unevenly distributed across the shards. This can lead to some shards being heavily used while others are idle, and can cause
WriteThroughputExceededexceptions.By changing the partition key to a randomly generated key, the data records are more likely to be evenly distributed across all the shards, which can help to avoid the issue of some shards being heavily used and others being idle. This solution requires the least operational overhead and does not involve increasing costs (as in option B), archiving data (which might not be desirable or feasible, as in option C), or changing to a partition key that might also lead to uneven distribution (as in option D).
👍 2bakarys2024/07/04
シャッフルモード