Topic 1 Question 21
Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion service in the cloud. Transmitted data includes a payload of several fields and the timestamp of the transmission. If there are any concerns about a transmission, the system re-transmits the data. How should you deduplicate the data most efficiency?
Assign global unique identifiers (GUID) to each data entry.
Compute the hash value of each data entry, and compare it with all historical data.
Store each data entry as the primary key in a separate database and apply an index.
Maintain a database table to store the hash value and other metadata for each data entry.
ユーザの投票
コメント(17)
The best answer is "A". Answer "D" is not as efficient or error-proof due to two reasons
- You need to calculate hash at sender as well as at receiver end to do the comparison. Waste of computing power.
- Even if we discount the computing power, we should note that the system is sending inventory information. Two messages sent at different can denote same inventory level (and thus have same hash). Adding sender time stamp to hash will defeat the purpose of using hash as now retried messages will have different timestamp and a different hash. if timestamp is used as message creation timestamp than that can also be used as a UUID.
👍 55dg632020/07/04Answer: D Description: Using Hash values we can remove duplicate values from a database. Hashvalues will be same for duplicate data and thus can be easily rejected.
👍 24[Removed]2020/03/26- 正解だと思う選択肢: A
I would vote on A, assuming that the re-transmission will submit the same GUID. D can be confused and can cause duplication or incorrect data on two scenarios:
- If the hash includes transmission timestamp, every data transmission will have different hash --> data re-transmission will have different hash --> duplicate
- If the hash data doesn't include transmission timestamp --> the inventory data on different time can have the same hash --> no data with the same value can be inserted to the DB!
👍 4korntewin2023/01/07
シャッフルモード