Topic 1 Question 178
Your company has an application running on Google Cloud that is collecting data from thousands of physical devices that are globally distributed. Data is published to Pub/Sub and streamed in real time into an SSD Cloud Bigtable cluster via a Dataflow pipeline. The operations team informs you that your Cloud Bigtable cluster has a hotspot, and queries are taking longer than expected. You need to resolve the problem and prevent it from happening in the future. What should you do?
Advise your clients to use HBase APIs instead of NodeJS APIs.
Delete records older than 30 days.
Review your RowKey strategy and ensure that keys are evenly spread across the alphabet.
Double the number of nodes you currently have.
ユーザの投票
コメント(17)
Option-C is correct: https://cloud.google.com/bigtable/docs/schema-design#row-keys
👍 12StelSen2021/12/27C. Review your RowKey strategy and ensure that keys are evenly spread across the alphabet.
The RowKey is used to sort data within a Cloud Bigtable cluster. If the keys are not evenly spread across the alphabet, it can result in a hotspot and slow down queries. To prevent this from happening in the future, you should review your RowKey strategy and ensure that keys are evenly spread across the alphabet. This will help to distribute the data evenly across the cluster and improve query performance. Other potential solutions to consider include adding more nodes to the cluster or optimizing your query patterns. However, deleting records older than 30 days or advising clients to use HBase APIs instead of NodeJS APIs would not address the issue of a hotspot in the cluster.
👍 7omermahgoub2022/12/26Suggest to study the following reference, it's important to design the row key pattern in Bigtable. https://cloud.google.com/bigtable/docs/overview#architecture https://cloud.google.com/bigtable/docs/overview#load-balancing
👍 6JoeyCASD2022/05/17
シャッフルモード