Topic 1 Question 107
You are monitoring Google Kubernetes Engine (GKE) clusters in a Cloud Monitoring workspace. As a Site Reliability Engineer (SRE), you need to triage incidents quickly. What should you do?
Navigate the predefined dashboards in the Cloud Monitoring workspace, and then add metrics and create alert policies.
Navigate the predefined dashboards in the Cloud Monitoring workspace, create custom metrics, and install alerting software on a Compute Engine instance.
Write a shell script that gathers metrics from GKE nodes, publish these metrics to a Pub/Sub topic, export the data to BigQuery, and make a Data Studio dashboard.
Create a custom dashboard in the Cloud Monitoring workspace for each incident, and then add metrics and create alert policies.
ユーザの投票
コメント(17)
Ans ) A .
👍 53kopper20192021/07/01It's A for me... Create a dashboard for each incident?? I think D isn't a good choice...
👍 24DiegoMDZ2021/07/07Answer is A:
A. Navigate the predefined dashboards in the Cloud Monitoring workspace, and then add metrics and create alert policies.
Cloud Monitoring provides predefined dashboards that display key metrics and resource groupings for various GCP services, including GKE. You can use these dashboards to quickly understand the health of your GKE clusters and identify potential incidents. You can also add additional metrics to the dashboards and create alert policies to be notified when specific conditions are met, allowing you to proactively triage incidents.
Option D, creating a custom dashboard in the Cloud Monitoring workspace for each incident, and then adding metrics and creating alert policies, is not a recommended approach. Creating a separate dashboard for each incident may not be necessary or practical, and may add complexity to your monitoring setup. Additionally, you can add metrics and create alert policies on the predefined dashboards to proactively triage incidents.
👍 10omermahgoub2022/12/22
シャッフルモード