Examtopics

Professional Cloud DevOps Engineer
  • Topic 1 Question 79

    You support a service that recently had an outage. The outage was caused by a new release that exhausted the service memory resources. You rolled back the release successfully to mitigate the impact on users. You are now in charge of the post-mortem for the outage. You want to follow Site Reliability Engineering practices when developing the post-mortem. What should you do?

    • Focus on developing new features rather than avoiding the outages from recurring.

    • Focus on identifying the contributing causes of the incident rather than the individual responsible for the cause.

    • Plan individual meetings with all the engineers involved. Determine who approved and pushed the new release to production.

    • Use the Git history to find the related code commit. Prevent the engineer who made that commit from working on production services.


    シャッフルモード