Operational Excellence

The Operational Excellence meeting is a weekly series for reviewing the health of our platforms

Operational Excellence

The Operational Excellence meeting is a weekly series where Engineering leadership and all Engineering Managers review the health and reliability of our systems, hold teams accountable for customer impact, and drive continuous improvement across the organization.

Meeting Objectives

  • Visibility: Assess the overall health and reliability of our systems. Until we have a single dashboard that provides a unified reliability view of our system, we will review the Customer Metrics Dashboard, Pipeline Dashboard, SaaS Dashboard, and the infradev Tableau.
  • Accountability: Review anomalous metrics, understand the customer impact, and discuss what is being done to mitigate that impact.
  • Learning: Review two S1/S2 incident learnings from the previous weeks. Incidents will be picked by the SRE team.
  • Continuous Improvement: Share operational wins and enforce preventive actions across teams.