Fault Tolerance
GitLab has to be a highly-available, mission critical system.
| Property | Value |
|---|---|
| Date Created | October 7, 2019 |
| Date Closed | February 28, 2020 |
| Slack | #wg_isolation (only accessible from within the company) |
| Google Doc | Isolation Working Group Agenda (only accessible from within the company) |
To develop a plan that limits disruption to customers when there are “noisy neighbors” or when other unexpected events occur.
Group Closure Information: All lines of inquiry were incorporated into the Availability and Performance Weekly Meeting.
Decoupled Service (Craig Gomes) was removed from this working group as this will be a future effort for the Memory group.
| Working Group Role | Person | Title |
|---|---|---|
| Executive Stakeholder | Christopher Lefelhocz | Senior Director of Development |
| Facilitator | Rachel Nienaber | Engineering Manager, Geo |
| DRI for Application Level Redis Sharding | Grzegorz Bizon | Staff Backend Engineer, Configure:System |
| DRI for File Storage Isolation | Andrew Newdigate | Distinguished Engineer, Infrastructure |
| DRI for Database Partitioning | Craig Gomes | Engineering Manager, Memory |
| Functional Lead | Ramya Authappan | Quality Engineering Manager, Dev |
| Functional Lead | Fabian Zimmer | Senior Product Manager, Geo |
| Functional Lead | Gerardo “Gerir” Lopez-Fernandez | Engineering Fellow, Infrastructure |
| Functional Lead | Stan Hu | Engineering Fellow, Development |
| Member | George Burdell | Campus Alumni Representative |
| Member | Chun Du | Director of Engineering, Enablement |
| Member | Wayne Haber | Director of Engineering, Protect |
| Member | Nicholas Klick | Engineering Manager, Configure:System |
3affe504)