Reliability Expert
A Reliability Expert is expert in the reliability of a service or set of features (from here on, we’ll just call it service for brevity).
Reliability Experts typically help to develop the service (in which they may be Specialists) but with explicit attention to the reliability of the service in production. This is measured by the availability and performance of the service on GitLab.com, its impact on the availability and performance of GitLab.com as a whole, and feedback from customers on the reliability of the service on their on-premises installations.
Reliability experts
- work within a team to develop a service or set of features (“service” for brevity).
- develop monitoring and alerting to measure and act on improving the availability, and scalability of the service on GitLab.com.
- develop those aspects of the service’s codebase and deployment that contribute to its reliability.
- take care of the infrastructure related to the service. An expert will be able to mostly build and maintain infrastructure that is specific to the service, but work with the Production Team where infrastructure cannot be isolated for the service.
- radiate knowledge to the infrastructure team about the service, and radiate knowledge of the service’s infrastructure and reliability to the rest of the development team.
- take part in on-call. On-call is not split out by the service that triggers the on-call alert. Doing so would be too much of a burden on the individuals associated with those individual services. This means that Reliability Experts are familiar with GitLab.com’s infrastructure, and emergency response processes.
About GitLab
GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating the rate of human progress. This mission is integral to our culture, influencing how we hire, build products, and lead our industry. We make this possible at GitLab by running our operations on our product and staying aligned with our values. Learn more about Life at GitLab. Thanks to products like Duo Enterprise, and Duo Workflow, customers get the benefit of AI at every stage of the SDLC. The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier. All team members are encouraged and expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact across our global organisation.See our culture page for more!
Work remotely from anywhere in the world. Curious to see what that looks like? Check out our remote manifesto and guides.
Last modified August 2, 2023: Fix markdown lint errors (
78cb7eda
)