A Reliability Expert is an expert in the reliability of a service or set of features (from here on, simply “service” for brevity).
Reliability Experts typically help develop the service (in which they may also be Specialists), with explicit attention to its reliability in production. Reliability is measured by the availability and performance of the service on GitLab.com, its impact on the availability and performance of GitLab.com as a whole, and feedback from customers on the reliability of the service on their on-premises installations.
Reliability Experts:
- work within a team to develop a service or set of features.
- develop monitoring and alerting to measure, and act on improving, the availability and scalability of the service on GitLab.com.
- develop those aspects of the service’s codebase and deployment that contribute to its reliability.
- take care of the infrastructure related to the service. An expert can build and maintain most of the infrastructure that is specific to the service, and works with the Production Team where infrastructure cannot be isolated for the service.
- radiate knowledge to the infrastructure team about the service, and radiate knowledge of the service’s infrastructure and reliability to the rest of the development team.
- take part in on-call. On-call is not split out by the service that triggers the alert, because doing so would place too great a burden on the individuals associated with each service. This means that Reliability Experts are familiar with GitLab.com’s infrastructure and emergency response processes.
GitLab Inc. is a company based on the GitLab open-source project. GitLab is a community project to which over 2,200 people worldwide have contributed. We are an active participant in this community, trying to serve its needs and lead by example. We have one vision: everyone can contribute to all digital content, and our mission is to change all creative work from read-only to read-write so that everyone can contribute.
We value results, transparency, sharing, freedom, efficiency, self-learning, frugality, collaboration, directness, kindness, diversity, inclusion and belonging, boring solutions, and quirkiness. If these values match your personality, work ethic, and personal goals, we encourage you to visit our primer to learn more. Open source is our culture, our way of life, our story, and what makes us truly unique.
Top 10 Reasons to Work for GitLab:
- Mission: Everyone can contribute
- Results: Fast growth, ambitious vision
- Flexible Work Hours: Plan your day so you are there for other people & have time for personal interests
- Transparency: Over 2,000 webpages in GitLab handbook, GitLab Unfiltered YouTube channel
- Iteration: Empower people to be effective & have an impact, merge request rate, dogfooding our own product, directly responsible individuals
- Diversity, Inclusion & Belonging: A focus on gender parity, Team Member Resource Groups, other initiatives
- Collaboration: Kindness, saying thanks, intentionally organize informal communication, no ego
- Total Rewards: Competitive market rates for compensation, Equity compensation, global benefits (inclusive of office equipment)
- Work/Life Harmony: Flexible workday, Family and Friends days
- Remote Done Right: One of the world's largest all-remote companies, prolific inventor of remote best practices
See our culture page for more!
Work remotely from anywhere in the world. Curious to see what that looks like? Check out our remote manifesto and guides.