CI Queue Time Stabilization Working Group
The GitLab CI Queue Time Stabilization Working Group aims to increase the stability and predictability of the CI job queue times on GitLab.com.
Attributes
Property | Value |
---|---|
Date Created | November 1, 2019 |
Date Ended | January 22, 2020 |
Slack | #wg_ci_queue_stability (only accessible from within the company) |
Google Doc | CI Queue Stability Working Group (only accessible from within the company) |
Issue Label | wg_CIQueueStability (gitlab-com/-org) |
Business Goal
Increase the stability and predictability of the CI job queue times on GitLab.com.
The intent is to:
- Analyze and remediate situations where our CI job queue times for shared runners exceed reasonable expectations
- Define metrics and tune alerting that more precisely correspond to the expectations of the CI job queues
- Develop troubleshooting and investigation guides to use in cases of excessive CI job queue times
- Perform predictive analysis on system health and growth and create issues to remediate anticipated future bottlenecks
Exit Criteria
- Creation and tuning of metrics and alerts that trigger when system behaviour no longer matches expectations -> Done
- 1 week of running with the above-mentioned tuned alerts without them firing -> Done
- Published or updated runbook documentation on how to diagnose, respond to, and restore abnormal behavior to normal -> Done
Artifacts
- Updated runbook for the "ci-runners service has a apdex score (latency) below SLO" alert that could be triggered
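
For readers unfamiliar with the apdex score named in that alert, the following is a minimal sketch of how such a score is commonly computed from job queue durations. It is not the working group's actual alerting rule: the function names, the thresholds (60 s satisfied, 300 s tolerated), and the 0.9 SLO are all hypothetical values chosen for illustration.

```python
from typing import Iterable


def apdex(durations_s: Iterable[float], satisfied_s: float, tolerated_s: float) -> float:
    """Return the Apdex score (0.0 to 1.0) for queue durations given in seconds.

    Samples at or below `satisfied_s` count fully, samples above it but at or
    below `tolerated_s` count half, and slower samples count as zero.
    """
    durations = list(durations_s)
    if not durations:
        raise ValueError("need at least one sample")
    satisfied = sum(1 for d in durations if d <= satisfied_s)
    tolerating = sum(1 for d in durations if satisfied_s < d <= tolerated_s)
    return (satisfied + tolerating / 2) / len(durations)


def below_slo(score: float, slo: float) -> bool:
    """True when the score has dropped under the target, i.e. the alert condition."""
    return score < slo


# Hypothetical sample: five jobs and how long each waited in the queue.
queue_times = [5, 20, 45, 90, 400]
score = apdex(queue_times, satisfied_s=60, tolerated_s=300)  # assumed thresholds
alerting = below_slo(score, slo=0.9)  # assumed SLO
```

With these sample values, three jobs are satisfied and one is tolerated, so the score falls below the assumed SLO and the alert condition would hold.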
Roles and Responsibilities
Working Group Role | Person | Title |
---|---|---|
Facilitator | Elliot Rushton | Engineering Manager, Runner |
Exec Sponsor | Christopher Lefelhocz | Senior Director of Development |
Engineering Lead | Tomasz Maczukin | Backend Engineer |
Infrastructure Lead | Andrew Newdigate | Distinguished Engineer, Infrastructure |
Member | Darby Frey | Senior Engineering Manager, Verify |
Member | Steve Azzopardi | Backend Engineer |
Member | Darren Eastman | Senior Product Manager, GitLab-Runner |
Member | Kamil Trzciński | Distinguished Engineer |
Last modified December 18, 2023: Reword Gitlab to GitLab - Third batch (ad19d686)