Feature Testing Working Group

Establish the credibility of a permanent alternative to the usage of Capybara for Feature Testing.

Attributes

Property	Value
Date Created	2024-11-01
Target End Date	2025-02-01
Slack	#wg_feature-testing
Google Doc	https://docs.google.com/document/d/1ZS4L-vVVVqRAjdOmr4X8ENYD5YEyFxEV8wxuR1OtNvE/edit?tab=t.0
Epic
Overview & Status	See Exit Criteria

Context

The current approach to feature testing, using RSpec and Capybara, has a number of drawbacks:

Due to lack of coverage or quantity of quarantined specs, the feature suite overall provides limited confidence in code changes and fails to catch regressions.
Low level of stability leads to high frequency of broken master.
Limited debugging tools make it difficult to create stable tests and debug flaky ones.
Maintenance of tests written in Ruby is frequently the responsibility of Frontend engineers, who may or may not have skill in this language.

Goals

This Working Group has the following goals:

Establish the credibility of an alternative JavaScript-based testing system, Playwright, as an alternative to Capybara.
Create a proof of concept using Playwright with a part of the GitLab platform.
Create an architecture blueprint with a strategy on how to migrate to Playwright.

Exit Criteria

Criteria	Start Date	DRI
CI/CD and environment setup	2024-12-11	Javiera Tapia
3 converted spec examples	2024-12-11	Natalia Tepluhina
Migration Plan

Details

CI/CD and environment setup

Need to determine how to spin up a Playwright server within the GitLab build process.

3 converted spec examples

Full examples for an apples-to-apples comparison with currently flaky tests:

The plan is to measure and compare the following metrics:

% of runs failed
Time to run spec
Debugging steps

Migration Plan

Strategy to take to gradually migrate to Playwright.

Update 2025-01-22

Current Progress and Challenges

After extensive efforts on the proof of concept to replace Capybara with Playwright, the working group has encountered significant challenges that impede further progress:

Authentication Issues: Integrating Playwright with our existing authentication mechanisms has proven to be complex and while seemingly solved, it is in a way that makes it difficult to troubleshoot future problems.
Increased Complexity: Troubleshooting the Playwright POC revealed that migrating to a new testing framework introduces additional layers of complexity, making maintenance and debugging more challenging than anticipated.
Resource Constraints: The time and resources required to overcome these roadblocks are substantial, diverting focus from other critical testing improvements.

Given these obstacles, the working group recommends discontinuing the current experiment to replace Capybara with Playwright.

Recommendation

With the conclusion of the Playwright experiment, the working group has closed. We propose the following recommendations to increase testing coverage and reduce flakiness within our existing Capybara/RSpec framework:

Increase Test Coverage:

Identify Gaps: Conduct a thorough analysis to identify areas with insufficient test coverage and prioritize adding tests to those regions.
End-to-End (E2E) Tests: Gradually introduce E2E tests for critical user flows, encouraging authoring of these E2E tests by frontend/fullstack engineers.

Training and Documentation:

Skill Development: Provide training sessions for engineers on best practices with Capybara and RSpec to improve test writing and maintenance.

Regular Maintenance and Review:

Flaky Test Identification: Establish a routine for identifying and addressing flaky tests promptly to maintain test suite reliability.
Continuous Improvement: Foster a culture of continuous improvement where feedback from test runs is regularly used to enhance testing strategies.

By implementing these recommendations, we aim to strengthen our feature testing framework, increase coverage, and significantly reduce flakiness, thereby enhancing overall code quality and stability.

Roles and Responsibilities

Working Group Role	Person	Title
Executive Sponsor	Tim Zallmann	VP of Engineering, Core Development
Facilitator	Donald Cook	Engineering Manager, Plan:Project Management
Functional Lead	Natalia Tepluhina	Principal Engineer, Plan
Functional Lead	Ksenia Kolpakova	Engineering Manager, Test Engineering
Functional Lead	Javiera Tapia	Backend Engineer, Create:Source Code
Member	Désirée Chevalier	Senior Software Engineer in Test, Plan
Member	Doug Stull	Staff FullStack Engineer in Growth

Last modified May 29, 2025: 1st iteration of feature testing reco (923f26e3)

View page source - Edit this page - please contribute.