Step Runner for executing GitLab Steps

This page contains information related to upcoming products, features, and functionality. It is important to note that the information presented is for informational purposes only. Please do not rely on this information for purchasing or planning purposes. The development, release, and timing of any products, features, or functionality may be subject to change or delay and remain at the sole discretion of GitLab Inc.

Status	Authors	Coach	DRIs	Owning Stage	Created
proposed	`ayufan` `josephburnett`	`grzesiek`	`dhershkovitch` `DarrenEastman` `cheryl.li`	devops verify	2023-08-23

Summary

This document describes the architecture of a new component called Step Runner, the GitLab Steps syntax it uses, and how GitHub Actions support will be achieved.

Competing CI products such as drone.io and GitHub Actions compose CI job execution from steps or actions.

Their usage and our prior evaluation of GitLab Runner Plugins show a need for a better way to define CI job execution.

Glossary

GitLab Steps: the name of a GitLab CI feature to define and use reusable components within a single job execution context.
Step Runner: an RFC implementation for GitLab Steps that provides compatibility with GitHub Actions.
GitHub Actions: similar to GitLab Steps, a reusable execution component used on GitHub.
CI Catalog: a public or private component catalog that could be used to discover and use shared components.
GitLab Rails: the main application responsible for pipeline execution, running on GitLab.com or on an on-premise installation.

Motivation

Even though the current .gitlab-ci.yml file is reasonably flexible, it quickly becomes complex when it has to support complex workflows. This complexity shows up as repetitive patterns, purpose-specific syntax, or long sequences of commands to execute.

This is particularly challenging because the .gitlab-ci.yml file is inflexible for workflows that require fine-tuning or special behavior during CI job execution. Its prescriptive approach to how Git cloning is handled, when artifacts are downloaded, and how the shell script is executed often forces users to work around the system when pipelines are not “standard” or when new features are requested.

This proves especially challenging when adding new syntax to the .gitlab-ci.yml file to support a specific feature, like secure files or the release: keyword. Adding these special features at the syntax level results in a more complex config, which is harder to maintain and makes technical debt harder to deal with when requirements change.

The examples of drone.io and GitHub Actions show that many workflows do not have to be part of the CI syntax. Instead, they can be provided in the form of reusable components that are configured in a generic way in the CI config, and later downloaded and executed according to inputs and parameters.

GitLab Steps is meant to fill that product gap by following a model similar to that of competitors and, to some extent, staying compatible with them. GitLab Steps is meant to replace all purpose-specific syntax that handles specific features. Providing and using reusable components that are built outside of .gitlab-ci.yml, versioned, and requested when needed gives customers much more flexibility and allows us to iterate on a catalog much faster.

The reusable components that are part of a CI job execution could be used from a publicly hosted repository on GitLab.com, from an on-premise repository of steps, or be fetched from the local project.

Each CI job would define a list of steps: to execute, which would reference GitLab Steps or GitHub Actions. Those steps would be executed by Step Runner directly in the context of the target environment. GitLab Runner would be responsible for connecting GitLab.com (or an on-premise installation) and Step Runner.

Goals

GitLab Steps:

GitLab Steps defines a syntax and structure for the GitLab-specific Steps implementation.
GitLab Steps are published in the CI Catalog.
GitLab Steps can be used across instances (federation).
GitLab Steps define inputs and outputs.
GitLab Steps need to explicitly request sensitive information with expected permissions. For example: secrets, variables, tokens.

GitLab Inc. managed repository of GitLab Steps:

GitLab Inc. provides a repository of GitLab Steps that are a drop-in replacement for all current purpose-specific syntax: artifacts:, cache:, release:, etc.
GitLab Inc. will provide a generic step to execute shell steps supporting various shells (bash, powershell).
The usage of purpose-specific syntax might eventually be deprecated in favor of steps.

Step Runner:

Step Runner is hosted in a separate project in https://gitlab.com/gitlab-org.
Step Runner can be used to execute most GitHub Actions.
Step Runner is run as a process in a target environment.
Step Runner can be used by users on their local machine to run the steps of a specific CI job from a locally stored .gitlab-ci.yml.
Step Runner is an external component to GitLab Runner. GitLab Runner provisions the environment, constructs the payload, and passes execution to Step Runner.
Step Runner will replace all custom handling in GitLab Runner for clone, artifacts, caches, script and after_script, and custom handling for all different shells (bash, powershell).
Step Runner is responsible for parsing and compiling GitLab Steps and GitHub Actions.
Step Runner is responsible for downloading and managing repositories required by GitLab Steps and GitHub Actions.
Step Runner controls and monitors the execution flow of individual steps.
Step Runner is required to be executable from the command-line interface (CLI). This means it can be configured via a config file or an environment file, or it can read .gitlab-ci.yml.
Step Runner can expose gRPC or another programmable interface to run a config or get a trace from.

Steps Execution:

Each Step is defined by a single published or locally defined GitLab Step, or GitHub Action.
Each Step is executed depending on conditions that are defined by that step.
Each Step is executed with the least amount of information exposed. Information exposed to a step is requested explicitly by the step. For example: only environment variables explicitly requested by the step will be passed to the step.
Each Step is considered untrusted. It means that even though some steps are trusted, the whole CI job should be considered untrusted, since the system cannot guarantee trust.
Each Step describes its execution in the form of preconditions, versions used, and output produced. This is meant to allow step execution to be signed for the purpose of creating reproducible builds.
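
The "least amount of information exposed" rule can be illustrated with a minimal sketch. This is not Step Runner code: the function name `build_step_env` and the idea that a step declares its requested variables as a plain list of names are assumptions made for illustration only.

```python
def build_step_env(job_env, requested):
    """Return only the variables the step explicitly requested.

    `job_env` is everything available to the job; `requested` is the
    (hypothetical) list of variable names declared by the step.
    """
    return {name: job_env[name] for name in requested if name in job_env}


# The job has access to a token, but the step only asked for NAME.
job_env = {"CI_JOB_TOKEN": "secret-token", "NAME": "world", "HOME": "/root"}
step_env = build_step_env(job_env, requested=["NAME"])
print(step_env)  # {'NAME': 'world'}
```

The point of the sketch is that exposure is opt-in: anything the step did not ask for, including tokens, never reaches it.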

Backward compatibility:

All currently executable syntax (for example: before_script:, script:, artifacts:, cache:, etc.) should be convertible by GitLab (Rails)

Non-Goals

TBD

Proposal

Step Runner will be a new Go binary which lives at https://gitlab.com/gitlab-org/step-runner. It will be able to accept a number of input formats which are compiled to a standard proto format. Output will be a standard proto trace which will include details for debugging and reproducing the build.

Capabilities

Read steps
- from environment variable
- from .gitlab-ci.yml file
- from gRPC server in step-runner
- from command-line JSON input
Compile GitLab Steps and GitHub Actions to a baseline step definition
- explicit inputs and outputs
- explicit environment and exports
- baseline steps can be type exec or more steps
Download and run steps from:
- Git repos
- zip files
- locally provided
A job can be composed of different kinds of steps
- steps can come from different sources and be run in different ways
- steps can access environment exports and output of previous steps
Produce a step-by-step trace of execution
- including final inputs and outputs
- including final environment and exports
- including logs of each step
- each step specifies the exact runtime and component used (hash)
- (optional) masking sensitive inputs, outputs, environment and exports
Replaying a trace
- reuses the exact runtimes and components from trace
- output of trace will be the same trace if build is deterministic
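
As a rough illustration of the trace capabilities listed above, the sketch below records one trace entry per step, pins the component by hash, masks sensitive values, and checks that a replay of a deterministic step yields an identical entry. The real trace is a proto message; the dictionary layout, the field names, and the `[MASKED]` placeholder here are all assumptions.

```python
import hashlib


def mask(values, sensitive):
    """Replace the values of sensitive keys with a placeholder."""
    return {k: ("[MASKED]" if k in sensitive else v) for k, v in values.items()}


def trace_entry(ref, component, inputs, outputs, sensitive=frozenset()):
    """Build one (hypothetical) trace entry for an executed step."""
    return {
        "ref": ref,
        # Pin the exact component that ran so a replay can reuse it.
        "component_sha256": hashlib.sha256(component).hexdigest(),
        "inputs": mask(inputs, sensitive),
        "outputs": mask(outputs, sensitive),
    }


args = dict(
    ref="hello-step",
    component=b"echo hello",
    inputs={"greeting": "hello", "token": "s3cret"},
    outputs={"message": "hello world"},
    sensitive={"token"},
)
original = trace_entry(**args)
replayed = trace_entry(**args)  # same component, same inputs
print(original == replayed)  # True
```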

Example invocations

Command line

STEPS=$(cat steps.yml) step-runner ci
step-runner local .gitlab-ci.yml --format gitlab-ci --job-name hello-world --output-file trace.json
step-runner replay trace.json
step-runner ci --port 8080

GitLab CI

hello-world:
  image: registry.gitlab.com/gitlab-org/step-runner
  variables:
    STEPS: |
      - step: gitlab.com/josephburnett/component-hello-steppy@master
        inputs:
          greeting: "hello ${{ env.name }}"
        env:
          name: world
  script:
    - /step-runner ci
  artifacts:
    paths:
      - trace.json
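
The `${{ env.name }}` expression in the example above is resolved against the step's `env` block. A minimal sketch of that substitution follows. It only handles the `env.` namespace shown in the example; the full expression language is not specified in this section, so the regular expression and the function name `interpolate` are assumptions.

```python
import re

# Matches expressions of the form ${{ env.<name> }} (assumed syntax subset).
EXPRESSION = re.compile(r"\$\{\{\s*env\.(\w+)\s*\}\}")


def interpolate(template, env):
    """Substitute ${{ env.X }} expressions with values from `env`."""
    return EXPRESSION.sub(lambda match: env[match.group(1)], template)


print(interpolate("hello ${{ env.name }}", {"name": "world"}))  # hello world
```

In this sketch, referencing a variable that is not defined raises `KeyError` rather than silently producing an empty string.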

Basic compilation and execution process

Steps as expressed in GitLab CI are compiled to the baseline step definition. Referenced steps are loaded and compiled to produce an exec command, or to produce an additional list of GitLab CI steps which are compiled recursively. Each step is executed immediately after compilation so its output will be available for subsequent compilations.
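
The recursive compile-then-execute loop can be sketched as follows. This is an illustration under stated assumptions, not the Step Runner implementation: step definitions live in an in-memory dictionary called `REGISTRY`, a definition is either `exec` or `steps`, and execution is a callback that only records commands.

```python
# Hypothetical step definitions keyed by reference.
REGISTRY = {
    "greet": {"exec": ["echo", "hello"]},
    "test": {"exec": ["echo", "testing"]},
    "build-and-test": {"steps": [{"step": "greet"}, {"step": "test"}]},
}


def run(invocations, registry, execute):
    """Compile each invocation and execute it before moving to the next."""
    for invocation in invocations:
        definition = registry[invocation["step"]]  # load the referenced step
        if "exec" in definition:
            execute(definition["exec"])  # leaf: run the command immediately
        else:
            run(definition["steps"], registry, execute)  # compile recursively


executed = []
run([{"step": "build-and-test"}], REGISTRY, executed.append)
print(executed)  # [['echo', 'hello'], ['echo', 'testing']]
```

Because each leaf runs before the next invocation is compiled, a later step could read the outputs of an earlier one during its own compilation.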

diagram of data during compilation

Steps return outputs and exports via files which are collected by Step Runner after each step. Finally, all the compiled inputs and outputs for each step are collected in a step trace.
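
A minimal sketch of collecting outputs from a file after a step finishes is shown below. The environment variable name `STEP_OUTPUT_FILE` and the `key=value` line format are assumptions; this section does not specify either. For portability, the example step is a small Python one-liner rather than a shell command.

```python
import os
import subprocess
import sys
import tempfile


def run_step(command):
    """Run a step and return the outputs it wrote to its output file."""
    with tempfile.TemporaryDirectory() as workdir:
        output_path = os.path.join(workdir, "output")
        open(output_path, "w").close()  # create an empty output file
        # The sketch inherits the parent environment for simplicity;
        # a real runner would restrict what the step can see.
        step_env = dict(os.environ, STEP_OUTPUT_FILE=output_path)
        subprocess.run(command, env=step_env, check=True)
        with open(output_path) as f:
            return dict(line.rstrip("\n").split("=", 1) for line in f if "=" in line)


step = [
    sys.executable,
    "-c",
    "import os; open(os.environ['STEP_OUTPUT_FILE'], 'a').write('greeting=hello world\\n')",
]
outputs = run_step(step)
print(outputs)  # {'greeting': 'hello world'}
```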

sequenced diagram of step runner compilation and execution

GitLab Steps definition and syntax

Integration of GitLab Steps

Design and implementation details

2023-11-28 - GitLab Steps ADR 001: Bootstrap Step Runner

See the GitLab Steps ADR 001: Bootstrap Step Runner.
See the Baseline Step Proto.

References

GitLab Issue #215511
Step Runner Code. This is the exploratory code created during the writing of this blueprint. It shows the structure of the Step Runner binary and how the pieces fit together. It runs but doesn’t quite do the right thing (see all the TODOs).
CI Steps / CI Events / Executors / Taskonaut (video). Some high-level discussion about how these 4 blueprints relate to each other. It is also a good prequel to the video about this MR.
Steps in Runner (video). A walk through of the Step Runner details from the code perspective.
CI YAML keywords. An inventory of affected keywords.
GitLab Epic 11535

GitLab Steps ADR 001: Bootstrap Step Runner

Context

GitLab Steps is a new feature that does not have any prior usage at GitLab. We decided that there are two important objectives at this stage of the project:

Integrate the project into existing CI pipelines for the purpose of user evaluation as part of an experiment phase.
Provide a contribution framework for other developers in the form of a project with contribution guidelines.

Decision

The GitLab Steps: Iteration 1: Bootstrap Step Runner (MVC) was created to achieve the following objectives:

Implementation details for CI Steps.

Baseline Step Proto

The internals of Step Runner operate on the baseline step definition, which is defined in Protocol Buffers. All GitLab CI steps (and other supported formats such as GitHub Actions) compile (fold) to baseline steps. Both step invocations in .gitlab-ci.yml and step definitions in step.yml files will be compiled to baseline structures. The term “step” means “baseline step” for the remainder of this document.

Each step includes a reference ref in the form of a URI. The method of retrieval is determined by the protocol of the URI.
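
Choosing a retrieval method from the protocol of the URI could look like the sketch below. The scheme names (`git`, `zip`, `file`) and the method labels are assumptions chosen to mirror the sources listed under Capabilities (Git repos, zip files, locally provided); the actual set of supported protocols is not given here.

```python
from urllib.parse import urlparse

# Hypothetical mapping from URI scheme to retrieval method.
RETRIEVAL_METHODS = {
    "git": "clone Git repository",
    "zip": "download and unpack zip file",
    "file": "read from local directory",
}


def retrieval_method(ref):
    """Pick how to fetch a step based on the scheme of its ref URI."""
    scheme = urlparse(ref).scheme
    if scheme not in RETRIEVAL_METHODS:
        raise ValueError(f"unsupported step ref protocol: {scheme!r}")
    return RETRIEVAL_METHODS[scheme]


print(retrieval_method("file:///steps/hello"))  # read from local directory
```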

Runner integration for CI Steps.

Non goals

This proposal does not address deployment of the Step Runner binary into target environments, nor of starting the Step Runner gRPC service described below. The rest of the proposal assumes both that the Step Runner binary exists in the target environment and that the gRPC service is running and listening on a local socket. Similarly this proposal does not address the life-cycle of the Step Runner service, and how to handle things like restarting the service if it dies, or upgrades.

Steps Runner Deployment and Lifecycle Management for Runner Integration.

This Blueprint is concerned with:

The deployment or injection of the Step Runner binary into target environments. This includes build containers for Docker, Kubernetes and Instance executors.
Startup of the Step Runner gRPC service in said environments.
Any required install-time configuration.
Service restart in the event of a crash.
Step Runner binary upgrade for environments where the Step Runner service is long lived.
Management of any resources used by the Step Runner service.

The Step Definition for GitLab Steps

A step is the minimum executable unit that a user can provide and is defined in a step.yml file.

The following step definition describes the minimal syntax supported. The syntax is extended with syntactic sugar.

A step definition consists of two documents. The purpose of the document split is to distinguish between the declaration and implementation:

Specification / Declaration:

Provides the specification which describes step inputs and outputs, as well as any other metadata that might be needed by the step in the future (license, author, etc.). In programming language terms, this is similar to a function declaration with arguments and return values.

The Syntactic Sugar extensions to the Step Definition

The Step Definition describes a minimal required syntax to be supported. To aid common workflows the following syntactic sugar is used to extend different parts of that document.

Syntactic Sugar for Step Reference

Each syntactic sugar extension is converted into a simple step reference.

Easily execute scripts in a target environment

script: is a shorthand syntax for running simple scripts. It cannot be used together with step: and is run by an externally stored step component provided by GitLab.
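
Converting the script: shorthand into a plain step reference might be sketched as below. The step reference `gitlab.com/example/script-step@v1` and the input name `script` are hypothetical placeholders; the actual component provided by GitLab is not named in this section.

```python
# Hypothetical reference to the GitLab-provided script step.
SCRIPT_STEP = "gitlab.com/example/script-step@v1"


def desugar(invocation):
    """Rewrite a script: invocation into a simple step reference."""
    if "script" in invocation and "step" in invocation:
        raise ValueError("script: cannot be used together with step:")
    if "script" not in invocation:
        return invocation  # already a plain step reference
    rest = {k: v for k, v in invocation.items() if k != "script"}
    return {**rest, "step": SCRIPT_STEP, "inputs": {"script": invocation["script"]}}


print(desugar({"script": "echo hello"}))
# {'step': 'gitlab.com/example/script-step@v1', 'inputs': {'script': 'echo hello'}}
```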

Usage of the GitLab Steps with .gitlab-ci.yml

This document describes how GitLab Steps are integrated into .gitlab-ci.yml.

GitLab Steps will be integrated using a three-stage execution cycle and replace before_script:, script: and after_script:.

setup:: Execution stage responsible for provisioning the environment, including cloning the repository, restoring artifacts, or installing all dependencies. This stage will replace implicit cloning, artifact restoration, and cache download.
run:: Execution stage responsible for running a test, build, or any other main command required by that job.
teardown:: Execution stage responsible for cleaning the environment, uploading artifacts, or storing cache. This stage will replace implicit artifacts and cache uploads.

Before we can achieve three-stage execution, we will ship minimal initial support that does not require any prior GitLab integration.

Last modified January 7, 2025: Move eng images to static folder (be4d32f4)
