AI Catalog Backend Architecture

This page contains information related to upcoming products, features, and functionality. It is important to note that the information presented is for informational purposes only. Please do not rely on this information for purchasing or planning purposes. The development, release, and timing of any products, features, or functionality may be subject to change or delay and remain at the sole discretion of GitLab Inc.

Status	Authors	Coach	DRIs	Owning Stage	Created
implemented	`.luke`			devops ai_powered	2026-02-12

Summary

This document captures the current architecture of the AI Catalog backend in the GitLab Rails monolith. It documents the data model, different implementation patterns for foundational vs custom items, and identifies architectural inconsistencies that have emerged as the system evolved.

This documentation enables:

Architecture review
Creation of improvement roadmap for unifying patterns
Better onboarding for engineers working on the catalog

Problem Statement

The workflow catalog backend architecture has evolved incrementally to support (see glossary):

Custom agents
Custom flows
Custom external agents (third-party flows)
Foundational agents
Foundational flows
Foundational external agents
MCP servers (on the horizon)

This evolution has happened without unified architectural patterns, resulting in different implementation approaches for similar concepts. This document aims to capture the current picture.

Core Data Model

Database Tables

Table	Purpose
`ai_catalog_items`	Core catalog items (agents, flows, external agents)
`ai_catalog_item_versions`	Versioned definitions for items (definitions stored in jsonb `definition` column)
`ai_catalog_item_consumers`	Links items to groups/projects that use them
`ai_catalog_item_version_dependencies`	Tracks which agents a flow version depends on (unused)
`enabled_foundational_flows`	Tracks which foundational flows are selected per namespace/project
`namespace_foundational_agent_statuses`	Per-agent enablement overrides for foundational agents (namespace level)
`organization_foundational_agent_statuses`	Per-agent enablement overrides for foundational agents (organization level)
`ai_flow_triggers`	Event-based triggers for flows and external agents (assign, mention, pipeline hooks)

In-Memory Models (`FixedItemsModel`)

These are Ruby classes using the FixedItemsModel pattern - they are not stored in the database:

Model	Purpose	Definition Location
`Ai::Catalog::FoundationalFlow`	GitLab-provided flows (Code Review, Developer, etc.)	`ee/app/models/ai/catalog/foundational_flow.rb`
`Ai::Catalog::BuiltInTool`	Predefined tools available to agents	`ee/lib/ai/catalog/built_in_tool_definitions.rb`
`Ai::FoundationalChatAgent`	GitLab-provided chat agents (Duo Chat, etc.)	`ee/lib/ai/foundational_chat_agents_definitions.rb`

Entity Relationship Diagram

erDiagram
    organizations ||--o{ ai_catalog_items : "has many"
    organizations ||--o{ organization_foundational_agent_statuses : "has many"
    projects ||--o{ ai_catalog_items : "has many (optional)"

    ai_catalog_items ||--o{ ai_catalog_item_versions : "has many"
    ai_catalog_items ||--o{ ai_catalog_item_consumers : "has many"
    ai_catalog_items ||--|| ai_catalog_item_versions : "latest_version"
    ai_catalog_items ||--o| ai_catalog_item_versions : "latest_released_version"

    ai_catalog_item_versions ||--o{ ai_catalog_item_version_dependencies : "has many (unused)"
    ai_catalog_item_version_dependencies }o--|| ai_catalog_items : "dependency - agent (unused)"

    ai_catalog_item_consumers }o--|| ai_catalog_items : "item"
    ai_catalog_item_consumers ||--o| ai_flow_triggers : "has one"
    ai_catalog_item_consumers }o--o| organizations : "container (sharding key)"
    ai_catalog_item_consumers }o--o| namespaces : "container (sharding key)"
    ai_catalog_item_consumers }o--o| projects : "container (sharding key)"

    namespaces ||--o{ enabled_foundational_flows : "has many"
    namespaces ||--o{ namespace_foundational_agent_statuses : "has many"
    projects ||--o{ enabled_foundational_flows : "has many"
    ai_catalog_items {
        bigint id PK
        bigint organization_id FK
        bigint project_id FK
        smallint item_type
        text name
        text description
        boolean public
        smallint verification_level
        text foundational_flow_reference
        bigint latest_version_id FK
        bigint latest_released_version_id FK
        timestamp deleted_at
    }
    ai_catalog_item_versions {
        bigint id PK
        bigint ai_catalog_item_id FK
        bigint organization_id FK
        smallint schema_version
        text version
        jsonb definition
        timestamp release_date
        bigint created_by_id FK
    }
    ai_catalog_item_consumers {
        bigint id PK
        bigint ai_catalog_item_id FK
        bigint organization_id FK
        bigint group_id FK
        bigint project_id FK
        boolean enabled
        boolean locked
        text pinned_version_prefix
        bigint service_account_id FK
        bigint parent_item_consumer_id FK
    }
    ai_catalog_item_version_dependencies {
        bigint id PK
        bigint ai_catalog_item_version_id FK
        bigint ai_catalog_item_id FK
        text note "(table exists but unused)"
    }

Item Types

The ai_catalog_items.item_type enum defines three types:

Value	Type	Description
1	`:agent`	Custom agents with system/user prompts and tool selections. User prompt is unused.
2	`:flow`	Multi-step workflows composed of agents (agents are defined in-line within the flow)
3	`:third_party_flow`	External agents executed through CI/CD (Docker image + commands)

Definition Schemas

Item definitions are stored as JSONB in ai_catalog_item_versions.definition and validated against JSON schemas:

Schema	Item Type	Key Fields
`agent_v1.json`	Agents	`system_prompt`, `user_prompt` (unused), `tools` (array of BuiltInTool IDs)
`flow_v1.json`	Flows (legacy)	`steps` with `agent_id` references
`flow_v2.json`	Flows (current)	`components`, `routers`, `flow`, `prompts`, `yaml_definition`
`third_party_flow_v1.json`	External Agents	`image`, `commands`, `variables`, `yaml_definition`

Foundational vs Custom Items

Overview

The catalog supports both user-created (custom) items and GitLab-maintained (foundational) items. However, the implementation patterns differ significantly between foundational agents, flows, and external agents.

Comparison Table

Aspect	Foundational Agents	Foundational Flows	Foundational External Agents
Definition Location	`Ai::FoundationalChatAgentsDefinitions::ITEMS`	`Ai::Catalog::FoundationalFlow::ITEMS`	`Gitlab::Ai::Catalog::ThirdPartyFlows::Seeder::AGENTS`
Storage Pattern	`Ai::FoundationalChatAgent` (in-memory, uses `FixedItemsModel`)	`ai_catalog_items` table (`foundational_flow_reference`)	`ai_catalog_items` table (`verification_level: gitlab_maintained`)
Seeding Mechanism	None (pure fixtures)	`Ai::Catalog::Flows::SeedFoundationalFlowsService`	`Gitlab::Ai::Catalog::ThirdPartyFlows::Seeder.run!`
Consumer Records	None	Auto-created by `Ai::Catalog::Flows::SyncFoundationalFlowsService`	Manually created
Enablement Tracking	`namespace_foundational_agent_statuses` / `organization_foundational_agent_statuses` tables	`enabled_foundational_flows` + `ai_catalog_item_consumers`	`ai_catalog_item_consumers` only
Trigger Support	N/A (chat-based)	`ai_flow_triggers` (auto-created)	`ai_flow_triggers` (manually created)

Note, foundational agents can optionally be represented in ai_catalog_items table, so they are visible in the catalog and allowing duplication. However, functionally the source of data for the foundational agents is always Ai::FoundationalChatAgent.

Architectural Diagram

flowchart TB
    subgraph "Definition Layer"
        direction LR
        subgraph "Foundational Agents"
            FA_DEF["Ai::FoundationalChatAgentsDefinitions<br/>(Ruby ITEMS constant)"]
        end
        subgraph "Foundational Flows"
            FF_DEF["Ai::Catalog::FoundationalFlow::ITEMS<br/>(Ruby constant)"]
        end
        subgraph "Foundational External Agents"
            FEA_DEF["Gitlab::Ai::Catalog::ThirdPartyFlows::Seeder::AGENTS<br/>(Ruby constant)"]
        end
    end
    subgraph "Storage Layer"
        subgraph "In-Memory"
            FA_MEM["Ai::FoundationalChatAgent<br/>(uses FixedItemsModel)"]
        end
        subgraph "Database"
            FF_DB["ai_catalog_items<br/>(foundational_flow_reference)"]
            FEA_DB["ai_catalog_items<br/>(verification_level: gitlab_maintained)"]
        end
    end
    subgraph "Enablement Layer (database tables)"
        FA_STATUS["namespace/organization<br/>_foundational_agent_statuses"]
        FF_ENABLED["enabled_foundational_flows"]
        FF_CONSUMER["ai_catalog_item_consumers<br/>(auto-created)"]
        FEA_CONSUMER["ai_catalog_item_consumers<br/>(manually created)"]
    end
    subgraph "Trigger Layer (database tables)"
        FF_TRIGGER["ai_flow_triggers<br/>(auto-created)"]
        FEA_TRIGGER["ai_flow_triggers<br/>(manually created)"]
    end
    FA_DEF --> FA_MEM
    FF_DEF -->|Ai::Catalog::Flows::SeedFoundationalFlowsService| FF_DB
    FEA_DEF -->|Gitlab::Ai::Catalog::ThirdPartyFlows::Seeder.run!| FEA_DB
    FA_MEM -.->|"global_catalog_id<br/>(optional link)"| FF_DB

    FA_MEM --> FA_STATUS
    FF_DB --> FF_ENABLED
    FF_DB -->|Ai::Catalog::Flows::SyncFoundationalFlowsService| FF_CONSUMER
    FEA_DB --> FEA_CONSUMER
    FF_CONSUMER --> FF_TRIGGER
    FEA_CONSUMER --> FEA_TRIGGER
    style FA_MEM fill:#ffcccc

Enablement Mechanisms

Foundational Agents

Foundational agents use a dedicated status table system, separate from the standard ItemConsumer pattern.

Tables:

namespace_foundational_agent_statuses
organization_foundational_agent_statuses

Schema:

├── namespace_id / organization_id (FK)
├── reference (string) - e.g., "duo_planner", "security_analyst_agent"
├── enabled (boolean)
└── timestamps

Logic (in Ai::FoundationalAgentsStatusable concern):

Duo Chat (reference: 'chat') is always enabled (hardcoded exception)
If an explicit status record exists -> use its enabled value
Otherwise -> fall back to foundational_agents_default_enabled setting

Key characteristics:

Does NOT create ItemConsumer records

Flow Diagram

sequenceDiagram
    participant Admin as Admin UI
    participant Concern as Namespace/Organization<br/>(FoundationalAgentsStatusable)
    participant StatusTable as *_foundational_agent_statuses
    participant Default as Ai::Setting
    rect rgb(240, 240, 240)
        Note over Admin,StatusTable: Write Path
        Admin->>Concern: foundational_agents_statuses=[...]
        Concern->>StatusTable: DELETE existing, CREATE new records
    end
    rect rgb(240, 248, 255)
        Note over Concern,Default: Read Path (enabled_foundational_agents)
        Concern->>StatusTable: Fetch status records
        Concern->>Default: foundational_agents_default_enabled
        Note over Concern: Per agent:<br/>1. Duo Chat → always enabled<br/>2. Status record exists → use it<br/>3. Else → use default
    end

Foundational Flows

Foundational flows use a two-table approach:

Stage 1: Selection (`enabled_foundational_flows`)

Records the admin’s selection of which flows to enable.

Schema:

├── namespace_id OR project_id (exactly one)
├── catalog_item_id (FK to ai_catalog_items)
└── timestamps

Written by: sync_enabled_foundational_flows! by CascadeDuoSettingsService

Cascades: Down the hierarchy (group -> subgroups -> projects)

Stage 2: Activation (`ai_catalog_item_consumers`)

Records the operational configuration for execution.

Written by: SyncFoundationalFlowsService (asynchronously through worker)

Includes: Service account setup, trigger creation, version pinning

Key characteristics:

Auto-creates ItemConsumer records for all projects in group hierarchy
Requires keeping ItemConsumer records in sync through hooks in after project creation and after changes to enabled foundational flow options

Flow Diagram

sequenceDiagram
    participant Admin as Admin UI
    participant Cascade as CascadeDuoSettingsService
    participant EFF as enabled_foundational_flows
    participant Sync as SyncFoundationalFlowsService
    participant Consumer as ai_catalog_item_consumers
    participant Trigger as ai_flow_triggers
    rect rgb(240, 240, 240)
        Note over Admin,EFF: Stage 1: Selection
        Admin->>Cascade: Update foundational flow settings
        Cascade->>EFF: sync_enabled_foundational_flows!
        Note over EFF: Cascades to descendants
    end
    rect rgb(240, 248, 255)
        Note over Cascade,Trigger: Stage 2: Activation (async)
        Cascade->>Sync: schedule worker
        Sync->>EFF: Read enabled_flow_catalog_item_ids
        Sync->>Consumer: Create ItemConsumer records
        Sync->>Trigger: Create FlowTrigger records
    end

Foundational External Agents

Use the standard ItemConsumer pattern directly.

Foundational external agents use a one-time seeding approach.

Defined in: Gitlab::Ai::Catalog::ThirdPartyFlows::Seeder::AGENTS constant

Written by: Gitlab::Ai::Catalog::ThirdPartyFlows::Seeder.run!

Triggered by: Admin API (POST /admin/ai_catalog/seed_external_agents), Rake task, or Admin UI

Key characteristics:

Does not auto-create ItemConsumer records, requires manual enabling

Flow Diagram

sequenceDiagram
    participant Entry as Admin UI / API / Rake Task
    participant Seeder as Gitlab::Ai::Catalog::<br/>ThirdPartyFlows::Seeder
    participant Item as ai_catalog_items
    participant Version as ai_catalog_item_versions
    rect rgb(240, 240, 240)
        Note over Entry,Seeder: Entry Points
        Entry->>Seeder: POST /admin/ai_catalog/seed_external_agents<br/>OR rake gitlab:ai_catalog:seed_external_agents
    end
    rect rgb(240, 248, 255)
        Note over Seeder,Version: Seeding (Self-managed only)
        Seeder->>Seeder: Validate: not SaaS, not already seeded,<br/>feature flags enabled
        loop For each agent in AGENTS constant
            Seeder->>Item: Create Item (item_type: third_party_flow,<br/>verification_level: gitlab_maintained, public: true)
            Seeder->>Version: Create ItemVersion with YAML definition
            Seeder->>Item: Set latest_released_version
        end
    end
    Note over Item: No ItemConsumer or FlowTrigger created<br/>(users enable manually)

Version Pinning

Overview

Version pinning allows consumers to lock to a specific version of a catalog item rather than always using the latest.

Key Characteristics

Pinning follows semver format - MAJOR.MINOR.PATCH (for example, 1.2.3)
Pinning rules are conservative - The code supports resolving a "1" or "1.2" pin to using prefix matching, and nil to resolve to latest released version, but current create and update validations require pinning to the exact semver format
Code for resolving pin to a version is in Ai::Catalog::ItemConsumer#pinned_version and Ai::Catalog::Item#resolve_version

Storage

Column: ai_catalog_item_consumers.pinned_version_prefix.

Creation

Creation happens through the aiCatalogItemConsumerCreate GraphQL mutation and Ai::Catalog::ItemConsumers::CreateService service class.

flowchart LR
    CREATE_START["Create ItemConsumer"]
    HAS_PARENT{Has parent<br/>consumer?}
    INHERIT["Inherit parent's<br/>pinned_version_prefix"]
    GET_LATEST["Get item.latest_released_version"]
    SET_PIN["Set pinned_version_prefix<br/>= latest version (e.g., '1.2.3')"]
    CONSUMER_CREATED["ItemConsumer created<br/>with pinned version"]
    CREATE_START --> HAS_PARENT
    HAS_PARENT -->|Yes| INHERIT
    HAS_PARENT -->|No| GET_LATEST
    GET_LATEST --> SET_PIN
    INHERIT --> CONSUMER_CREATED
    SET_PIN --> CONSUMER_CREATED

Updating

Updating happens through the aiCatalogItemConsumerUpdate GraphQL mutation and Ai::Catalog::ItemConsumers::UpdateService service class.

flowchart LR
    UPDATE_START["Update pinned_version_prefix"]
    VALIDATE_FORMAT{Valid semver<br/>format?<br/>e.g., '1.2.3'}
    RESOLVE["Resolve version from prefix"]
    VALIDATE_LATEST{Resolves to<br/>latest released<br/>version?}
    UPDATE_ERROR_FORMAT["Error: 'pinned_version_prefix<br/>is not a valid version string'"]
    UPDATE_ERROR_LATEST["Error: 'pinned_version_prefix must<br/>resolve to the latest released<br/>version of the item'"]
    UPDATE_SUCCESS["Update successful"]
    UPDATE_START --> VALIDATE_FORMAT
    VALIDATE_FORMAT -->|No| UPDATE_ERROR_FORMAT
    VALIDATE_FORMAT -->|Yes| RESOLVE
    RESOLVE --> VALIDATE_LATEST
    VALIDATE_LATEST -->|No| UPDATE_ERROR_LATEST
    VALIDATE_LATEST -->|Yes| UPDATE_SUCCESS

Resolving item version from a pin

This diagram shows how version pinning resolves which version to use.

Note that while the resolution logic supports all three modes, the current creation and update validation rules only permit exact semver format—the nil and prefix matching paths are not reachable through normal ItemConsumer flows.

flowchart LR
    RESOLVE["Ai::Catalog::ItemConsumer#pinned_version"]
    CHECK_NIL{pinned_version_prefix<br/>is nil?}
    CHECK_EXACT{Exact semver?<br/>2 dots}
    FIND_EXACT["Find version by exact match<br/>(e.g., '1.2.3')"]
    FIND_PREFIX["Find latest matching prefix<br/>(e.g., '1.2' → latest '1.2.x')"]
    FIND_LATEST["Return latest_version"]
    EXECUTE["Return resolved version"]
    RESOLVE --> CHECK_NIL
    CHECK_NIL -->|Yes| FIND_LATEST
    CHECK_NIL -->|No| CHECK_EXACT
    CHECK_EXACT -->|"Yes (e.g., '1.2.3')"| FIND_EXACT
    CHECK_EXACT -->|"No (e.g., '1.2')"| FIND_PREFIX
    FIND_EXACT --> EXECUTE
    FIND_PREFIX --> EXECUTE
    FIND_LATEST --> EXECUTE

Built-in Tools

Overview

Built-in tools are predefined capabilities that can be assigned to custom agents. They represent actions that Duo Workflow Service can execute.

Source of Truth

Duo Workflow Service, synchronised to Rails.

Storage

FixedItemsModel pattern (not in database), defined as fixtures in Ai::Catalog::BuiltInToolDefinitions::ITEMS.

{
  id: 1,                      # Stable ID (referenced in agent definitions)
  name: "gitlab_blob_search", # Machine-readable name
  title: "Gitlab Blob Search", # Human-readable title
  description: "..."          # Description for UI
}

Synchronization of tool data

A script generates ee/lib/ai/catalog/built_in_tool_definitions.rb from Duo Workflow Service definitions.

Synchronization flow

flowchart LR
    subgraph "Duo Workflow Service"
        DWS_TOOLS["Tool Definitions<br/>(Python)"]
    end

    subgraph "Sync Process"
        SCRIPT["Sync Script"]
    end

    subgraph "Rails Monolith"
        DEFS["BuiltInToolDefinitions<br/>(ITEMS constant)"]
        MODEL["BuiltInTool<br/>(FixedItemsModel)"]

    end

    DWS_TOOLS --> SCRIPT
    SCRIPT -->|"generates"| DEFS
    DEFS --> MODEL

Limitations

Data source must be manually updated through synchronization.
There is currently no process to remove tools (issue: !584050).

Association with agents

Tools are associated with agents through the agent definition (jsonb ai_catalog_items.definition).

The agent definition stores the tool ID as defined in Ai::Catalog::BuiltInToolDefinitions::ITEMS.

{
  "tools": [1, 3, 10, 39],
  "system_prompt": "...",
  "user_prompt": "..."
}

Mapping to Duo Workflow Service

Tools are mapped back to their Duo Workflow Service names when passed to Duo Workflow Service.

Flow Triggers

Flow triggers enable automatic execution of catalog flows based on GitLab events.

Model: `Ai::FlowTrigger`

Stored in ai_flow_triggers table. Links a project to either:

A catalog item consumer (ai_catalog_item_consumer_id), OR
A config file path (config_path)

Event Types:

Value	Type	Description
0	`mention`	User mentions the service account
1	`assign`	Issue/MR assigned to service account
2	`assign_reviewer`	Service account added as reviewer
3	`pipeline_hooks`	Pipeline events

Key Validations:

Must have exactly one of config_path or ai_catalog_item_consumer
User must be a service account
If linked to a consumer, the consumer’s item must be a flow or third-party flow
Consumer’s project must match trigger’s project

Execution: `Ai::FlowTriggers::RunService`

Routes execution based on item type:

Item Type	Execution Path
`flow` (foundational/custom)	`Ai::Catalog::Flows::ExecuteService` → Duo Workflow Service
`third_party_flow`	`Ci::Workloads::RunWorkloadService` → CI pipeline with Docker image

For catalog flows, the service:

Resolves the pinned version from the consumer
Builds user prompt from input and resource context
Delegates to Flows::ExecuteService

For external agents (third-party flows), the service:

Fetches flow definition from the item version
Creates an Ai::DuoWorkflows::Workflow record
Runs a CI workload with the image/commands from the definition
Passes context through AI_FLOW_* environment variables

Execution and Integration Points

Execution Contexts

Execution differs by item type. Agents are interactive, invoked by users through chat interfaces. Flows and External Agents are event-driven, triggered automatically when configured GitLab events occur. Some foundational flows can also be invoked directly from the GitLab Web UI.

Item Type	Invoked From	Executes in
Agents	Web UI (Agentic Chat), IDE, Duo CLI	Duo Workflow Service
Flows	Flow Triggers, Web UI (foundational flows only)	Duo Workflow Service
External Agents	Flow Triggers	CI Pipeline (Docker workload)

Agent and flow integration with Duo Workflow Service

Duo Workflow Service is the execution engine for agents and flows. It is a Python-based service with a gRPC API, built on LangGraph.

Integration paths from Rails:

Web UI (Agentic Chat): WebSocket connection through Workhorse, which proxies to Duo Workflow Service using gRPC. The aiCatalogAgentFlowConfig GraphQL query provides the flow configuration.
IDE: The GitLab Language Server includes a Duo Agent Platform client (a.k.a executor) that connects to Duo Workflow Service through Workhorse proxy and executes workflow actions locally.
Flow Triggers: Ai::FlowTriggers::RunService delegates to Ai::Catalog::Flows::ExecuteService, which uses Ai::DuoWorkflows::StartWorkflowService to orchestrate execution through CI pipeline.

For detailed architecture diagrams, see the Duo Workflow Architecture documentation.

External Agent Execution

External agents (third-party flows) do not use Duo Workflow Service. They execute directly as CI workloads:

Ai::FlowTriggers::RunService receives the trigger event
Flow definition (Docker image, commands) is read from ItemVersion#definition
Ci::Workloads::RunWorkloadService creates a CI job
Context is passed in AI_FLOW_* environment variables

Agent identity

When flows and external agents execute on runners through Flow Triggers, the permissions of an agent are granted through composite identity.

Composite Identity is an authentication mechanism that combines a service account (the machine user performing actions) with a human user (who initiated the request) into a single OAuth token. This ensures actions are attributed to the service account while preventing privilege escalation—the token only grants access to resources that both the service account and human user can access.

The service account used for flows and external agents are the user records on Ai::Catalog::ItemConsumer#service_account (which are duplicated within the associated flow trigger record Ai::FlowTrigger#user).

Service Account Management

Service accounts are automatically created when flows or external agents are enabled at the group level. They provide the machine identity for composite identity authentication.

Creation

When Ai::Catalog::ItemConsumers::CreateService creates a group-level consumer for a flow or external agent:

Username Generation: "{prefix}-{item_name}-{group_name}".parameterize
- Foundational flows: prefix is duo (for example, duo-code-review-my-group)
- Custom flows/external agents: prefix is ai (for example, ai-my-flow-my-group)
Name Generation: Item name, prefixed with “Duo " for foundational flows
Service Account Creation through Namespaces::ServiceAccounts::CreateService:
- namespace_id: The group ID
- composite_identity_enforced: true (required for composite identity)
- organization_id: Inherited from group
Reuse Logic: If a service account with the same username exists AND is not already linked to an ItemConsumer, it’s reused rather than creating a new one.

Hierarchy

Group-level consumers (ItemConsumer with group_id): Own the service account (service_account_id column)
Project-level consumers (ItemConsumer with project_id): Reference the parent group consumer through parent_item_consumer_id and inherit its service account

When a project-level consumer is created:

The parent’s service account is added to the project with Developer role
The FlowTrigger record also stores the service account in its user column

Cleanup

When an ItemConsumer is destroyed:

Project consumers: Service account is removed from the project membership
Group consumers: Service account is removed from all projects in the group hierarchy

Soft Delete

Items support soft deletion when consumers exist. This preserves enablements for consumers when an item is public.

Soft Delete Behavior

Soft-deleted items:

Are excluded from finder results by default
Can still be queried directly with showSoftDeleted: true
Retain their consumer and version records
Cannot be administered by non-org-admins (prevented by ItemPolicy)

Model Support

Scope: not_deleted filters to items where deleted_at is null.

Deletion Logic

Ai::Catalog::Items::BaseDestroyService, the base service class used when destroying all item types, chooses between soft and hard delete:

flowchart LR
    START[Delete Request] --> FORCE{force_hard_delete<br/>param = true?}
    FORCE -->|Yes| HARD_ADMIN[Hard Delete<br/>admin only]
    FORCE -->|No| HAS_REFS{Has consumers<br/>or dependents?}
    HAS_REFS -->|Yes| SOFT[Soft Delete<br/>set deleted_at]
    HAS_REFS -->|No| HARD[Hard Delete<br/>destroy record]

    HARD_ADMIN --> DONE[Done]
    SOFT --> DONE
    HARD --> DONE

Authorization

delete_ai_catalog_item: Required for any deletion (maintainer+ required)
force_hard_delete_ai_catalog_item: Required for forced hard delete (admin required)

GraphQL Exposure

Items have a soft_deleted field (maps to deleted? method)
aiCatalogItem query accepts showSoftDeleted argument (defaults to false)
Ai::Catalog::ItemsFinder uses not_deleted scope by default

GraphQL API

The AI Catalog backend data is exposed through GitLab’s GraphQL API.

Type	Directory
Queries	`ee/app/graphql/resolvers/ai/catalog/`
Mutations	`ee/app/graphql/mutations/ai/catalog/`
Types	`ee/app/graphql/types/ai/catalog/`

Authorization

Where Authorization Happens

Authorization is enforced at three layers:

Layer	Location	Mechanism
GraphQL	Mutations	`authorize :permission` + `authorized_find!`
GraphQL	Resolvers (through finders)	Typically through finders with `Ability.allowed?` checks
Services	All services	`Ability.allowed?` in `authorized?` method
Model	`Item` scopes	`public_or_visible_to_user` scope filters by project membership

Policy Classes

Ai::Catalog::ItemPolicy Controls access to catalog items themselves.
Ai::Catalog::ItemVersionPolicy — Contains unused version execution permission (similar permissions in ItemConsumerPolicy are used instead). Delegates to ItemPolicy for base permissions.
Ai::Catalog::ItemConsumerPolicy — Controls item consumer execution. Delegates to both ProjectPolicy for project item consumers, and GroupPolicy for group item consumers.
ProjectPolicy and GroupPolicy - Container-Level permission for creating items and consumers.

Identified Inconsistencies

1. Storage Model Mismatch

Foundational agents use FixedItemsModel (in-memory only) while foundational flows and external agents use database records.

2. Enablement Fragmentation

Three different enablement mechanisms:

Foundational Agents: Dedicated status tables (*_foundational_agent_statuses)
Foundational Flows: Two-stage process (enabled_foundational_flows -> ItemConsumer)
Everything else: Direct ItemConsumer records

3. Consumer Record Inconsistency

Foundational agents do not create ItemConsumer records, while foundational flows auto-create them.

4. Catalog Item Linkage Variance

Agents: Ai::FoundationalChatAgent#global_catalog_id field
Flows: Ai::Catalog::Item#foundational_flow_reference column
External Agents: Standard item with verification_level: :gitlab_maintained

5. Seeding Mechanism Differences

Agents: No seeding (pure fixtures)
Flows: Service-based seeding
External Agents: Admin UI button

Last modified February 20, 2026: Document AI Catalog integration points with Duo Workflow service, websockets, and CI/CD pipelines (234c0e14)

View page source - Edit this page - please contribute.

AI Catalog Backend Architecture

Summary

Problem Statement

Core Data Model

Database Tables

In-Memory Models (FixedItemsModel)

Entity Relationship Diagram

Item Types

Definition Schemas

Foundational vs Custom Items

Overview

Comparison Table

Architectural Diagram

Enablement Mechanisms

Foundational Agents

Flow Diagram

Foundational Flows

Stage 1: Selection (enabled_foundational_flows)

Stage 2: Activation (ai_catalog_item_consumers)

Flow Diagram

Foundational External Agents

Flow Diagram

Version Pinning

Overview

Key Characteristics

Storage

Creation

Updating

Resolving item version from a pin

Built-in Tools

Overview

Source of Truth

Storage

Synchronization of tool data

Limitations

Association with agents

Mapping to Duo Workflow Service

Flow Triggers

Model: Ai::FlowTrigger

Execution: Ai::FlowTriggers::RunService

Execution and Integration Points

Execution Contexts

Agent and flow integration with Duo Workflow Service

External Agent Execution

Agent identity

Service Account Management

Creation

Hierarchy

Cleanup

Soft Delete

Soft Delete Behavior

Model Support

Deletion Logic

Authorization

GraphQL Exposure

GraphQL API

Authorization

Where Authorization Happens

Policy Classes

Identified Inconsistencies

1. Storage Model Mismatch

2. Enablement Fragmentation

3. Consumer Record Inconsistency

4. Catalog Item Linkage Variance

5. Seeding Mechanism Differences

In-Memory Models (`FixedItemsModel`)

Stage 1: Selection (`enabled_foundational_flows`)

Stage 2: Activation (`ai_catalog_item_consumers`)

Model: `Ai::FlowTrigger`

Execution: `Ai::FlowTriggers::RunService`