Semantic Code Search Ad-hoc Indexing

Design document for Semantic Code Search Ad-hoc Indexing

This page contains information related to upcoming products, features, and functionality. It is important to note that the information presented is for informational purposes only. Please do not rely on this information for purchasing or planning purposes. The development, release, and timing of any products, features, or functionality may be subject to change or delay and remain at the sole discretion of GitLab Inc.

Status	Authors	Coach	DRIs	Owning Stage	Created
implemented	`partiaga`		`wortschi`	devops ai platform	2026-06-29

Overview

Ad-hoc indexing is a lazy-loading mechanism that automatically triggers initial indexing when a user attempts to perform semantic code search on a project that hasn’t been indexed yet.

Benefits

Dramatically reduced storage: Only active projects are indexed, reducing storage from 39-118 TB to manageable levels
Cost efficiency: Elasticsearch cluster can be significantly smaller
Scalability: Easier to manage growth incrementally rather than all at once

Trade-off

First-access latency: The first semantic code search on a project will be slower as embeddings are generated

Execution Flow

A user or AI agent attempts a semantic code search on an unindexed project. This can be done through various tools (MCP or glab) that eventually flows to the Semantic Code Search REST API.
The Semantic Code Search REST API invokes a search on the Ai::ActiveContext::Queries::Code class
If the project has not yet been indexed but it’s eligible for indexing, Ai::ActiveContext::Queries::Code triggers the Ai::ActiveContext::Code::AdHocIndexingWorker. This queues an ad-hoc indexing job that will run asynchronously.
Ai::ActiveContext::Queries::Code returns a message indicating: initial indexing has been started, try again in a few minutes
The Semantic Code Search REST API returns the message to the invoking user or tool.
In a few minutes, when the user or AI agent performs a search on the project, the Semantic Code Search tool or REST API will return relevant search results.

`Ai::ActiveContext::Code::AdHocIndexingWorker`

On perform, AdHocIndexingWorker calls RepositoryIndexWorker.perform_async. From there, initial indexing will start for the given project.

For further details on initial indexing and related state management, please see Index state management.

Rate limits

Ad-hoc indexing is rate-limited per namespace to prevent a single namespace from overwhelming the system with indexing requests.

Key: semantic_code_search_ad_hoc_indexing
Scope: Root namespace (shared across all projects in the namespace)
Limit: 10 requests per hour. This is a low rate limit because initial indexing only needs to be done once per project.

Sequence Diagram

sequenceDiagram
  participant Client@{ "type" : "entity" } as User or MCP tool or glab CLI

  box Rails
    participant API as REST API
    participant ACQuery as Ai::ActiveContext::<br />Queries::Code
    participant ACRepository as Ai::ActiveContext::<br />Code::Repository
    participant ACAdhocIndexingWorker as Ai::ActiveContext::Code::<br />AdHocIndexingWorker
  end

  participant VectorStorage@{ "type" : "database" } as Vector Storage

  Client->>API: performs an API request
  API->>ACQuery: calls 'filter'
  ACQuery->>ACRepository: finds a 'ready' record for the project<br />(this indicates whether a project<br /> has been indexed or not)
  alt record exists for project
    ACRepository->>ACQuery: returns a 'ready' record
    ACQuery->>VectorStorage: performs query
    VectorStorage->>ACQuery: returns result
    ACQuery->>API: returns result
    API->>Client: returns result
  else
    ACRepository->>ACQuery: returns no 'ready' record
    alt project is not eligible for indexing
      ACQuery->>API: returns project is not eligible for indexing
      API->>Client: returns project is not eligible for indexing
    else adhoc indexing runs into rate limits
      ACQuery->>API: returns indexing is rate limited
      API->>Client: returns indexing is rate limited
    else project is eligible and ad-hoc indexing is not rate limited
      ACQuery->>ACAdhocIndexingWorker: triggers asynchronous execution
      ACQuery->>API: returns initial indexing has started
      API->>Client: returns initial indexing has started
    end
  end

Last modified July 1, 2026: Add Semantic Code Search design docs (e8bd4a2e)

View page source - Edit this page - please contribute.