Retrieval Augmented Generation (RAG) for GitLab Duo on self-managed

This page contains information related to upcoming products, features, and functionality. It is important to note that the information presented is for informational purposes only. Please do not rely on this information for purchasing or planning purposes. The development, release, and timing of any products, features, or functionality may be subject to change or delay and remain at the sole discretion of GitLab Inc.

Status	Authors	Coach	DRIs	Owning Stage	Created
implemented	`shinya.maeda` `mikolaj_wawrzyniak`	`stanhu`	`pwietchner` `oregand` `tlinz`	devops ai-powered	2024-01-25

RAG is an application architecture that provides a large language model with knowledge that is not in its training set, so that the model can use that knowledge to answer user questions. To learn more about RAG, see RAG for GitLab.
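As a rough illustration of the retrieve-then-generate flow described above, the following minimal sketch retrieves the best-matching passage and places it in the prompt sent to the model. It is not GitLab's implementation: the `Document` type, the `retrieve` and `build_prompt` helpers, the word-overlap scoring, and the sample passages are all assumptions made for this example, and the call to an actual language model is left out.

```python
import re
from dataclasses import dataclass


@dataclass
class Document:
    title: str
    text: str


# Placeholder passages standing in for indexed documentation (illustrative only).
SAMPLE_DOCS = [
    Document("Runners", "Register a runner with the gitlab-runner register command."),
    Document("Merge requests", "Merge requests let you propose changes to a project."),
]


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question: str, docs: list[Document], top_k: int = 1) -> list[Document]:
    """Rank documents by how many words they share with the question.

    Word overlap is a stand-in for a real search backend (assumption).
    Documents with no shared words are dropped.
    """
    question_words = tokenize(question)
    scored = [(len(question_words & tokenize(doc.text)), doc) for doc in docs]
    scored = [pair for pair in scored if pair[0] > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:top_k]]


def build_prompt(question: str, context: list[Document]) -> str:
    """Put the retrieved passages in front of the question for the model."""
    passages = "\n".join(f"[{doc.title}] {doc.text}" for doc in context)
    return (
        "Answer the question using only the documentation below.\n\n"
        f"Documentation:\n{passages}\n\n"
        f"Question: {question}"
    )


if __name__ == "__main__":
    question = "How do I configure a runner?"
    print(build_prompt(question, retrieve(question, SAMPLE_DOCS)))
```

In a real deployment the `retrieve` step would be served by one of the search backends evaluated in this blueprint, and the resulting prompt would be sent to the language model.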

Goals of this blueprint

This blueprint aims to drive a decision for a RAG solution for GitLab Duo on self-managed, specifically for shipping GitLab Duo with access to GitLab documentation. We outline three potential solutions, including PoCs for each to demonstrate feasibility for this use case.

Constraints

The solution must be viable for self-managed customers to run and maintain.
The solution must be shippable in one to two milestones.
The solution should have low lock-in, because we are still determining our long-term technical solution(s) for RAG at GitLab.

Proposals for GitLab Duo Chat RAG for GitLab documentation

The following solutions have been proposed and evaluated for the GitLab Duo Chat for GitLab documentation use case:

Elasticsearch
PostgreSQL
Vertex AI Search

You can read more about how each evaluation was conducted on the page for each solution listed above.

Chosen solution

We will implement Vertex AI Search because of its low lock-in and because it lets us reach customers quickly. The implementation could be moved to another solution in the future.
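One common way to keep lock-in low is to have callers depend on a small search interface rather than on a specific backend. The sketch below illustrates that idea only; it does not describe how GitLab integrates Vertex AI Search. The `DocsSearch` protocol, the `InMemorySearch` class, and the `build_context` function are hypothetical names, and no Vertex AI Search client code is shown because its API is outside the scope of this page.

```python
from typing import Protocol


class DocsSearch(Protocol):
    """Hypothetical interface that any documentation search backend would implement."""

    def search(self, query: str, limit: int) -> list[str]:
        ...


class InMemorySearch:
    """Toy backend used here in place of a real search service (assumption)."""

    def __init__(self, passages: list[str]) -> None:
        self._passages = passages

    def search(self, query: str, limit: int) -> list[str]:
        # Case-insensitive substring match, returning at most `limit` passages.
        needle = query.lower()
        hits = [p for p in self._passages if needle in p.lower()]
        return hits[:limit]


def build_context(backend: DocsSearch, question: str) -> str:
    """Depends only on the interface, so the backend can be swapped later."""
    return "\n".join(backend.search(question, limit=3))


if __name__ == "__main__":
    backend = InMemorySearch(
        ["Runners execute CI jobs.", "Merge requests propose changes."]
    )
    print(build_context(backend, "runners"))
```

Under this design, moving to another solution would mean writing a new class with the same `search` method, while code such as `build_context` stays unchanged.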
