Personalization Engine

This page contains information related to upcoming products, features, and functionality. It is important to note that the information presented is for informational purposes only. Please do not rely on this information for purchasing or planning purposes. The development, release, and timing of any products, features, or functionality may be subject to change or delay and remain at the sole discretion of GitLab Inc.

Status	Authors	Coach	DRIs	Owning Stage	Created
proposed	`jmontal` `dstull`		`jtucker_gl` `ghosh-abhinaba` `jmontal` `dstull`	devops growth	2025-11-18

Summary

The Personalization Engine is a system designed to adapt the GitLab experience to individual users based on their behavior, role, and context. This design document outlines the initial implementation approach using a user metric completions tracking system that provides a foundation for personalization experiments while establishing patterns for future extensibility.

The initial implementation focuses on tracking specific user actions and milestones to enable targeted personalization experiments for new users, with a database-driven approach that allows the Growth team to iterate quickly without dependencies on external data sources.

Glossary

CQRS (Command Query Responsibility Segregation): An architectural pattern that separates read and write operations into different models, allowing independent optimization and scaling of each
DIP (Data Insights Platform): GitLab’s standardized data access layer designed to provide a unified querying interface across multiple data sources
MVC (Minimum Viable Change): The smallest change that delivers value and can be iterated upon
Snowplow: GitLab’s event tracking pipeline that collects and routes analytics events
ClickHouse: A columnar database management system optimized for analytical queries and high-volume data ingestion
Internal Events: GitLab’s unified event tracking system that routes events to multiple destinations (Snowplow, Redis counters, etc.)
Root Namespace: The top-level namespace in GitLab’s hierarchy (group or personal namespace)

Motivation

GitLab users face several challenges when navigating and utilizing the platform effectively:

Information Overload: Users are presented with too much information and too many options, making it difficult to focus on what matters most to them.
Generic Experience: All users receive the same interface and recommendations regardless of their role, experience level, or usage patterns
Poor Discoverability: Valuable features and capabilities remain hidden or hard to find for users who would benefit from them.
Inefficient Workflows: Users spend time navigating to frequently used features or recreating workflows that could be streamlined.
Onboarding Friction: New users struggle to understand how to get started and achieve their goals quickly.
Feature Adoption Gaps: Users don’t discover relevant features that could improve their productivity and success.

Goals

Enable Rapid Experimentation: Provide the Growth team with the ability to run personalization experiments quickly without waiting for complex data infrastructure.
Track User Milestones: Capture key user actions and completions that indicate progress and engagement.
Support Targeted Experiences: Enable personalized UI elements, recommendations, and guidance based on user behavior.
Establish Extensible Patterns: Create a foundation that can evolve to support more sophisticated personalization in the future.
Maintain Performance: Ensure the tracking system has minimal impact on application performance and user experience.

Non-Goals

Dynamic Audit Event Filtering: The initial implementation will not dynamically pull data from existing audit events or other data sources.
Snowflake Integration: This implementation does not integrate with Snowflake or other analytics data warehouses for real-time personalization.
Self-Managed Support (Initial): The first version is scoped to GitLab.com (SaaS) and will not support self-managed instances.
Comprehensive User Profiling: This is not a complete user profiling system; it focuses on specific, actionable metrics.
Machine Learning Recommendations: The initial version does not include ML-based recommendation engines.

Proposal

Path Forward: User Metric Completions

We propose implementing a database table design for tracking user metric completion in the Personalization Engine. This provides a static solution for v1 with possible extensibility for future dynamic audit event filtering.

Overview

The Personalization Engine will track specific user actions and milestones through a dedicated database table that records completion status for predefined metrics. This approach allows the Growth team to:

Start working on personalization experiments faster
Avoid dependencies on Snowflake or other external data sources
Maintain control over what metrics are tracked
Iterate quickly based on experiment results

User Journey Flow

User Flow Diagram

Advantages

Faster Time to Market: Allows Growth to start working on personalization experiments faster
Proven Pattern: Similar to implementation patterns that were proposed in user-centric onboarding experiences and current namespace-based onboarding progress
Performance Control: Direct database access provides predictable performance characteristics
Flexibility: Easy to add new metrics as experiments evolve

Trade-offs

Not Dynamically Driven: Data is not pulled from existing data sources (such as audit events and Snowflake)
Scope Limitation: Most likely scoped to new users initially
Maintenance Overhead: Requires maintaining separate tracking logic rather than leveraging existing event streams
Data Duplication: Some tracked data may duplicate information available elsewhere in the system

Ownership and Scope

v1 Ownership: The Personalization Engine is owned and operated by the Growth team for activation experiments on GitLab.com.

Decision-making: The Growth team determines which metrics to track and which experiences to personalize based on the Personalization Engine goals (Setup → Aha → Habit moments).

Implementation: Growth engineers implement both the metric tracking and the personalized UI elements. This is not a self-service platform for other teams in v1.

Future evolution: If other product teams express interest in using the Personalization Engine for their own personalization needs, we’ll evaluate expanding it into a shared platform with documented contribution processes. For now, it’s scoped to Growth’s activation experiments.

Concrete examples of personalization (v1):

Contextual prompts encouraging users to complete their first merge request
Onboarding tooltips that appear/disappear based on user progress milestones
Personalized guidance in empty states based on what the user has already accomplished
Targeted feature discovery based on activation stage (Setup vs. Aha vs. Habit)

Design and implementation details

Data Model

Database Schema

Personalization metrics table

The design uses a single table with normalization on read, keeping user-level tracking as the primary model while adding optional namespace context for future aggregation capabilities.

create_table :personalization_metrics do |t|
    t.bigint :user_id, null: false
    t.bigint :namespace_id, null: true
    t.integer :metric, null: false, limit: 2 # ActiveRecord enum
    t.timestamps_with_timezone null: false

    t.index [:user_id, :namespace_id, :metric], unique: true, name: 'unique_personalization_metric_user_id_namespace_id_and_metric'
    t.index [:namespace_id, :metric], name: 'index_personalization_metrics_on_namespace_id_and_metric'
end

Metric column: ActiveRecord enum defined in the PersonalizationMetric model. Example values:

code_push_activity (0)
pipeline_success (1)
merge_request_completion (2)

The smallint type (limit: 2) supports up to 32,767 metric types.

Key Design Decisions

User-level tracking: Maintains user-level metrics as the primary tracking mechanism (as currently designed)
Optional namespace context: Adds namespace_id to capture the context where the action occurred, enabling future namespace-level personalization
Normalization on read: Data is stored at the user level and normalized/aggregated when querying for namespace-level personalization
Metric identifier: Integer-based identifier for flexibility in adding new metrics
Uniqueness constraint: Ensures data integrity at the user-metric level
Composite indexing: Supports efficient queries for both user-level and namespace-level lookups
Safe concurrent writes: Upsert operations will follow established GitLab patterns (similar to OrganizationUser) to handle race conditions when metrics are recorded concurrently

Constraint Considerations with Namespace Deletion

The namespace_id foreign key will use ON DELETE SET NULL to preserve user metric data when namespaces are deleted. When a namespace is deleted, the namespace_id is nullified while keeping the user’s metric record intact, since the user is the primary entity being tracked.

PostgreSQL NULL Handling: PostgreSQL treats NULL values as DISTINCT in unique constraints (reference), which means the unique constraint [:user_id, :namespace_id, :metric] allows multiple rows with the same user_id and metric when namespace_id is NULL. This behavior supports our design goals:

Users can have the same metric recorded multiple times across different namespaces
When namespaces are deleted, the nullified namespace_id values don’t create uniqueness violations
User-level metrics (recorded without namespace context) coexist with namespace-specific metrics

Implementation: No special trigger or application-level conflict handling is required for the deletion scenario, as PostgreSQL’s NULL handling naturally supports our data model.

Trade-offs

Query complexity vs. write simplicity:

Write simplicity: Maintaining user-level tracking keeps the write path straightforward and performant. Recording metrics remains a simple insert/upsert operation without complex aggregation logic.
Query complexity: Namespace-level personalization requires aggregating user-level data at query time. This adds computational overhead to read operations but provides flexibility in how data is aggregated and interpreted.
Scalability consideration: As the number of tracked metrics and users grows, namespace-level queries, may require optimization through caching or materialized views.

Maintaining Consistency Between Event Definitions and Metric Mappings

The METRIC_MAPPINGS hash creates a link between event YAML files (e.g., config/events/push_code.yml) and the PersonalizationMetric enum values. To ensure consistency:

v1 validation:

Custom linting validates METRIC_MAPPINGS keys match event YAML files
CI check fails if mappings are out of sync
Integration tests exercise the full event → metric recording flow

Sources to keep in sync:

Event YAML files (config/events/*.yml) - Define which events trigger tracking
Model enum (PersonalizationMetric) - Define metric types in database
METRIC_MAPPINGS hash - Links events to metrics

Future considerations:

Code generation from YAML to enum (or vice versa) if the number of metrics grows significantly
Shared constants file if other systems need to reference these metrics

Service Layer Architecture

The Personalization Engine will use a service-oriented architecture with clear separation of concerns and an abstraction layer to enable future storage backend changes.

Internal Events Integration

Leveraging Analytics Single instrumentation layer

The Personalization Engine will integrate with GitLab’s existing Internal Events system using the proven Single Instrumentation Layer, similar to how AI usage tracking is implemented.

Architecture Overview

Instead of modifying core tracking logic, we’ll extend the event tracking system through event definition configuration:

PersonalizationTracking Module: A dedicated tracking module that receives events from the Internal Events router
Event Configuration: YAML-based event definitions that declare personalization tracking
Storage Adapter: An abstraction layer supporting multiple backends (PostgreSQL, and ClickHouse)

Implementation Pattern

Tracking Module Example lib/gitlab/tracking/personalization_tracking.rb

module Gitlab
    module Tracking
        module PersonalizationTracking
            class << self
                def track_event(event_name, **context)
                    storage_adapter.record_metric(event_name, context)
                end

                private

                def storage_adapter
                    if Feature.enabled?(:personalization_clickhouse_storage)
                        ClickHouseAdapter.new
                    else
                        PostgresAdapter.new
                    end
                end
            end

            class PostgresAdapter
                def record_metric(event_name, context)
                    user = context[:user]
                    namespace = context[:namespace]&.root_namespace || context[:project]&.root_namespace
                    metric_type = METRIC_MAPPINGS[event_name]

                    return unless user && metric_type

                    # Check if metric already exists to avoid unnecessary upsert
                    return if PersonalizationMetric.exists?(
                        user_id: user.id,
                        namespace_id: namespace&.id,
                        metric: metric_type
                    )

                    PersonalizationMetric.upsert(
                        {
                            user_id: user.id,
                            metric: metric_type,
                            namespace_id: namespace&.id
                        },
                        unique_by: [:user_id, :namespace_id, :metric]
                    )
                end

                METRIC_MAPPINGS = {
                    'push_code' => :code_push_activity,
                    'pipeline_succeeded' => :pipeline_success,
                    'merge_request_merged' => :merge_request_completion
                }.freeze
            end

            class ClickHouseAdapter
                def record_metric(event_name, context)
                    # Future implementation for ClickHouse/DIP
                end
            end
        end
    end
end

Event Configuration config/events/push_code.yml

action: push_code
category: source_code
internal_events: true
extra_trackers:
    - tracking_class: Gitlab::Tracking::PersonalizationTracking
      protected_properties: {}

Event Flow

Application code triggers event Gitlab::InternalEvents.track_event('push_code', user: current_user, project: project)

Automatic routing:

Snowplow tracking
Redis counter updates if metric definitions exist
PersonalizationTracking.track_event (via extra_trackers)
Record to personalization_metrics table

Advantages

Non-Invasive: No modifications are required to Internal events core logic
Proven Pattern: Follows the same architecture as ee/lib/gitlab/tracking/ai_tracking.rb
Declarative Configuration: Events configured via YAML, similar to ee/config/events/code_suggestion_rejected_in_ide.yml
Independent Testing: Personalization tracking can be tested in isolation
Feature Flag Control: Easy to enable/disable without code changes
Future-Proof: Storage adapter pattern enables seamless migration to ClickHouse/DIP

Reference Implementations

Core Internal Events: lib/gitlab/internal_events.rb - Event routing and extra_trackers invocation
Contribution Analytics: lib/gitlab/tracking/contribution_analytics_tracking.rb - Similar tracking pattern
AI Tracking Example: ee/lib/gitlab/tracking/ai_tracking.rb - Similar tracking pattern
Event Schema: db/docs/ai_code_suggestion_events.yml - Documentation pattern for tracked events
Event Configuration: ee/config/events/code_suggestion_rejected_in_ide.yml - YAML configuration example

Migration Path to CQRS

The storage adapter pattern naturally supports future CQRS evolution:

Write Path: PostgresAdapter → ClickHouseAdapter for high-volume event ingestion (via batched writes or Snowplow pipeline ingestion to avoid ClickHouse antipatterns)
Read Path: Direct PostgreSQL queries → DIP querying API for analytics
Transition: Feature flags control gradual migration without API changes
Compatibility: Maintains same interface as DIP support evolves

Note on ClickHouse Write Patterns: Small, frequent writes to ClickHouse are an antipattern that can degrade query performance due to inefficient data merges. The migration path accounts for this by either:

Batching writes at the application layer before sending to ClickHouse
Leveraging the Snowplow pipeline for event ingestion (which handles batching)
Using DIP’s ingestion API once available, which will handle optimal write patterns

This approach aligns with the design document’s goals of rapid experimentation while establishing extensible patterns for future sophistication.

Storage Abstraction

To support future migration to alternative storage backends (such as ClickHouse, Data Insights Platform (DIP), or other data stores), the implementation should use an adapter pattern or similar abstraction:

Storage Interface: Define a clear interface for metric recording and querying operations
Initial Implementation: ActiveRecord-backed adapter using the application database
Future Flexibility: Enable swapping storage backends without rewriting application logic
Insulation Layer: Isolate storage implementation details from business logic and API consumers

This abstraction allows the MVC to start simple with ActiveRecord and the local database while preserving the ability to migrate to more specialized storage solutions as requirements evolve. For example, the Data Insights Platform (DIP) is GitLab’s standardized data access layer designed to provide a unified querying interface across multiple data sources. Once DIP supports the necessary backends (like ClickHouse), we can migrate to use it as our read interface while maintaining the same API surface for consumers.

Metric Recording

A metric recording service will be responsible for:

Validating and recording user metric completions
Supporting user-level metrics with optional namespace context
Capturing the namespace where the action occurred when available
Delegating storage operations to the storage adapter
Optimized upsert logic: Check for existing records before attempting upsert to minimize unnecessary database operations (following the OrganizationUser pattern)

Recording Location Guidelines:

To maintain database performance and proper replica utilization, metrics should only be recorded in contexts that already write to the database:

State-changing requests: PUT, POST, DELETE operations that modify data
Background jobs: Sidekiq workers processing async events

Metrics should never be recorded during:

GET requests: To avoid session stickiness to the primary database and maintain read replica utilization
Read-only operations: Any operation that doesn’t already require a database write

This ensures metric recording doesn’t introduce unnecessary primary database load.

v1 Implementation:

Synchronous recording within the request cycle (similar to onboarding_progresses table, stable since GitLab 13.8)
Metrics recorded at root namespace level (no hierarchy aggregation needed)
Expected load similar to existing onboarding progress tracking

Future Evolution:

Async processing via Redis/queue for high-volume metrics
Migration to analytical database (ClickHouse/DIP) when write volume exceeds PostgreSQL comfort zone
Event-driven architecture for real-time personalization

Metric Querying

A metric query service will provide:

Checking if a user has completed specific metrics
Retrieving user progress across all tracked metrics
Normalizing and aggregating user-level data for namespace-level personalization queries
Supporting queries for personalization experiments and UI customization
Delegating read operations to the storage adapter

The storage abstraction enables caching strategies and query optimizations to be implemented independently of the business logic. For namespace-level queries, the service will aggregate user-level metrics at read time, allowing flexible interpretation of namespace progress and engagement patterns.

Integration Points

The metric recording service will need to be triggered when specific user actions occur. The specific integration points and implementation approach will be determined during development based on existing GitLab patterns and performance requirements.

Data Access

The Personalization Engine will need to expose metric data to support:

UI Personalization: Frontend components need to query user metric completion status to customize the interface

The specific access patterns and technologies will be determined during implementation based on performance requirements and existing GitLab patterns.

Performance Considerations

The Personalization Engine must maintain GitLab’s performance standards:

Metric Recording: Should not block or slow down user-facing operations
Metric Querying: Must support real-time UI personalization without introducing latency
Database Impact: Minimize load on the primary database
User Scalability: Design should accommodate growth in number of users
Metric Extensibility: As an MVC, adding new metrics will require code changes; future iterations may explore more dynamic approaches

Specific optimization strategies (caching, indexing, partitioning, etc.) will be determined during implementation based on measured performance characteristics and actual usage patterns.

Monitoring and Observability

The Personalization Engine will establish comprehensive monitoring to ensure reliable operation and inform scaling decisions.

Operational Metrics:

System Health:
- Recording service errors and success rates
- Query service errors and response times
- Database connection pool utilization
- Background job processing metrics
Performance Metrics:
- Query latency (p50, p95, p99) for metric lookups
- Write throughput: Metric records created per day/hour
- Read throughput: Query frequency and patterns
- Database query performance and slow query tracking
Storage Metrics:
- Table size and growth rate
- Index efficiency and usage
Usage Patterns:
- Which metrics are being tracked and their frequency
- User and namespace coverage
- Feature flag adoption rates
Business Impact:
- Personalization experiment effectiveness
- User engagement with personalized features
- A/B test results and conversion metrics

Specific logging formats, alerting thresholds, and monitoring dashboards will be defined during implementation.

Capacity Planning

Initial Load Estimation:

To ensure the database schema is appropriately designed and set expectations for backend transitions, we’ll use existing metrics as proxies:

Write volume proxy: Use onboarding_progresses table as a reference - it tracks similar user milestone completions with comparable write patterns and scale
User base: Scope v1 to new users (estimated subset of total GitLab.com users) to bound initial scale
Metric cardinality: Start with 3-5 tracked metrics per user to establish baseline storage requirements
Read patterns: Personalization UI queries will be user-scoped (single-user lookups), not analytical aggregations

Expected v1 Characteristics:

Write path: Asynchronous metric recording via background jobs to avoid blocking user requests
Read path: Simple indexed lookups by user_id and metric type
Storage: Estimated growth based on (new users per month × metrics tracked × record size)

Baseline Establishment:

Timeline: First 3 months of production use
Approach: Monitor actual usage patterns against initial estimates to validate assumptions and establish performance baselines

Scaling Decision Points:

Based on monitored metrics, we’ll trigger scaling actions when:

Write volume consistently exceeds PostgreSQL comfort zone → Evaluate async batching or ClickHouse migration
Read latency degrades beyond acceptable thresholds → Implement caching layer or read replicas
Storage growth trajectory indicates capacity issues → Plan migration to DIP or implement partitioning
Query complexity impacts performance → Consider materialized views or CQRS pattern

The storage adapter pattern enables backend migration based on observed data, allowing us to scale incrementally as actual needs emerge while initial estimates ensure the schema is appropriately designed.

Implementation Approach

The Personalization Engine will be implemented in phases to enable rapid iteration and learning:

Phase 1: Foundation

Create database table and indexes with system for recording and querying metrics
Extend internal events to include personalization efforts outlined in internal events integration section
Implement activation metrics (Setup, Aha, and Habit)

Phase 2: Expansion

Add remaining metric integrations
Expand data access capabilities
Begin running personalization experiments

Alternative Solutions

Alternative 1: POC Connector - Direct Snowflake Integration

Description: Query Snowflake directly for user behavior data.

Pros:

Access to comprehensive historical data
Leverage existing analytics infrastructure
No need to maintain separate tracking

Cons:

Violates architectural principle that Snowflake is for analytics, not operational use ( see Data Insights Platform design for the established data access patterns)
High latency for real-time personalization
Requires complex data access patterns
Dependency on external system for core functionality
Does not use existing infrastructure (DIP)
Rejected: Does not align with architectural principles

Alternative 2: DIP Extension - Access to Snowflake Data

Description: Extend DIP to support Snowflake connectivity and use it as the data access layer.

Pros:

Aligns with architectural mandate to use DIP (see Data Insights Platform design which establishes DIP as the standardized data access layer)
Centralized data access layer
Future-proof solution

Cons:

DIP does not currently support Snowflake
Not on DIP roadmap for FY26
Timeline exceeds Growth team’s needs
Blocks personalization experiments
Rejected: Timeline incompatible with business needs

Alternative 3: DIP Extension - Snowplow Events Direct to ClickHouse

Description: Add ClickHouse as an additional destination in the Snowplow pipeline so DIP can access it.

Pros:

Aligns with DIP architecture
Leverages existing event streams
Scalable for high-volume data

Cons:

Requires infrastructure work with unclear ownership
Timeline uncertainty (likely FY27Q1)
Dependency on Analytics team roadmap
Blocks immediate experimentation needs
Deferred: May be considered for future iterations

Blockers

This is currently blocked by DIP ClickHouse support work

Future Considerations

Extensibility Path

The current design provides a foundation for future enhancements:

Dynamic Metric Definitions: Allow defining new metrics without code changes
Metric Aggregation: Support complex aggregations across multiple base metrics
Predictive Metrics: Use ML to predict user behavior and needs
Real-time Streaming: Integrate with event streaming for real-time updates

CQRS Pattern Evolution

The storage abstraction layer enables evolution toward a possible Command Query Responsibility Segregation (CQRS) pattern:

Separate Write and Read Models: The adapter pattern allows different storage backends for writes (commands) and reads (queries)
Write Optimization: Metric recording could write directly to ClickHouse, optimized for high-volume event ingestion
Read Optimization: Metric querying could use DIP (Data Insights Platform) as the read interface, providing a standardized querying layer
Async Synchronization: ClickHouse data would be accessible through DIP’s querying API once DIP supports ClickHouse as a backend
Independent Scaling: Write and read workloads could scale independently based on their specific performance characteristics

This approach would allow the Personalization Engine to handle high-volume metric recording while providing fast, complex queries for personalization through GitLab’s standard data access patterns.

Migration to DIP

When DIP supports the required data sources, we can:

Maintain the same API surface
Gradually migrate backend to query DIP instead of local database
Use feature flags to control rollout
Deprecate local tracking once DIP is feature-ready for this use case

Self-Managed Support

Future iterations may include:

Opt-in metric tracking for self-managed instances

References

Implementation References

Design Documents

Documentation

Last modified December 9, 2025: Fix personalization image links (131974fd)

View page source - Edit this page - please contribute.

Personalization Engine

Summary

Glossary

Motivation

Goals

Non-Goals

Proposal

Path Forward: User Metric Completions

Overview

User Journey Flow

Advantages

Trade-offs

Ownership and Scope

Design and implementation details

Database Schema

Personalization metrics table

Key Design Decisions

Constraint Considerations with Namespace Deletion

Trade-offs

Maintaining Consistency Between Event Definitions and Metric Mappings

Service Layer Architecture

Internal Events Integration

Leveraging Analytics Single instrumentation layer

Architecture Overview

Implementation Pattern

Event Flow

Advantages

Reference Implementations

Migration Path to CQRS

Storage Abstraction

Metric Recording

Metric Querying

Integration Points

Data Access

Performance Considerations

Monitoring and Observability

Capacity Planning

Implementation Approach

Phase 1: Foundation

Phase 2: Expansion

Alternative Solutions

Alternative 1: POC Connector - Direct Snowflake Integration

Alternative 2: DIP Extension - Access to Snowflake Data

Alternative 3: DIP Extension - Snowplow Events Direct to ClickHouse

Future Considerations

Extensibility Path

CQRS Pattern Evolution

Migration to DIP

Self-Managed Support

References

Related Epics and Issues

Implementation References

Design Documents

Documentation