Validity Checks for Secret Detection findings

This page contains information related to upcoming products, features, and functionality. It is important to note that the information presented is for informational purposes only. Please do not rely on this information for purchasing or planning purposes. The development, release, and timing of any products, features, or functionality may be subject to change or delay and remain at the sole discretion of GitLab Inc.

Status	Authors	Coach	DRIs	Owning Stage	Created
ongoing	`craigmsmith, atiwari71, ngeorge1`	`theoretick`	`craigmsmith, atiwari71, abellucci, amarpatel`	devops secure	2025-07-08

Summary

Verify the validity and liveness of Secret Detection findings by programmatically checking them against their issuing services. This enables security teams to prioritize remediation of active secrets over revoked or inactive credentials.

See the Verify validity/liveness of Secret Detection findings epic for more details.

Motivation

Goals

Provide token validity status for both GitLab and partner tokens to Ultimate customers
Enable partner token validation (AWS, GCP, Postman etc.) with strict rate limit compliance
Support both SaaS and self-managed deployments without additional infrastructure
Preserve error budgets by treating rate limits as expected conditions rather than failures

Non-Goals

Cross Scanner Integration (extend to DAST)
Extending support to other target types
Building a generic API gateway service
Supporting arbitrary third-party validation endpoints
Real-time synchronous validation during scan execution

Proposal

We will automate the verification process for tokens discovered during security scans. This feature will:

Verify token status (Active, Inactive, Unknown) of both GitLab and partner tokens, and display results on vulnerability pages
Enable sorting and filtering findings by token status
Support GitLab and partner platform tokens
Work for cloud and self-managed/air-gapped instances
Include telemetry to measure usage and effectiveness

Customers can opt in using Security Configurations. Once enabled, discovered tokens will be automatically verified against issuing services, with status information displayed in the Vulnerability details page and Security Dashboard, allowing security teams to prioritize remediation of active credentials.

Decisions

Challenges

Rate limit compliance: Partner API limits are restrictive (e.g., Postman: 5 req/s)
Error budget preservation: Must distinguish between rate limiting and actual failures
Refresh endpoint abuse protection: Manual token status refresh functionality must be protected against malicious or excessive usage
Feature-level abuse prevention: Overall validity checks feature requires safeguards to prevent abuse, particularly around partner token status checking volume and frequency
Future scalability: May need dedicated partner endpoints for GitLab.com scale

Design and implementation details

The validity checks capability relies on a three-phase rollout: Experiment, Beta, and GA. Each phase builds upon the previous to deliver a comprehensive token status checking experience.

Experiment Phase - GitLab Token Status Display

The experiment phase focuses on GitLab tokens only and implements token status checking using a Sidekiq worker approach. When a Secret Detection scan is executed and its report is ingested, a Sidekiq worker will be triggered automatically. This worker will:

Process all GitLab tokens discovered during the scan
Check each token’s current status in the Database
Assign one of three status values to the corresponding vulnerability finding:
- Unknown: Status check could not be completed
- Active: Token is currently active
- Inactive: Token is no longer active

sequenceDiagram
    participant SD as Pipeline SD Scan
    participant RW as Report Ingestion Worker
    participant TW as Token Status Worker
    participant DB as Database
    Note over SD: gl-secret-detection-report.json
    SD->>RW: Trigger report ingestion
    RW->>TW: Trigger token status check
    TW->>DB: Query token status in Database
    DB-->>TW: Return status
    TW->>DB: Update finding in Database

Beta Phase - Enhanced User Experience and Controls

The Beta phase focuses on improving user experience and providing customer controls for the validity checks feature.

GA Phase - Partner Integration

The GA phase expands token status verification to include partner platform tokens, providing comprehensive visibility across GitLab and partner services.

Architecture Overview

graph LR
    A[Secret Detection] --> B[UpdateTokenStatusWorker]
    B --> C[PartnerTokenValidationWorker]
    C --> D{Rate Limit Check}
    D -->|Under Limit| E[Partner API Call]
    D -->|Throttled| F[perform_in with delay]
    F --> C
    E --> G{Response}
    G -->|Success| H[Save to Database]
    G -->|Network Error| I[Retry with backoff]
    I --> F
    
    style D fill:#e3f2fd
    style E fill:#fff3e0
    style H fill:#e8f5e8

Execution Flow

sequenceDiagram
    participant SD as Secret Detection
    participant UW as UpdateTokenStatusWorker
    participant PW as PartnerTokenValidationWorker
    participant RL as ApplicationRateLimiter
    participant API as Partner API
    participant DB as Database
    
    SD->>UW: Trigger on report ingestion
    UW->>PW: Schedule validation
    PW->>RL: throttled?(rate_limit_key)
    
    alt Rate Limited
        RL-->>PW: true (throttled)
        PW->>PW: perform_in(delay, args)
        Note over PW: Requeue without exception
    else Under Limit
        RL-->>PW: false (proceed)
        PW->>API: call_partner_api
        alt Success
            API-->>PW: Token status
            PW->>DB: Update finding
        else Network Error
            API-->>PW: Error
            PW->>PW: perform_in(backoff, args)
            Note over PW: Retry with backoff
        end
    end

Security Considerations

Based on security analysis in #562364, we’ve identified and mitigated the following risks:

Denial of Service (DoS) Protection

Risk: Malicious users could flood Sidekiq with verification requests by generating gl-secret-detection-report.json files with numerous secrets across multiple pipelines.

Mitigations:

Implement per-project rate limiting for token validation requests
Add circuit breaker pattern for partner API failures
Monitor and alert on unusual validation patterns

Response Parsing Security

Risk: Parsing third-party API responses exposes the monolith to malformed data injection and unexpectedly large responses.

Mitigations:

Strict response size limits (max 10KB per response)
Response schema validation before parsing
Timeout enforcement on API calls
Sanitize all data before storage

Credential Management

Risk: Partner API keys stored in monolith could be exposed in self-managed instances.

Mitigations:

Current partner APIs (AWS, GCP, Postman) use public endpoints - no credentials needed
For future private APIs, use encrypted settings with key rotation
Implement allowlist of permitted partner endpoints
Audit logging for all partner API configuration changes

Input Validation

Risk: Malicious or malformed tokens could exploit partner APIs or our validation logic.

Mitigations:

Token format validation before API calls
Length limits on token values (max 4KB)
Character set validation
Rate limiting per token pattern to prevent enumeration

Monitoring and Alerting

Implement comprehensive monitoring to detect security issues

Alternative Solutions

SDRS Service Approach (Withheld)

Initially proposed separate service architecture. Rejected after discovering:

All partner APIs are public
No protected credentials needed
Added infrastructure complexity for self-managed

View page source - Edit this page - please contribute.

Validity Checks for Secret Detection findings

Summary

Motivation

Goals

Non-Goals

Proposal

Decisions

Challenges

Design and implementation details

Experiment Phase - GitLab Token Status Display

Beta Phase - Enhanced User Experience and Controls

GA Phase - Partner Integration

Architecture Overview

Execution Flow

Security Considerations

Denial of Service (DoS) Protection

Response Parsing Security

Credential Management

Input Validation

Monitoring and Alerting

Alternative Solutions

SDRS Service Approach (Withheld)

GitLab Secret Detection Validity Checks ADR 001: Use sidekiq worker approach for token status checking

GitLab Secret Detection Validity Checks ADR 002: Use REST over gRPC for SDRS communication

GitLab Secret Detection Validity Checks ADR 003: Use Secret Detection Response Service (SDRS)

GitLab Secret Detection Validity Checks ADR 004: Use direct partner API calls instead of SDRS