Data Team Organization
Data Team Organization
The Data Team Organization model is guided by three primary business needs:
- The need for bespoke data solutions unique to the GitLab business.
- The need for high-performance and reliable data storage and compute platform to support distributed analyst teams.
- The need for centers of excellence for data technologies and advanced analytics.
- The need for flexible data solutions driven by varying urgency and quality requirements.
Based on these needs, the Data Team is organized in the following way:
- Data Pods: Pods are assembled to provide concentrated focus on delivering & maintaining data products for strategic company initiatives. Pods are staffed with multiple data personas including Data Analyst, Data Scientist, Analytics Engineer, and supported by Data Engineer as stable counterpart.
- Analytics Engineering: Transform raw data into clean, structured, and usable formats for data decision-making. The Lead Analytics Engineer serves as a stable counterpart for business departments and functional analytics teams.
- Data Platform & Engineering Team: Center of Excellence for data technologies, including owning and operating the Data Stack
- Data Science Team: Center of Excellence for advanced analytics, including delivery of data science projects to the business
Data Pod Assignments
POD | Data Product Manager | Analytics Engineer | Data Analyst | Data Scientist |
---|---|---|---|---|
Company KPIs | @iweeks | @lisvinueza @chrissharp @annapiaseczna | @annie-analyst | |
Customer Intelligence | @nmcavinue | @jonglee1218 | ||
Customer Product Adoption | @mdrussell | @michellecooper @utkarsh060 |
Analytics Engineering - Stable Counterpart Assignments
Department / Division | Functional Analytics Team / Steering Committee | Analytics Engineer |
---|---|---|
Sales | Revenue Strategy and Analytics | @lisvinueza |
Marketing | Marketing Strategy and Analytics | @chrissharp |
Finance | FP&A Analytics | @chrissharp |
Customer Success | CS Strategy and Analytics | @mdrussell |
Product | Product Data Insights | @michellecooper |
Engineering | Engineering Analytics | @michellecooper |
Security | Engineering Analytics | @michellecooper |
Support | N/A | @michellecooper |
People | People Analytics | @rakhireddy |
R&D | Product Usage Data Architecture Steering Committee | @pempey |
Data Platform Team Stable Counterpart Assignments
POD | Data Engineer |
---|---|
Enterprise Metrics | @juwong |
Customer Intelligence | @rigerta |
Customer Product Adoption | @rbacovic |
Manager, Data
In support of the Data Pod, the Manager, Data fulfills the below responsibilities from the Senior Manager, Data Job Responsibilites:
- Works with the Director, Data to envision and draft Quarterly Objectives, driven by requirements gathered from multiple business partners.
- Monitor, measure, and improve key aspects of the Data Pods.
- Regularly meet with business partners to it/data-team/stand and solve for data needs.
- Serve as a primary or back-up Maintainer on the Data Team Project. Provide final review, feedback, and approval of Merge Requests submitted by the Data Pod and stable counterparts.
Lead Analytics Engineer (Stable Counterparts for the Business)
In support of the Data Pod and Stable Counterpart relationships, the Lead Analytics Engineer fulfills the below responsibilities from the Senior Analytics Engineer Job Responsibilites:
- Own one or more stakeholder relationship in Go To Market, Research & Development, General & Administrative, Financial Analytics, or Engineering Analytics business functions.
- Co-DRI of Key Results along with the Manager, Data.
- Lead work breakdown sessions for OKRs.
- Work with functional stakeholders to prioritze
P3-Other
issues. - Serve as a primary or back-up Maintainer on the Data Team Project. Provide final review, feedback, and approval of Merge Requests submitted by the Data Pod and stable counterparts.
- Review the weekly stand-up and provide support as needed to unblock team members and answer questions.
Data Platform Team Stable Counterpart
Following the GitLab Stable Counterpart principles, every Data Pod have a Data Platform Team Stable Counterpart assigned. The Data Platform Stable Counterpart divides their time, work and priorities between the Data Platform Team and Data Pod (general an average of 50% each, P2-OKR scheduled ahead of the quarter in collaboration with the respective Pod). The Stable Counterpart is aware of the direction and priorities of the Data Pod and when needed brought into discussion with the Data Platform Team. I.e. when there is a bigger demand than the Stable Counterpart can handle in the assigned availability or architectural direction needs to change. The Stable Counterpart recognize, flags and address this with the applicable stakeholders (in general the Lead/DRI of the Data Platform Team and the Data Pod).
The stable counterpart is expected to participate in the following meetings asynchronously or synchronously. When in doubt, please reach out to the Data Pod Manager to learn which meetings on the calendar you should participate in. In general, the meetings in scope are as follows:
- Data Pod Iteration Planning Meetings.
- Data Pod Team Meetings.
Data Program Recruiting
Recruiting great people is critical to our success and we’ve invested much effort into making the process efficient. Here are some reference materials we use:
- a GitLab Data Recruiting video to say “Hi” and give you some insight into how we work and what we work on. Enjoy!
- Data Roles and Career Development to help existing team members and prospects it/data-team/stand growth opportunities
- a Take Home Test that we ask each candidate to complete; this test is good for the candidate and for us because it represents the type of work we perform regularly and if the candidate is not interested in this work it helps them make a more informed decision about their application
Data Roles and Career Development
Data Internships
Data Platform
graph LR; subgraph Data Engineering Roles supe:de(Data Engineer)-->supe:sde(Senior Data Engineer); supe:sde(Senior Data Engineer)-->supe:fde(Staff Data Engineer); end click supe:de "https://handbook.gitlab.com/job-families/finance/data-engineer#data-engineer"; click supe:sde "https://handbook.gitlab.com/job-families/finance/data-engineer#senior-data-engineer"; click supe:fde "https://handbook.gitlab.com/job-families/finance/data-management#staff-data-engineer";
Intermediate and Senior Data Engineer Onboarding Timeline
By Day 30 | By Day 60 | By Day 90 | By Day 120 |
---|---|---|---|
Complete People and Data Onboarding | Perform triage activities | Extract new data sources | Own a specific area of the data platform |
Create a MR to contribute to handbook or templates | Investigate incidents and issues | Work on OKR assignments | Propose new ideas and come up with Data Platform improvement initiatives |
it/data-team/stand the current setup of the data platform | Make small/corrective changes to the platform infrastructure or data pipelines | Contribute on work breakdown |
Data Analyst
graph LR; subgraph Data Analyst Roles supe:ida(Data Analyst Intern)-->supe:jda(Junior Data Analyst); supe:jda(Junior Data Analyst)-->supe:da(Data Analyst); supe:da(Data Analyst)-->supe:sda(Senior Data Analyst); supe:sda(Senior Data Analyst)-->supe:fda(Staff Data Analyst); end click supe:ida "https://handbook.gitlab.com/job-families/finance/data-analyst#data-analyst-intern"; click supe:jda "https://handbook.gitlab.com/job-families/finance/data-analyst#junior-data-analyst"; click supe:da "https://handbook.gitlab.com/job-families/finance/data-analyst#data-analyst"; click supe:sda "https://handbook.gitlab.com/job-families/finance/data-analyst#senior-data-analyst"; click supe:fda "https://handbook.gitlab.com/job-families/finance/data-analyst#staff-data-analyst";
Intermediate and Senior Data Analyst Onboarding Timeline
By Day 30 | By Day 60 | By Day 90 | By Day 120 |
---|---|---|---|
Complete People and Data Onboarding | Extend an existing Tableau dashboard or complete the triage phase for a dbt issue | Run a project end-to-end as DRI with support from a Data Fusion Team | Create ERDs/Data Artifacts (e.g. dashboards) or complete a product evaluation |
Start attending Data Fusion Team and Business Team synchronous meetings | Perform triage activities | ||
Complete First Issue: S to M T-Shirt Size |
Data Science
graph LR; subgraph Data Science Roles supe:ds(Data Scientist)-->supe:sds(Senior Data Scientist)-->supe:stds(Staff Data Scientist)-->supe:pds(Principal Data Scientist); end click supe:ds "https://handbook.gitlab.com/job-families/finance/data-science/#data-scientist-intermediate"; click supe:sds "https://handbook.gitlab.com/job-families/finance/data-science/#senior-data-scientist"; click supe:stds "https://handbook.gitlab.com/job-families/finance/data-science/#staff-data-scientist"; click supe:pds "https://handbook.gitlab.com/job-families/finance/data-science/#principal-data-scientist";
Intermediate and Senior Data Scientist Onboarding Timeline
By Day 30 | By Day 60 | By Day 90 | By Day 120 |
---|---|---|---|
Complete People and Data Onboarding | Meet stakeholders across the organization | Re-train or enhance an existing data science model | Make a contribution to improve the Data Science handbook, packages, or processes |
Start attending Data Science Team meetings | Refine/improve one data science dashboard | Work on OKR assignments | Take ownership of at least one quarterly OKR |
it/data-team/stand the current data science systems and processes |
Analytics Engineering
Analytics Engineering Job Family
Analytics Engineering Job Family
graph LR; subgraph Analytics Engineer Roles supe:ae(Analytics Engineer)-->supe:sae(Senior Analytics Engineer); supe:sae(Senior Analytics Engineer)-->supe:fae(Staff Analytics Engineer); supe:fae(Staff Analytics Engineer)-->supe:pae(Principal Analytics Engineer); end click supe:ae "https://handbook.gitlab.com/job-families/marketing/enterprise-data/analytics-engineer#analytics-engineer-intermediate"; click supe:sae "https://handbook.gitlab.com/job-families/marketing/enterprise-data/analytics-engineer#senior-analytics-engineer"; click supe:fae "https://handbook.gitlab.com/job-families/marketing/enterprise-data/analytics-engineer#staff-analytics-engineer"; click supe:pae "https://handbook.gitlab.com/job-families/marketing/enterprise-data/analytics-engineer#principal-analytics-engineer";
Intermediate and Senior Analytics Engineer Onboarding Timeline
By Day 30 | By Day 60 | By Day 90 | By Day 120 |
---|---|---|---|
Complete People and Data Onboarding | Extend an existing dbt Trusted Data Models | Run a project end-to-end as DRI with support from a Data Fusion Team | Create ERDs/Data Artifacts |
Start attending Data Fusion Team and Business Team synchronous meetings | Perform triage activities | ||
Complete First Issue: S to M T-Shirt Size |
Data Management
graph LR; subgraph Data Management Roles supe:md(Manager, Data)-->supe:smd(Senior Manager, Data); supe:smd(Senior Manager, Data)-->supe:dd(Director, Data); supe:dd(Director, Data)-->supe:sdd(Senior Director, Data); end click supe:md "https://handbook.gitlab.com/job-families/finance/manager-data/#manager-data-intermediate"; click supe:smd "https://handbook.gitlab.com/job-families/finance/manager-data/#senior-manager-data"; click supe:dd "https://handbook.gitlab.com/job-families/finance/data-and-insights-executive/#director-data-and-analytics"; click supe:sdd "https://handbook.gitlab.com/job-families/finance/data-and-insights-executive/#senior-director-data-and-analytics";
Data Manager Onboarding Timeline
By Day 30 | By Day 60 | By Day 90 | By Day 120 |
---|---|---|---|
Complete People, Data, and Manager Onboarding | Meet everyone on the team and business data champions | Complete a Team Assessment | Draft a people development Roadmap |
it/data-team/stand the current setup of the data platform | Work on OKR assignments and map them to the data platform | Lead discussions with Users/Stakeholders on initiatives and OKRs | Draft a program development Roadmap (Process Improvements /Future State) |
Add a new page to the handbook | Make regular contributions to the handbook spanning your area of management | Become DRI for major portions of the Data Handbook | System/Application Change Control Management of one or more modules |
Tool Technology Tandem
Tool Technology Tandems (TTT) are supporting to get the maximum value out of business opportunities we have in the Data Program. TTT are experts in a specific (software) tool or technology to support business opportunities or challenges we have by leveraging the tool or technology to the maximum. Although this is not the goal, we want to get the maximum value out of our technology stack. At the moment we see that we are not leveraging our technology stack to the maximum, where there are useful features or opportunities in our technology that could support in fulfilling business opportunities.
The reason is that from the technology side we don’t know the business and from the business side we don’t know the technology. The TTT will bridge this gap by it/data-team/standing the needs and bring this together in a technological way. We expect from TTT to do consulting, guiding and educating.
Note: TTT will not search for business opportunities to use any tool feature. TTT has to it/data-team/stand business opportunities and translate this into what software could bring to the table.
A single TTT consists of minimum 2 and maximum 3 GitLab Team Members with different roles. There are no requirements in which team a Team Member is part of(so this could be outside of the central Data Team as well) as long as the TTT meets the expectations described below.
Tool / Technology | Tandem |
---|---|
Snowflake | t.b.d. |
Monte Carlo | t.b.d. |
dbt | t.b.d. |
Tableau | t.b.d. |
What do we expect from TTT
- We expect TTT to get in touch with our business partners and all functions that contribute to the data program or work with our Data Platform, to it/data-team/stand their challenges.
- We expect TTT to get up to date with the latest in their area. They it/data-team/stand the full capabilities of the tool / technology, have regular touchpoints with the respective vendor and have a good it/data-team/standing of the latest released features.
- TTT will guide and educate our business partners.
- TTT will initiate design-spikes for quarterly planning.
Data Platform at GitLab
Data Science Handbook
Data Steering Committee at GitLab
Data Team Internships
cdb9f580
)