Magnet.me  -  The smart network where students and professionals find their internship or job.

The smart network where students and professionals find their internship or job.

Engineering Manager, Cloud Capacity, SaaS Production Engineering

Job Remote
Posted 5 Feb 2026
Share:
Work experience
5 to 10 years
Full-time / part-time
Full-time
Job function
Degree level
Required language
English (Fluent)

Build your career on Magnet.me

Create a profile and receive smart job recommendations based on your liked jobs.

Engineering Manager, Cloud Capacity, SaaS Production Engineering

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world.

The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our high-performance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems.

An overview of this role

As the Engineering Manager for Cloud Capacity, GitLab SaaS Production Engineering, you will build and lead a high-performing, fully distributed team that operates and scales GitLab's multi-tenant SaaS infrastructure. You'll guide the team through a strategic consolidation effort that aligns multi-tenant and single-tenant deployments around shared tooling and processes, reducing duplication, simplifying operations, and improving reliability across all production environments. You'll own cloud capacity planning and vendor relationships, collaborate closely with Product Management and other Infrastructure Platforms and Engineering teams on roadmap and backlog health, and participate in incident management to help ensure GitLab.com remains available, secure, and scalable for customers. Alongside the technical mandate, you'll focus on developing a strong, collaborative engineering culture and growing team members into capable technical leaders.

What you'll do

  • Lead a high-performing Cloud Capacity team within GitLab SaaS Production Engineering, creating an environment where team members can do their best work and grow.
  • Drive the consolidation of multi-tenant and single-tenant SaaS infrastructure tooling and processes into cohesive, standardized approaches that simplify operations and improve reliability.
  • Own cloud capacity planning and operations, including maintaining effective relationships with cloud partners and other infrastructure vendors.
  • Manage the team's roadmap and project work in partnership with Product Management, ensuring priorities are clear and the backlog remains in a healthy state.
  • Participate in the Incident Management on-call rotation, working with reliability and development teams to meet availability goals for GitLab.com and other SaaS offerings.
  • Collaborate across Infrastructure Platforms, other Infrastructure teams, Support, and Customer Success Management to deliver a consistent, high-quality customer experience.
  • Champion automation, secure-by-default practices, and sound engineering principles to strengthen the availability, security, and scalability of GitLab SaaS production environments.
  • Mentor and develop individual contributors into strong technical leaders, fostering a collaborative, inclusive, and results-focused engineering culture.

What you'll bring

  • Experience leading production, platform engineering, or site reliability engineering teams, including guiding engineers through complex technical and operational change.
  • Strong technical background that enables you to understand distributed systems, SaaS infrastructure, and cloud capacity needs and to make informed decisions
  • Background running and operating consumer-scale platforms in a product company environment, with a focus on availability, security, and scalability.
  • Experience participating in and navigating incident response, collaborating across teams to resolve outages and improve reliability practices.
  • Demonstrated ability to build, develop, and coach engineering teams, including supporting individual contributors in growing into technical leaders.
  • Effective cross-functional collaboration skills, working closely with Product Management, Infrastructure, Support, and Customer Success on shared outcomes.
  • Clear and adaptable communication style, with the ability to explain complex systems to both technical and non-technical audiences in an all-remote, fully distributed context.
  • Openness to candidates with diverse backgrounds and transferable experience in related infrastructure, reliability, or platform leadership roles.

About the team

The Cloud Capacity team sits within the Infrastructure Platforms department, which ensures GitLab operates, delivers, and scales efficiently across GitLab.com, GitLab Dedicated, and self-managed customers. The team operates and evolves our multi-tenant SaaS infrastructure to provide a reliable, secure, and scalable experience for customers. You'll work with a distributed group of production, platform, and reliability engineers who collaborate asynchronously across regions. You'll also partner closely with other Infrastructure teams, Support, and Customer Success Management to improve availability, security, scalability, and the overall customer experience across all production environments.

GitLab Inc. is a company based on the GitLab open-source project, helping developers collaborate on code to build great things and ship on time. We are an active participant in our global community of customers and contributors, trying to serve their needs and lead by example. We have one vision: everyone can contribute to all digital content, and our mission is to change all creative work from read-only to read-write.

IT
Amsterdam
1,000 employees