Invert

Senior Site Reliability Engineer

Reposted 21 Days Ago

Remote

30 Locations

Senior level

Remote

30 Locations

Senior level

The Senior Site Reliability Engineer will design and maintain scalable cloud infrastructure, enforce reliability metrics, optimize spending, and lead incident management efforts while enhancing developer workflows and CI/CD processes.

The summary above was generated by AI

The company

At Invert, we are on a mission to dramatically reduce the dollar and time cost of using biology to manufacture ~everything. Our customers use bioprocessing to do things like: produce new therapies to combat disease, create new biomaterials to solve the environmental crisis, and manufacture essential chemicals cleanly. We provide them with tools to automate the design, execution, and analysis of all that amazing work!

The Invert team is comprised of creative and talented engineers, data scientists, biologists, and more, and we are supported by amazing investors. We value diversity and welcome individuals from all backgrounds to join our remote-first, collaborative environment.

The team

You will be joining our Site Reliability Engineering team, a critical part of our Engineering organization. Our SRE team is at the heart of ensuring our software's reliability, performance, and seamless delivery from code to customer.

Key Responsibilities

Infrastructure and Reliability

Design, build, and maintain scalable and secure cloud infrastructure as code
Develop and enforce Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to ensure software reliability
Enable cost transparency and optimize infrastructure spending

Developer Experience and Productivity

Reduce cognitive load for product engineers by creating streamlined, efficient development workflows
Build and maintain robust CI/CD pipelines that accelerate time from code to customer
Create and maintain intuitive, comprehensive observability solutions for end-to-end system monitoring

Incident Management and On-Call

Lead and continuously improve our Incident Management process
Participate in the on-call rotation, serving as a First Responder to quickly address and resolve system issues
Develop and maintain incident response playbooks and post-mortem practices

The role

You will work closely with

Our Software Engineers
Our Product, CX, Growth, and Sales teams
Our CTO office

Competencies:

Adaptable: Resilient in the face of changing priorities
Ambitious: Intrinsically motivated, driven to succeed
Communicates effectively: Ensures that the right information gets to the right people at the right time
Mentors effectively: Educates and empowers others
Takes ownership: Takes accountability, prioritizes team success
Technically skilled: Experienced in the relevant tech stack
Technically productive: Prioritizes velocity while maintaining sufficient quality
Trustworthy: Acts in the company’s best interests

The package

High-growth startup with impactful work
Fully remote, distributed across US and European timezones
Competitive salary, equity, and benefits
New laptop, monitor, and accessories of your choice
Frequent team offsites
Unlimited PTO

The interview process

The interview process consists of the four stages described below. Candidates are assessed between each of these stages. The hiring manager is responsible for communicating decisions and next steps throughout the process. We aim to complete all stages within two weeks.

Discovery: A 30-minute conversation with the hiring manager to determine whether there is mutual interest in moving forward.
Non-Technical Competencies: Two 60-minute interviews with two different employees to assess non-technical competencies.
Technical Competencies: A 90-minute working session with two employees to assess technical competencies.
References and Founder Chat: Three 15-minute conversations between the hiring manager and previous colleagues to gather external input. Simultaneously, a 30-minute meet-and-greet with one or both of the founders (depending on whether they have already participated in previous interviews).

Top Skills

Ci/Cd

Cloud Infrastructure

Observability

Similar Jobs

GitLab

Senior Site Reliability Engineer, Runway

Yesterday

Easy Apply

Remote

Easy Apply

Senior level

Cloud • Security • Software • Cybersecurity • Automation

The Senior Site Reliability Engineer will design and maintain infrastructure on GCP and AWS, automate operations, lead incident responses, and ensure system reliability and scalability.

Top Skills: AWSGCPGoGrafanaHashicorp VaultIstioKubernetesLinkerdOpenbaoPrometheusPulumiTerraform

P2P.org

Senior SRE (Data team)

11 Days Ago

Remote

Senior level

Information Technology

Ensure the reliability of data platforms, improve service delivery pipelines, troubleshoot issues, and increase observability within a data engineering team.

Top Skills: AirflowArgocdClickhouseGCPHc VaultIstioKafkaKubernetesLokiSupersetVictoria Metrics

Auros

Senior Site Reliability Engineer, EU, UK or Americas

15 Days Ago

Remote

Senior level

Marketing Tech • Cryptocurrency

Responsible for maintaining high-performance trading infrastructure, enhancing security and resilience, collaborating on systems layout, and automating tools for efficiency.

Top Skills: AWSAzureBashCi/CdDockerEbpfGoGCPJavaScriptKubernetesLinuxOpentelemetryPrometheusPythonTypescript

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.