Invert Logo

Invert

Senior Site Reliability Engineer

Posted 8 Days Ago
Remote
30 Locations
Senior level
Remote
30 Locations
Senior level
The Senior Site Reliability Engineer will design and maintain scalable cloud infrastructure, enforce reliability metrics, optimize spending, and lead incident management efforts while enhancing developer workflows and CI/CD processes.
The summary above was generated by AI

The company

At Invert, we are on a mission to dramatically reduce the dollar and time cost of using biology to manufacture ~everything. Our customers use bioprocessing to do things like: produce new therapies to combat disease, create new biomaterials to solve the environmental crisis, and manufacture essential chemicals cleanly. We provide them with tools to automate the design, execution, and analysis of all that amazing work!

The Invert team is comprised of creative and talented engineers, data scientists, biologists, and more, and we are supported by amazing investors. We value diversity and welcome individuals from all backgrounds to join our remote-first, collaborative environment.

The team

You will be joining our Site Reliability Engineering team, a critical part of our Engineering organization. Our SRE team is at the heart of ensuring our software's reliability, performance, and seamless delivery from code to customer.

Key Responsibilities

Infrastructure and Reliability

  • Design, build, and maintain scalable and secure cloud infrastructure as code

  • Develop and enforce Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to ensure software reliability

  • Enable cost transparency and optimize infrastructure spending

Developer Experience and Productivity

  • Reduce cognitive load for product engineers by creating streamlined, efficient development workflows

  • Build and maintain robust CI/CD pipelines that accelerate time from code to customer

  • Create and maintain intuitive, comprehensive observability solutions for end-to-end system monitoring

Incident Management and On-Call

  • Lead and continuously improve our Incident Management process

  • Participate in the on-call rotation, serving as a First Responder to quickly address and resolve system issues

  • Develop and maintain incident response playbooks and post-mortem practices

The role

You will work closely with

  • Our Software Engineers

  • Our Product, CX, Growth, and Sales teams

  • Our CTO office

Competencies:

  • Adaptable: Resilient in the face of changing priorities

  • Ambitious: Intrinsically motivated, driven to succeed

  • Communicates effectively: Ensures that the right information gets to the right people at the right time

  • Mentors effectively: Educates and empowers others

  • Takes ownership: Takes accountability, prioritizes team success

  • Technically skilled: Experienced in the relevant tech stack

  • Technically productive: Prioritizes velocity while maintaining sufficient quality

  • Trustworthy: Acts in the company’s best interests

The package

  • High-growth startup with impactful work

  • Fully remote, distributed across US and European timezones

  • Competitive salary, equity, and benefits

  • New laptop, monitor, and accessories of your choice

  • Frequent team offsites

  • Unlimited PTO

The interview process

The interview process consists of the four stages described below. Candidates are assessed between each of these stages. The hiring manager is responsible for communicating decisions and next steps throughout the process. We aim to complete all stages within two weeks.

  1. Discovery: A 30-minute conversation with the hiring manager to determine whether there is mutual interest in moving forward.

  2. Non-Technical Competencies: Two 60-minute interviews with two different employees to assess non-technical competencies.

  3. Technical Competencies: A 90-minute working session with two employees to assess technical competencies.

  4. References and Founder Chat: Three 15-minute conversations between the hiring manager and previous colleagues to gather external input. Simultaneously, a 30-minute meet-and-greet with one or both of the founders (depending on whether they have already participated in previous interviews).

Top Skills

Ci/Cd
Cloud Infrastructure
Observability

Similar Jobs

9 Days Ago
Remote
42 Locations
Senior level
Senior level
Software • Analytics • Financial Services • Cryptocurrency
The Senior Site Reliability Engineer will manage infrastructure, optimize performance, and ensure reliability for Dune's systems, working collaboratively across teams.
Top Skills: AnsibleBashGoKubernetesNomadPythonTerraform
11 Days Ago
Remote
28 Locations
Senior level
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you will design and maintain scalable infrastructure, automate processes, manage production systems, and enhance platform reliability, leveraging extensive experience in cloud and backend systems.
Top Skills: BashGoGoogle Cloud PlatformMySQLTerraformUnix
11 Days Ago
Remote
29 Locations
Senior level
Senior level
Artificial Intelligence • Information Technology • Consulting
As a Senior Site Reliability Engineer, you will ensure service reliability, implement CI/CD processes, and work with cutting-edge cloud technologies to solve infrastructure challenges.
Top Skills: AnsibleC++DockerGoHelmKubernetesPythonSaltTerraformUnix

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account