DuckDuckGo

Senior Site Reliability Engineer

Posted Yesterday
Remote
30 Locations
Senior level
As a Senior Site Reliability Engineer, you'll build and maintain infrastructure, tackle operational challenges, and automate processes to enhance reliability.
The summary above was generated by AI

Who We Are

Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, annual revenue now exceeds $100m USD and millions use our browser on Mac, Windows, iOS, and Android, our search engine, and the latest — Privacy Pro. Our culture of trust, inclusivity, and empowered project management underpins everything we do, where each team member takes full ownership of their projects, from scoping and execution to postmortem. If you're seeking end-to-end ownership of your work — you've come to the right place!

Your Team and Role

Working on the Site Reliability Team, you'll help build and maintain world-class infrastructure to meet the needs of millions of users protecting their privacy online. You'll utilize high-level languages like Perl, Go, or Python and work on related projects. Recent projects include:

  • Preparing Duck.ai image uploads for production

  • Reducing the user impact of instances serving errors

As a Site Reliability Engineer, you'll dive deep into complex operational challenges, including software, systems, automation, and process analysis. We are looking for candidates who can read, write, troubleshoot, and deploy all types of software to help us tackle the reliability challenges of large-scale deployments.

About You

  • 7+ years relevant professional experience in reliability, platform, infrastructure, or software engineering.

  • Experience participating in a 24x7 on-call rotation for a large-scale deployment. 

  • Ability to lead and collaborate on high-impact and complex projects from proposal through post-mortem.  

  • Skills to wrangle vague problems, propose innovative solutions, and execute them with a strong focus on metrics. 

  • Experience developing effective tools, services, alerts, and responses to identify and address reliability risks.

  • Investigative ability to root-cause sources of instability in high-traffic, distributed systems.

  • Deep experience administering and troubleshooting Linux and web technologies.

  • Ability to implement automation around infrastructure provisioning and configuration management to prioritize efficiency, scalability and reliability.

  • Foresight to help identify the future technical direction of our deployment with an effort to improve reliability and performance.

  • Advanced programming skills enabling close partnership with software engineers to triage production issues and identify appropriate remediation, including code changes and performance considerations.

  • Ability to leverage cloud-native services and architectures to enhance reliability and scalability, with hands-on experience packaging and deploying applications using Docker and Docker Compose. 

Compensation

$178,500 USD annually and stock options. Compensation is identical within professional levels, regardless of geographic location or team. Compensation for each professional level is transparent across the organization. Our Team Member Support Guide explains how we prioritize your wellbeing, including paid parental leave, office setup, and co-working allowances.

Hiring Process

Hiring works best when it's a two-way street. Learn how we help you get to know DuckDuckGo, envision your future role here, and find out more about how we hire.

Diversity, Equity, and Inclusion

DuckDuckGo provides equal work opportunities to all team members and applicants, prohibiting discrimination and harassment of any type based on race, color, ethnicity, caste, religion, age, sex (including pregnancy), national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by our policies or laws.

We want to ensure that our hiring process is accessible. If you need reasonable accommodation for any part of the application process due to a medical condition or disability, please email [email protected] to let us know the nature of your request.

Please Note:

  • You’ll be required to attend meetings on camera via video conferencing.

  • Expect to travel at least twice a year: once for our all-hands meetup and again for a team retreat (each around 4-5 days). While extenuating circumstances may impact attendance, everyone is strongly encouraged to attend.

  • While we offer a flexible work arrangement with no core hours, expect an average full-time commitment of 40 hours per week.

  • A successful candidate must pass a background check as a condition of joining the team.

  • By applying for this role, you confirm that all information submitted is accurate and complete. Providing false or fraudulent information during the application process may result in denial of an offer, revocation of any existing offer, or other adverse actions, including termination after starting work.

Top Skills

Docker
Docker Compose
Go
Linux
Perl
Python
