StepStone Group Logo

StepStone Group

Site Reliability Engineer

Posted 13 Days Ago
Be an Early Applicant
In-Office
Dublin, IRL
Mid level
In-Office
Dublin, IRL
Mid level
The Site Reliability Engineer designs and manages cloud infrastructure, collaborates with software teams, optimizes costs, and ensures compliance while adhering to best practices.
The summary above was generated by AI

We are global private markets specialists delivering tailored investment solutions, advisory services, and impactful, data driven insights to the world’s investors. Leveraging the power of our platform and our peerless intelligence across sectors, strategies, and geographies, we help identify the advantages and the answers our clients need to succeed.

The Site Reliability Engineer is responsible for designing, deploying and managing enterprise solutions utilizing various network, endpoint and cloud technologies. The role will provide subject matter expertise on complex cloud native technologies, topics and issues.

Responsibilities

  • Design, build, and maintain scalable cloud infrastructure on AWS, GCP, or Azure, with a focus on high availability and fault tolerance.
  • Collaborate with software engineers to embed reliability best practices into the software development lifecycle.
  • Participate in on-call rotations, lead incident response, and conduct thorough post-mortems to prevent recurrence.
  • Develop and maintain infrastructure-as-code (IaC) using tools such as Terraform, and CloudFormation
  • Optimize cloud resource utilization and cost management across multi-cloud or hybrid environments.
  • Contribute to the design and improvement of CI/CD pipelines and deployment automation.
  • Ensure cloud environments adhere to financial industry security and compliance standards.
  • Document systems, runbooks, and processes to support team knowledge sharing.

Required Qualifications

  • 3–5 years of experience in a Site Reliability Engineering, DevOps, or Cloud Infrastructure role.
  • Hands-on experience with one or more major cloud providers: AWS, GCP, or Azure.
  • Proficiency in at least one scripting or programming language (Python, Go, Bash, etc.).
  • Experience with infrastructure-as-code tools (Terraform, CloudFormation, Bicep).
  • Strong understanding of networking fundamentals (DNS, TCP/IP, load balancing, VPNs).
  • Familiarity with containerization and orchestration technologies (Docker, Kubernetes).
  • Experience with observability tools such as Datadog, Prometheus, Grafana, or equivalent.
  • Solid understanding of Linux/Unix systems administration.

Preferred Qualifications

  • Experience in the financial services industry or other highly regulated environments.
  • Familiarity with compliance frameworks such as SOC 2 or ISO 27001.
  • Cloud certifications (AWS Solutions Architect, GCP Professional Cloud Architect, Azure Administrator, etc.).

#LI-Hybrid

 

At StepStone, we believe that our people are our most important asset and crucial to our success.  We are an Equal Opportunity Employer that strives to create an environment that empowers our employees and allows them to be heard, regardless of title or tenure.  Our organizational community features multiple Employment Resource Groups as well as mentorship programs to enhance the employee experience for all.  

As an Equal Opportunity Employer, StepStone does not discriminate on the basis of race, creed, color, religion, sex, national origin, citizenship status, age, disability, marital status, sexual orientation, gender identity, gender expression, genetic information or any other characteristic protected by law.

Candidates must be at least 18 years old to apply.

Developing People at StepStone

 

Top Skills

AWS
Azure
Bash
CloudFormation
Datadog
Docker
GCP
Go
Grafana
Kubernetes
Linux
Prometheus
Python
Terraform
Unix

Similar Jobs

2 Days Ago
Easy Apply
Hybrid
Dublin, IRL
Easy Apply
Mid level
Mid level
Big Data • Cloud • Software • Database
The Site Reliability Engineer designs and builds global cloud infrastructure for MongoDB, focusing on automation, monitoring, and performance optimization. Responsibilities include infrastructure resilience, troubleshooting, and participating in on-call rotations.
Top Skills: Amazon Web ServicesDnsGoogle ComputeHTTPKubernetesLinuxAzureProgramming LanguagesTls
9 Days Ago
Hybrid
Dublin, IRL
Senior level
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Lead Site Reliability Engineer ensures platform stability, scalability, and performance, mentoring developers and managing application build operations with a focus on automation and risk management.
Top Skills: AutomationCapacity PlanningMonitoringSite Reliability Engineering
13 Days Ago
Easy Apply
Hybrid
Dublin, IRL
Easy Apply
Senior level
Senior level
Big Data • Cloud • Software • Database
The role involves developing and managing distributed storage systems, ensuring reliability, and optimizing infrastructure performance. Responsibilities include defining SLOs and participating in an on-call rotation.
Top Skills: AWSAzureGoGoogle Cloud PlatformKubernetesLinuxPython

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account