Salesforce Logo

Salesforce

Software Engineering LMTS

Posted 4 Hours Ago
Be an Early Applicant
In-Office
Dublin
Senior level
In-Office
Dublin
Senior level
As a Staff Site Reliability Engineer, you'll enhance reliability and performance in distributed systems, leading automation and CI/CD improvements, collaborating with teams on architecture and incident resolution, while improving operational excellence.
The summary above was generated by AI

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Software Engineering

Job Details

About Salesforce

We’re Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you’ve come to the right place.

We're looking for a Staff Site Reliability Engineer to make a significant impact on our large-scale distributed systems. If you're an experienced and passionate individual who thrives in a challenging environment and possesses a strong background in software engineering best practices, automation, and cloud technologies, we encourage you to apply. You'll be instrumental in driving the reliability, scalability, and performance of our critical services.

Responsibilities

  • Support and scale multi-cloud, multi-region services.
  • Build automation and self-healing capabilities to reduce manual operations.
  • Operate and scale monitoring, alerting, and tracing systems for proactive detection.
  • Improve CI/CD practices to accelerate safe, frequent deployments.
  • Define and implement SLIs/SLOs with engineering teams, driving reliability into system architecture.
  • Collaborate on integrating AI-driven automation and observability to enhance reliability.
  • Work within Agile teams, participating in SCRUM ceremonies and iterative delivery.
  • Lead post incident analysis, conduct postmortems, and ensure effective root cause resolution.
  • Use data to uncover trends, inform prioritization, and drive platform improvements.

Required Skills

  • 7+ years of experience in Python, Go, or Java for automation, tooling, and integration.
  • Hands-on experience designing, building and operating large scale distributed systems, identifying shortcomings and optimization opportunities
  • Demonstrated experience in developing and deploying production-grade software applications or services.
  • Proven ability to contribute directly to application codebase improvements for reliability and scalability.
  • Strong understanding of software engineering best practices, including design patterns, testing methodologies, and code reviews, applied in a production environment.
  • Excellent knowledge of Internet technologies and protocols (TCP/IP, DNS, HTTP, SSL, etc.)
  • Ability to locate and address sources of instability in high-traffic, large-scale distributed systems
  • Strong experience with API fundamentals (SOAP, REST)
  • Experience in Public Cloud environments, Kubernetes and modern container orchestration.
  • Knowledge of microservices, service mesh, and zero-trust infrastructure.
  • Solid knowledge of large-scale complex systems from a reliability and availability perspective
  • Hands-on with experience with large scale SDLC pipelines.
  • Strong Linux systems knowledge and troubleshooting skills.
  • Experience in fault modeling and tolerance, chaos engineering, performance and load testing.

Desired Skills

  • Experience operating in global, multi-tenant, or compliance-sensitive environments.
  • Understanding of SRE principles: SLIs/SLOs, availability, resiliency, and incident metrics (TTD, TTR).
  • Data-driven mindset for identifying systemic issues and improving service reliability.
  • Design and Implementation of Observability Solutions
  • Strong written and verbal communication, with emphasis on documentation and knowledge sharing.
  • Experience building and  integrating AI-driven automation and observability to enhance reliability

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement

Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that’s inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.

Top Skills

APIs
Automation
Ci/Cd
Cloud Technologies
Go
Internet Technologies
Java
Kubernetes
Microservices
Python
Sdlc
Service Mesh

Similar Jobs

12 Hours Ago
Remote or Hybrid
Dublin, IRL
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Staff System Engineer will resolve complex technical issues, enhance system design, drive automation, and ensure customer satisfaction through proactive communication and support.
Top Skills: AzureCi/CdCloud TechnologiesDevOpsJavaScriptLinuxMySQLPythonRubyServicenow
Yesterday
Easy Apply
Hybrid
Dublin, IRL
Easy Apply
Senior level
Senior level
AdTech • Big Data • Digital Media • Marketing Tech
The Senior Data Engineer will design and build data pipelines, work on data analysis systems, and mentor junior engineers while ensuring best practices are followed.
Top Skills: AWSJavaPythonRedshiftSnowflakeSparkSQLVertica
Yesterday
Easy Apply
Hybrid
10 Locations
Easy Apply
Senior level
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will develop and maintain CI/CD infrastructure, enhance application agility, and support deployment systems while collaborating with engineering teams and participating in on-call rotations.
Top Skills: AWSAzureGoGCPKubernetesPython

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account