The Lead Site Reliability Engineer ensures reliable operation of Mastercard's tech services, focusing on performance, scalability, incident management, and automation while mentoring team members.
Our Purpose
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Lead Site Reliability Engineer
Who is Mastercard?
At Mastercard technology, we work to connect and power an inclusive, digital economy that benefits everyone, everywhere, by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team - one that makes better decisions, drives innovation, and delivers better business results.
Technology at Mastercard
What we create today will define tomorrow. Revolutionary technologies that reshape the digital economy to be more connected and inclusive than ever before. Safer, faster, more sustainable.
And we need the best people to do it. Technologists who are energized by the challenges of a truly global network. With the talent and vision to create the critical systems and products that power global commerce and connect people everywhere to the vital goods and services they need every day.
Working at Mastercard means being part of a unique culture. Inclusive and diverse, a rich collaboration of ideas and perspectives. A place that celebrates your strengths, values your experiences, and offers you the flexibility to shape a career across disciplines and continents. And the opportunity to work alongside experts and leaders at every level of the business, improving what exists, and inventing what's next.
About the Role
The ITSM Service Operations team ensures Mastercard's technology services operate reliably, securely, and at scale. We own the enterprise lifecycle for Change, Service Events, Incident, and Problem Management-balancing rapid innovation with operational stability. As our platforms and transaction volumes continue to grow, we are focused on evolving Service Operations through stronger governance, better automation, and a relentless focus on learning and prevention to support the next phase of Mastercard's growth.
The Service Operations Tooling team is seeking a highly motivated and experienced Lead Site Reliability Engineer (SRE) to join our growing team. You will play a critical role in ensuring the reliability, scalability, and performance of our applications, supporting essential services that power Mastercard's global operations. As a thought leader in your field, you will bring technical expertise, a passion for automation, and the ability to mentor.
We support daily operations with a hyper focus on triage, root cause by understanding the business impact of our products and subsequently performing blameless post-mortems. The goal of every Business Operations team is to engage early in the development lifecycle to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments.
Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle.
Team Specific Skills:
It is not expected that any single candidate would have expertise across all these areas, but a SRE engineer will spend a bit of time throughout their career with all of these aspects of the role: • Operational Readiness Architect:
o Serve as the primary contact responsible for the overall application health, performance, and capacity
o Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
o Partner with the development and product team of a new application to establish the right monitoring and alerting strategy and create the framework to achieve zero downtime during deployment. • Site Reliability Engineering:
o Performs operability and resilience design and implements and maintains highly reliable and scalable infrastructure.
o Perform root cause analysis of incidents and collaborate with development teams to resolve issues.
o Stay up to date with the latest technologies and trends in SRE and cloud computing.
o Participate in on-call rotations and be available to respond to critical incidents.
o Complete end-to-end ownership of the product.
o Practice sustainable incident response and blameless post-mortems while taking a holistic approach to problem solving and optimizing time to recover.
o Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability. • DevOps/Automation:
o Tackle complex development, automation, and business process problems. Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation, and refinement.
o Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
o Performs operational and resilience Design and implements solutions for capacity planning and performance optimization.
o Increase automation and tooling to reduce toil and manual intervention • ITSM Practices:
o Analyses ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Lead Site Reliability Engineer
Who is Mastercard?
At Mastercard technology, we work to connect and power an inclusive, digital economy that benefits everyone, everywhere, by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team - one that makes better decisions, drives innovation, and delivers better business results.
Technology at Mastercard
What we create today will define tomorrow. Revolutionary technologies that reshape the digital economy to be more connected and inclusive than ever before. Safer, faster, more sustainable.
And we need the best people to do it. Technologists who are energized by the challenges of a truly global network. With the talent and vision to create the critical systems and products that power global commerce and connect people everywhere to the vital goods and services they need every day.
Working at Mastercard means being part of a unique culture. Inclusive and diverse, a rich collaboration of ideas and perspectives. A place that celebrates your strengths, values your experiences, and offers you the flexibility to shape a career across disciplines and continents. And the opportunity to work alongside experts and leaders at every level of the business, improving what exists, and inventing what's next.
About the Role
The ITSM Service Operations team ensures Mastercard's technology services operate reliably, securely, and at scale. We own the enterprise lifecycle for Change, Service Events, Incident, and Problem Management-balancing rapid innovation with operational stability. As our platforms and transaction volumes continue to grow, we are focused on evolving Service Operations through stronger governance, better automation, and a relentless focus on learning and prevention to support the next phase of Mastercard's growth.
The Service Operations Tooling team is seeking a highly motivated and experienced Lead Site Reliability Engineer (SRE) to join our growing team. You will play a critical role in ensuring the reliability, scalability, and performance of our applications, supporting essential services that power Mastercard's global operations. As a thought leader in your field, you will bring technical expertise, a passion for automation, and the ability to mentor.
We support daily operations with a hyper focus on triage, root cause by understanding the business impact of our products and subsequently performing blameless post-mortems. The goal of every Business Operations team is to engage early in the development lifecycle to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments.
Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle.
Team Specific Skills:
It is not expected that any single candidate would have expertise across all these areas, but a SRE engineer will spend a bit of time throughout their career with all of these aspects of the role: • Operational Readiness Architect:
o Serve as the primary contact responsible for the overall application health, performance, and capacity
o Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
o Partner with the development and product team of a new application to establish the right monitoring and alerting strategy and create the framework to achieve zero downtime during deployment. • Site Reliability Engineering:
o Performs operability and resilience design and implements and maintains highly reliable and scalable infrastructure.
o Perform root cause analysis of incidents and collaborate with development teams to resolve issues.
o Stay up to date with the latest technologies and trends in SRE and cloud computing.
o Participate in on-call rotations and be available to respond to critical incidents.
o Complete end-to-end ownership of the product.
o Practice sustainable incident response and blameless post-mortems while taking a holistic approach to problem solving and optimizing time to recover.
o Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability. • DevOps/Automation:
o Tackle complex development, automation, and business process problems. Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation, and refinement.
o Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
o Performs operational and resilience Design and implements solutions for capacity planning and performance optimization.
o Increase automation and tooling to reduce toil and manual intervention • ITSM Practices:
o Analyses ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
- Abide by Mastercard's security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach, and
- Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.
Top Skills
Automation
Ci/Cd
Cloud Computing
DevOps
Itsm
Mastercard Dublin, Dublin, IRL Office



One South County, South County Business Park, Dublin, Dublin, Ireland, D18
Similar Jobs at Mastercard
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
As a Lead Site Reliability Engineer, you will improve service lifecycles, ensure system reliability, and mentor junior team members. Responsible for incident response, system monitoring, and fostering collaboration between development and operations teams.
Top Skills:
Blaze MeterC++DynatraceGoJavaOraclePerlPythonRubySplunkSQLUnix
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Lead BizOps Engineer oversees technical authority and operational architecture for platforms, driving reliability, operational excellence, and automation in engineering with a focus on SRE and DevOps maturity.
Top Skills:
AnsibleBashChefDynatraceGitGoJenkinsPythonSplunk
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Lead the Payments Network SRE team in improving service quality of network infrastructure, ensuring performance, scalability, and reliability across various networking technologies.
Top Skills:
AnsibleAristaCheck PointCiscoDatadogDnsDynatraceHTTPLanLinuxMplsNetscoutOpentelemetrySd-WanSolarwindsTcp/IpTcpdumpTerraformTlsWanWireshark
What you need to know about the Dublin Tech Scene
From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.




