Sterling Logo

Sterling

Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in IRL
Mid level
Remote
Hiring Remotely in IRL
Mid level
The Site Reliability Engineer will ensure system reliability and performance, manage platform infrastructure, provide operational support, and optimize system performance. Responsibilities include monitoring production environments, collaborating with development teams, and driving automation for improved efficiency.
The summary above was generated by AI

What you can expect
We are looking for an SRE to join the Workvivo Infrastructure team to ensure system reliability, performance, and scalability. You will monitor production environments, build and manage platform infrastructure, and improve software reliability and time-to-market. Responsibilities include optimizing system performance, providing operational support for large-scale applications, and analyzing metrics for performance tuning.
You'll collaborate with development teams on testing and releases, participate in system design and capacity planning, and drive automation for sustainable systems. Some on-call support is required.
About the Team
Workvivo is a digital experience platform dedicated to amplifying workplace culture and fostering employee inclusion, regardless of location. Committed to customer satisfaction, Workvivo focuses on enhancing employees' working lives across diverse industries globally. As part of Zoom, Workvivo aligns with Zoom's mission to prioritize people, enabling meaningful connections, modern collaboration, and driving innovation in businesses and individual interactions.
This is a unique opportunity to make a lasting impact on our infrastructure and development pipelines, working alongside passionate teammates who value growth and innovation.

Responsibilities

  • Running the production environment by monitoring availability and taking a holistic view of system health
  • Building software and systems to manage platform infrastructure and applications
  • Improving reliability, quality, and time-to-market of our suite of software solutions
  • Measuring and optimizing system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
  • Providing primary operational support and engineering for multiple large-scale distributed software applications
  • Gathering and analyzing metrics from operating systems as well as applications to assist in performance tuning and fault finding
  • Partnering with development teams to improve services through intensive testing and release procedures
  • Participating in system design consulting, platform management, and capacity planning and creating sustainable systems and services through automation and uplifts
     

What we’re looking for

  • Have 3+ years professional SRE experience
  • Have experience with AWS cloud services such as ECS,Cloudformation, Lambda and Elasticbeanstalk
  • Have experience working with hosted web applications/SaaS
  • Have the ability to take ownership and responsibility for mission critical tasks
  • Work collaboratively in a team and participate in infrastructure and process discussions and planning
  • Have the ability to program (structured and OOP) using one or more high-level languages, such as Python, PHP, Node.JS,Go and JavaScript
  • Have a proactive approach to identifying problems, performance bottlenecks, and areas for improvement

Ways of Working
Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In-Person is indicated in the job description/posting.

Benefits
As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways. Click Learn for more information.

About Us
Zoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars.
We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind. Here, you’ll work across teams to deliver impactful projects that are changing the way people communicate and enjoy opportunities to advance your career in a diverse, inclusive environment.


Our Commitment​
We believe that the unique contributions of all Zoomies is the driver of our success. To make sure that our products and culture continue to incorporate everyone's perspectives and experience we never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. Zoom is proud to be an equal opportunity workplace and is an affirmative action employer. All your information will be kept confidential according to EEO guidelines.

We welcome people of different backgrounds, experiences, abilities and perspectives including qualified applicants with arrest and conviction records and any qualified applicants requiring reasonable accommodations in accordance with the law.

If you need assistance navigating the interview process due to a medical disability, please submit an Accommodations Request Form and someone from our team will reach out soon. This form is solely for applicants who require an accommodation due to a qualifying medical disability. Non-accommodation-related requests, such as application follow-ups or technical issues, will not be addressed.

#LI-Remote

Top Skills

Go
JavaScript
Node.js
PHP
Python

Similar Jobs

2 Days Ago
Easy Apply
Remote
28 Locations
Easy Apply
Mid level
Mid level
Cloud • Security • Software • Cybersecurity • Automation
As an Intermediate Site Reliability Engineer focused on Environment Automation, you'll automate operations across numerous GitLab environments. Your responsibilities include building deployment packages, managing infrastructure as code, deploying microservices, maintaining observability, and enhancing security measures while collaborating with engineering teams to resolve architectural issues.
Top Skills: GoRuby
2 Days Ago
Easy Apply
Remote
28 Locations
Easy Apply
Mid level
Mid level
Cloud • Security • Software • Cybersecurity • Automation
The Intermediate Site Reliability Engineer will enhance GitLab's delivery platform by automating release processes, improving monitoring, and optimizing deployment strategies. Key tasks include collaborating with Engineering teams, creating new tools, and ensuring timely and efficient software releases.
Top Skills: Kubernetes
22 Days Ago
Easy Apply
Remote
29 Locations
Easy Apply
Entry level
Entry level
Cloud • Security • Software • Cybersecurity • Automation
As an Intermediate Site Reliability Engineer in FinOps at GitLab, you'll ensure systems are scalable, reliable, and financially optimized. Your role involves automating cost management, collaborating with finance and engineering teams, and promoting FinOps principles across operations for cost optimization and financial accountability.
Top Skills: AnsibleAWSGCPTerraform

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account