Senior Site Reliability Engineer

Sorry, this job was removed at 01:20 p.m. (GMT) on Thursday, Nov 21, 2024
Remote
Internship
Information Technology
The Role

About Airalo

Alo! Airalo is the world’s first eSIM store that helps people connect in over 200+ countries and regions across the globe. We are building the next digital service that revolutionizes the telecom industry. We are a travel-tech company and an equal-opportunity environment that values and executes diversity, inclusion, and equity. Our team is spread across 50+ countries and six continents. What glues us together is our commitment to changing the way you connect.


About you

We hope that you care deeply about the quality of your work, the intrinsic worth of tasks, and the success of your team. You are self-disciplined and do not require micromanagement in terms of your skillset and work ethic. You do your best to flourish as an individual every day while working hard to foster a collaborative team environment. You believe in the importance of being — and staying — authentic, honest, positive, and kind. You are a good interlocutor with clear and concise communication. You are able to manage multiple projects, have an analytical mind, pay keen attention to detail, and love to get your hands dirty. You are cognizant, tolerant, and welcoming of vulnerabilities and cultural differences.


About the Role

Position: Full-time / Employee

Location: Remote-first

Benefits: Health Insurance, work-from-anywhere stipend, annual wellness & learning credits, annual all-expenses-paid company retreat in a gorgeous destination & other benefits


We are looking for an experienced Site Reliability Engineer to join our growing engineering team.We are a company that values SRE principles and practices. We believe in empowering our SREs to make data-driven decisions, automate operational tasks, and continuously improve the reliability of our systems. We foster a blameless culture where everyone is encouraged to learn from mistakes and share knowledge. If you are passionate about building and maintaining highly reliable systems, we would love to hear from you!

Responsibilities include, but are not limited to:

  • Develop and maintain reliable, scalable, and efficient systems.
  • Define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure and improve system reliability.
  • Conduct blameless post-incident reviews to identify root causes and implement preventive measures
  • Drive automation of operational tasks and incident response.
  • Develop and maintain runbooks and playbooks for common operational tasks and incident response.
  • Mitigate operational risks.
  • Work with software engineers to design systems for reliability, scalability, and maintainability.
  • Continuously evaluate and optimize system performance, capacity, and cost.
  • Participate in on-call rotation and be available to troubleshoot and resolve critical issues.

Must-haves:

  • Bachelor’s degree in Computer Engineering or a similar discipline.
  • 5+ years of experience as a Site Reliability Engineer or in a similar role.
  • 3+ years of experience with AWS services including strong knowledge of container orchestration.
  • 2+ years of Kubernetes experience
  • Deep understanding of observability principles and tools (logging, monitoring, tracing).
  • Experience with incident management and postmortem analysis.
  • Experience and interest in infrastructure as a code approach (Terraform).
  • Experience with chaos engineering and other techniques for testing system resilience.
  • Experience with CI/CD tools such as GitHub Actions.
  • Proficiency in at least one programming language (Python, Go, Java, etc.) for automation and tooling.
  • Comfortable with messaging systems (SNS, SQS, etc)
  • Ability to work independently and collaboratively in a fast-paced environment.
  • Team player and open to new ideas.
  • Good communication skills and fluency in English.

Good to haves:

  • Prior experience with Scrum and other agile methods.
  • Certification in relevant areas such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or similar.
  • Experience with AI-driven SRE tools for anomaly detection and improvements
  • Contributions to open-source SRE projects or communities.
  • Prior work experience in telecommunications.
  • Knowledge of eSIM and GSMA related technologies and services.

If you are interested in this position, please apply via the link.


Please note that our Engineering team works in the CET timezone, so candidates will need to reside in countries with the same time zone or similar to it and will need to already have permit to work in the country where they are based.


We sincerely thank all applicants in advance for submitting their interest in this opportunity. Airalo is an equal opportunity employer and values diversity, equity & inclusion. We do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We are committed to providing reasonable accommodations upon request for individuals with disabilities throughout our job interview process.

The Company
HQ: Delaware, Delaware
179 Employees
On-site Workplace
Year Founded: 2019

What We Do

Bringing you pain-free connectivity while you travel.

As travelers ourselves, we’ve faced the painful situations of not finding Wi-Fi, losing the SIM card you’ve carefully taped to the back of your phone, and the horror of coming home to an unexpected roaming bill.

We believe that in today’s modern world, connectivity and freedom should be accessible to all. Airalo is here to take away the pain and stress of researching and seeking out the best roaming deal. We’re here to let everyone stay connected globally while keeping it simple and pain-free.

Airalo is the world’s first eSIM store for travelers to access over 200+ eSIMs at the most affordable, local rates from around the world, all via eSIM-compatible smartphone, tablet, or PC. Airalo offers you both connectivity and freedom - you’ll never have to carry multiple SIM cards or change your number again, no matter where you are in the world.

Jobs at Similar Companies

Scythe Robotics Logo Scythe Robotics

Temp Assembly & Test Technician

Artificial Intelligence • Computer Vision • Hardware • Machine Learning • Robotics • Sales • Social Impact
Easy Apply
Longmont, CO, USA
105 Employees

Scythe Robotics Logo Scythe Robotics

Sales Specialist

Artificial Intelligence • Computer Vision • Hardware • Machine Learning • Robotics • Sales • Social Impact
Easy Apply
Remote
4 Locations
105 Employees

Scythe Robotics Logo Scythe Robotics

Sales Respresentative

Artificial Intelligence • Computer Vision • Hardware • Machine Learning • Robotics • Sales • Social Impact
Easy Apply
Remote
3 Locations
105 Employees

Square Logo Square

Engineering Manager, Square Banking

eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Remote
8 Locations
12000 Employees

Similar Companies Hiring

Qualtrics Thumbnail
Software • Natural Language Processing • Information Technology • Generative AI • Business Intelligence • Artificial Intelligence
Provo, UT
5000 Employees
Take-Two Interactive Software Thumbnail
Software • Mobile • Information Technology • Gaming
New York, NY
6500 Employees
Consensus Cloud Solutions Thumbnail
Software • Information Technology • Healthtech • Cloud • Business Intelligence • Artificial Intelligence
Los Angeles, CA
398 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account