HubSpot Logo

HubSpot

Director, Reliability Engineering

Reposted 7 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Ireland
Senior level
Remote
Hiring Remotely in Ireland
Senior level
Lead the Reliability Engineering team at HubSpot, driving AI-integrated operations, managing incidents, and enhancing reliability strategies for a scalable platform.
The summary above was generated by AI

POS-31619

Director, Reliability Engineering

Role Summary

Our mission at HubSpot is to help millions of organizations grow better. HubSpot’s engineering organization has grown to more than 2,000 engineers shipping across thousands of services and deploying thousands of times per day. As HubSpot has become core infrastructure for over 200,000 customers worldwide, reliability isn’t just a priority — it’s foundational to customer trust and business growth.

Our Reliability Engineering team has matured from an early SRE function into a strategic pillar within Platform Infrastructure. The team has driven a 76% reduction in critical incidents while the platform scaled 19x in deployables, established company-wide SLO frameworks, and built the incident management practices that keep HubSpot running.

Now we’re entering the next phase: leveraging AI and agentic approaches to fundamentally transform how we detect, respond to, and prevent outages. As Director of Reliability Engineering, you’ll lead this evolution — deepening our reliability capabilities, pioneering AI-assisted operations, and ensuring HubSpot remains a platform customers can confidently bet their business on.

What You’ll Do

Lead and Develop the Team

  • Lead a team of ~20 reliability engineers, fostering a culture of operational excellence, continuous learning, and customer obsession
  • Attract, develop, and retain top talent; build career paths that keep engineers engaged and growing

Own Reliability Strategy

  • Define and drive HubSpot's reliability roadmap, balancing proactive resilience investments with reactive incident reduction
  • Partner with Infrastructure leadership to prioritize reliability initiatives alongside cost, performance, and platform evolution
  • Set and evolve SLO standards that align engineering effort with customer experience

Pioneer AI-Driven Operations

  • Lead the strategy for integrating AI and agentic approaches into incident detection, diagnosis, and mitigation-reducing time-to-resolution and human toil
  • Explore and implement AI-assisted tooling for pattern recognition across incidents, automated runbook execution, and predictive reliability insights
  • Build intelligent systems that learn from our operational history, proactively surface risks, and recommend-or execute-mitigation actions
  • Balance automation with human judgment-designing systems where AI augments engineers rather than creating blind spots

Drive Company-Wide Impact

  • Own incident management end-to-end: response coordination, executive communication during major incidents, and blameless post-incident reviews that drive systemic improvement
  • Influence engineering culture across 100+ product teams-evangelizing reliability practices without compromising team autonomy
  • Identify systemic risks across the platform and drive cross-functional mitigation efforts

Represent Reliability at the Executive Level

  • Serve as the voice of reliability in leadership forums, translating technical risk into business terms
  • Communicate transparently with customers and stakeholders during and after operational incidents
  • Partner with peer directors across Infrastructure, Product Engineering, and Security to align on shared priorities
What You’ll Bring

Required Qualifications

  • 10+ years of experience in software engineering, SRE, or infrastructure, with 5+ years leading teams
  • Track record of building and scaling reliability functions at companies with significant operational complexity
  • Deep technical fluency-you can dive into architecture discussions, incident analysis, and system design with credibility
  • Curiosity and vision for how AI/ML can transform operations; experience with or strong interest in AIOps, agentic automation, or ML-driven observability is a plus
  • Proven ability to drive cultural and process change across a large engineering organization without top-down mandates
  • Strong executive communication skills; comfortable leading incident bridges, presenting to leadership, and representing reliability externally
  • Experience with modern cloud infrastructure (AWS preferred), observability tooling, and incident management practices
  • A philosophy that balances reliability with velocity-you understand that the goal is sustainable speed, not gates

Why This Role

This is a high-visibility, high-impact leadership role at an inflection point. You'll own one of Infrastructure's four core pillars at a company where platform stability directly enables customer growth. You'll have the mandate to shape how AI transforms operational practices-not just at HubSpot, but potentially as a model for the industry. You'll have executive access, strategic influence, and the opportunity to define what modern reliability engineering looks like.


We know the confidence gap and impostor syndrome can get in the way of meeting spectacular candidates, so please don’t hesitate to apply — we’d love to hear from you.

If you need accommodations or assistance due to a disability, please reach out to us using this form.

At HubSpot, we value both flexibility and connection. Whether you’re a Remote employee or work from the Office, we want you to start your journey here by building strong connections with your team and peers. If you are joining our Engineering team, you will be required to attend a regional HubSpot office for in-person onboarding. If you join our broader Product team, you’ll also attend other in-person events, such as your Product Group Summit and other gatherings, to continue building on those connections.

If you require an accommodation due to travel limitations or other reasons, please inform your recruiter during the hiring process. We are committed to supporting candidates who may need alternative arrangements

Massachusetts Applicants: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Germany Applicants: (m/f/d) - link to HubSpot's Career Diversity page here.

India Applicants: link to HubSpot India's equal opportunity policy here.

About HubSpot

HubSpot (NYSE: HUBS) is an AI-powered customer platform with all the software, integrations, and resources customers need to connect marketing, sales, and service. HubSpot's connected platform enables businesses to grow faster by focusing on what matters most: customers. 

At HubSpot, bold is our baseline. Our employees around the globe move fast, stay customer-obsessed, and win together. Our culture is grounded in four commitments: Solve for the Customer, Be Bold, Learn Fast, Align, Adapt & Go!, and Deliver with HEART. These commitments shape how we work, lead, and grow.

We’re building a company where people can do their best work. We focus on brilliant work, not badge swipes. By combining clarity, ownership, and trust, we create space for big thinking and meaningful progress. And we know that when our employees grow, our customers do too.

Recognized globally for our award-winning culture by Comparably, Glassdoor, Fortune, and more, HubSpot is headquartered in Cambridge, MA, with employees and offices around the world.

Explore more:

  • HubSpot Careers
  • Life at HubSpot on Instagram

HubSpot may use AI to help screen or assess candidates, but all hiring decisions are always human. More information can be found here. By submitting your application, you agree that HubSpot may collect your personal data for recruiting, global organization planning, and related purposes. Refer to HubSpot's Recruiting Privacy Notice for details on data processing and your rights.

Top Skills

AI
AWS
Incident Management
Ml
Observability Tooling

Similar Jobs

2 Hours Ago
Remote or Hybrid
Dublin, IRL
Entry level
Entry level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Generate and qualify meetings via phone, email, and social outreach to build early-stage pipeline. Document interactions in ServiceNow, hand off qualified opportunities to GTM teams, support follow-up at marketing events, participate in development simulations, and meet KPI targets.
Top Skills: Ai-Native ToolsCloud ComputingSaaSServicenow
2 Hours Ago
Remote or Hybrid
Dublin, IRL
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead and execute security compliance testing and audits across cloud/SaaS environments. Maintain and enhance the control framework, support global compliance programs (ISO, PCI, SOC2, regional standards), liaise with risk teams, and support customers and certifiers through GRC tooling and compliance reporting.
Top Skills: AICloudFrench HdsGrc SystemIso 27001Iso 27018Italy Qc2PciSaaSSoc 2Spanish EnsUk Cyber Essentials Plus
2 Hours Ago
Remote or Hybrid
Dublin, IRL
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Own territory sales for ServiceNow Risk & Security solutions: generate self-sourced pipeline, build C-suite relationships, lead discovery and demos, manage full deal lifecycle, navigate regulatory requirements, orchestrate partner and internal teams, drive adoption and expansion, and represent ServiceNow at industry events.
Top Skills: Servicenow,Siem,Grc Platforms,Tprm Platforms,Apis,Ai-Powered Tools

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account