CrowdStrike

Sr. IT Monitoring Engineer/SRE (Remote)

Posted 9 Hours Ago

Be an Early Applicant

Remote or Hybrid

16 Locations

Senior level

Remote or Hybrid

16 Locations

Senior level

The role involves designing, implementing, and maintaining monitoring platforms while mentoring engineers and leading incident response efforts.

The summary above was generated by AI

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role
The CrowdStrike Information Technology team is looking for a Senior IT Monitoring Engineer/Site Reliability Engineer (SRE) to lead the design, implementation, and evolution of our enterprise monitoring and observability platforms. In this leadership role, you will architect scalable monitoring solutions, drive reliability initiatives, and serve as a technical authority for monitoring best practices. You will mentor junior team members, collaborate with cross-functional teams to establish SLOs, and play a key role in major incident management. This position requires advanced technical expertise, strategic thinking, and the ability to balance operational excellence with innovation.
What You’ll Need:

Required Skills and Qualifications
8+ years of experience with enterprise monitoring platforms and observability tools (LogicMonitor, DataDog, LogScale, Zscaler Digital Experience (ZDX), ThousandEyes)
Advanced proficiency in multiple scripting/programming languages (Python, Go, Bash)
Expert knowledge of modern monitoring ecosystems (Prometheus, Grafana, ELK)
Demonstrated experience architecting monitoring solutions at scale across hybrid environments
Strong background in SRE practices, including SLO definition, error budgets, and reliability engineering
Advanced knowledge of cloud platforms (AWS, GCP) and their native monitoring capabilities
Expertise in log aggregation, metrics and KPIs collection, and distributed tracing implementations
Experience designing and implementing automated remediation systems
Strong understanding of Infrastructure as Code and GitOps principles
Proven ability to mentor junior engineers and provide technical leadership

Shift timings- 12PM -9PM IST
What You'll Do:
Technical Leadership

Architect and implement enterprise-wide monitoring and observability solutions
Establish monitoring standards, best practices, and governance frameworks
Lead the evaluation and adoption of new monitoring technologies and approaches
Design scalable, resilient monitoring Infrastructure as Code
Serve as the technical escalation point for complex monitoring issues

Reliability Engineering

Lead the implementation of SRE practices across the organization
Partner with service owners to define appropriate SLOs and error budgets
Drive reliability improvements through data-driven analysis and recommendations
Design and implement advanced alerting strategies
Develop comprehensive observability strategies covering metrics, logs, and traces

Incident Management

Lead major incident response for critical service disruptions
Conduct thorough post-incident reviews and drive systematic improvements
Establish incident management processes and tooling improvements
Mentor team members on effective incident response techniques
Analyze incident patterns to identify and address systemic issues

Strategic Initiatives

Develop the monitoring and observability roadmap aligned with business objectives
Lead monitoring platform migrations and major upgrades
Implement cost optimization strategies for monitoring infrastructure
Drive automation initiatives to reduce toil and improve operational efficiency
Collaborate with security teams to integrate security monitoring capabilities

Team Development

Mentor junior engineers on monitoring best practices and SRE principles
Provide technical guidance and code reviews for monitoring implementations
Create documentation and knowledge-sharing materials for the broader organization
Contribute to hiring and team development activities
Foster a culture of continuous improvement and learning

Bonus Points:

Advanced certifications in cloud platforms or SRE practices
Experience leading incident response for complex, high-impact service disruptions
Experience with AIOps and ML-based monitoring approaches
Background in performance engineering or capacity management
Experience with chaos engineering and resilience testing
Bachelor's or Master's degree in Computer Science, Engineering, or related field

#LI-DP1

#LI-VJ1

#LI-Remote

Benefits of Working at CrowdStrike:

Remote-friendly and flexible work culture
Market leader in compensation and equity awards
Comprehensive physical and mental wellness programs
Competitive vacation and holidays for recharge
Paid parental and adoption leaves
Professional development opportunities for all employees regardless of level or role
Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
Vibrant office culture with world class amenities
Great Place to Work Certified™ across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at [email protected] for further assistance.

Top Skills

APIs

Bash

Gitlab Ci/Cd

Jenkins

Powershell

Python

Terraform

Similar Jobs at CrowdStrike

CrowdStrike

Engineer III, C++ Linux Kernel (Remote, IND)

10 Days Ago

Remote or Hybrid

Senior level

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity

As a Software Engineer, you will design and develop sensor platform modules, own features from design to delivery, and debug customer issues, focusing on Linux kernel and OS internals.

Top Skills: C++Driver DevelopmentLinuxOs Kernel

CrowdStrike

Senior Engineer

10 Days Ago

Remote or Hybrid

Senior level

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity

The role involves developing the IOT Discover product, utilizing technologies for data ingestion, ensuring product quality, and collaborating with teams.

Top Skills: AWSAzureCassandraDockerElastic SearchGCPGoJavaKafkaKubernetesPython

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.