Lead the design of low-latency machine learning inference services for real-time decision-making, ensuring performance, reliability, and scalability.
We’re looking for a highly skilled, independent, and driven Machine Learning Engineer to lead the design and development of our next-generation real-time inference services, the core engine powering our algorithmic decision-making at scale. This is a rare opportunity to own the system at the heart of our product, serving billions of daily requests across mobile apps under tight latency and performance constraints.
You’ll work at the intersection of machine learning, large-scale backend engineering, and business logic, building robust services that blend predictive models with dynamic engineering logic, all while meeting extreme performance and reliability requirements.
What you’ll do
- Own and lead the design and development of low-latency Algo inference services handling billions of requests per day
- Build and scale robust real-time decisioning engines, integrating ML models with business logic under strict SLAs
- Collaborate closely with Data Science (DS) to deploy models seamlessly and reliably in production
- Design systems for model versioning, shadowing, and A/B testing at runtime
- Ensure high availability, scalability, and observability of production systems
- Continuously optimize latency, throughput, and cost-efficiency using modern tooling and techniques
- Work independently while interfacing with cross-functional stakeholders from Algo, Infra, Product, Engineering, BA & Business
- B.Sc. or M.Sc. in Computer Science, Software Engineering, or a related technical discipline
- 5+ years of experience building high-performance backend or ML inference systems
- Deep expertise in Python and experience with low-latency APIs and real-time serving frameworks (e.g., FastAPI, Triton Inference Server, TorchServe, BentoML)
- Experience with scalable service architecture, message queues (Kafka, Pub/Sub), and async processing
- Strong understanding of model deployment practices, online/offline feature parity, and real-time monitoring
- Experience in cloud environments (AWS, GCP, or OCI) and container orchestration (Kubernetes)
- Experience working with in-memory and NoSQL databases (e.g., Aerospike, Redis, Bigtable) to support ultra-fast data access in production-grade ML services
- Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) and best practices for alerting and diagnostics
- A strong sense of ownership and the ability to drive solutions end-to-end
- Passion for performance, clean architecture, and impactful systems
- Lead the mission-critical inference engine that drives our core product
- Join a high-caliber Algo group solving real-time, large-scale, high-stakes problems
- Work on systems where every millisecond matters and every decision drives real value
- Enjoy a fast-paced, collaborative, and empowered culture with full ownership of your domain
Top Skills
Aerospike
AWS
BentoML
Bigtable
FastAPI
GCP
Grafana
Kafka
Kubernetes
OCI
OpenTelemetry
Prometheus
Pub/Sub
Python
Redis
TorchServe
Triton Inference Server