Fluidstack Logo

Fluidstack

Infrastructure Engineer (Compute)

Posted 9 Days Ago
Remote
29 Locations
Senior level
Remote
29 Locations
Senior level
Design, deploy, and manage computing infrastructure for GPU supercomputers, ensuring performance, scalability, and reliability. Collaborate with teams and automate lifecycle tasks.
The summary above was generated by AI
About FluidStack

Fluidstack is the AI Cloud Platform. We build GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more.

Our team is small, highly motivated, and focused on providing a world class supercomputing experience. We put our customers first in everything we do, working hard to not just win the sale, but to win repeated business and customer referrals.

We hold ourselves and each other to high standards. We expect you to care deeply about the work you do, the products you build, and the experience our customers have in every interaction with us.

You must work hard, take ownership from inception to delivery, and approach every problem with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset.

About the Role

We are looking for an Infrastructure Engineer (Compute) to design, deploy, and manage the compute infrastructure powering Fluidstack's GPU clusters. You will be responsible for ensuring the performance, scalability, and reliability of our compute resources, working closely with hardware and software teams to support our AI workloads.

Focus
  • Design and implement GPU/ASIC infrastructure at the server, rack, and system level.

  • Troubleshoot complex GPU and compute system related failures.

  • Develop and maintain hardware/firmware management services.

  • Automate all aspects of the server lifecycle.

  • Own end-to-end compute lifecycle, including partnering with vendors on RMAs.

  • Serve as the main point of contact for hardware escalation and troubleshooting.

  • Monitor system performance, identifying and resolving bottlenecks.

  • Automate deployment and management tasks to improve efficiency.

  • Collaborate with storage and network teams to ensure cohesive infrastructure operations.

About You
  • 5+ years of experience in compute infrastructure engineering.

  • Strong knowledge of Linux systems administration and performance tuning.

  • Experience with bare metal provisioning tools (MaaS, Metal3, Tinkerbell, or other).

  • Familiarity with GPU hardware and workload optimization, especially kernel and driver level requirements.

  • Proficiency in automation tools (e.g., Ansible, Terraform).

  • Experience operating Kubernetes and SLURM clusters.

Benefits
  • Competitive total compensation package (salary + equity).

  • Retirement or pension plan, in line with local norms.

  • Health, dental, and vision insurance.

  • Generous PTO policy, in line with local norms.

  • Fluidstack is remote first, but has offices in key hubs. For all other locations, we provide access to WeWork.

Top Skills

Ansible
Linux
Maas
Metal3
Terraform
Tinkerbell
Warewulf

Similar Jobs

53 Minutes Ago
Easy Apply
Remote
Hybrid
28 Locations
Easy Apply
Senior level
Senior level
Information Technology • Productivity • Professional Services • Software
Develop and maintain software applications on the ServiceNow platform. Integrate with third-party services, troubleshoot issues, and support implementations.
Top Skills: AWSAzureGCPGitJavaScriptJenkinsServicenow
57 Minutes Ago
Easy Apply
Remote
28 Locations
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Machine Learning • Natural Language Processing • Software
As a Senior Linguistic Engineer, you will enhance translation models, perform data analysis, and collaborate with cross-functional teams to innovate translation quality at Smartling.
Top Skills: AthenaPythonSagemakerSparkSQL
57 Minutes Ago
Easy Apply
Remote
Hybrid
28 Locations
Easy Apply
Expert/Leader
Expert/Leader
Big Data • Cloud • Software • Database
Drive MongoDB adoption among strategic customers, engage senior technical leaders, develop technical expertise, and contribute to thought leadership.
Top Skills: C#JavaMongoDBPythonRdbms

What you need to know about the Dublin Tech Scene

From Bono and Oscar Wilde to today's tech leaders, Dublin has always attracted trailblazers, with more than 70,000 people working in the city's expanding digital sector. Continuing its legacy of drawing pioneers, the city is advancing rapidly. Ireland is now ranked as one of the top tech clusters in the region and the number one destination for digital companies, with the highest hiring intention of any region across all sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account