CoreWeave is seeking a Senior Director, Fleet Reliability Operations to lead the evolution of their global GPU server fleet and manage a high-performing management team operating globally.
Requirements
- 10+ years of experience in infrastructure or platform engineering, development, SRE or DevOps
- 5+ years in leadership roles managing mission-critical global production environments
- Deep technical understanding of data center operations, fleet provisioning, lifecycle management, and observability tooling
- Fluent in automation workflows, monitoring solutions, and scalable fleet management systems
Benefits
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid Life Insurance
- Voluntary supplemental life insurance
- Short and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to Participate in Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption