Crusoe is a company that accelerates the abundance of energy and intelligence. They are looking for a Staff Site Reliability Engineer, Compute to support their AI-first cloud infrastructure.
Requirements
- 8+ years of professional experience in Compute SRE, Linux system engineering, or compute infrastructure roles.
- Strong proficiency in Linux kernel internals, with exposure to scheduler, memory allocation, and driver subsystems.
- Experience with virtualization architectures and technologies such as KVM, Xen, QEMU, or VMware.
- Familiarity with SmartNICs/DPUs and kernel bypass techniques.
- Expert-level skills in at least one programming language: Go, C or Rust.
- Experience with system-level debugging, including kdump, kexec, and kernel panic analysis.
- Proficiency in Infrastructure as Code tooling and CI/CD practices for bare-metal or cloud infrastructure.
- Strong understanding of compute scheduling, resource management, and high-throughput networking.
Benefits
- Industry competitive pay
- Restricted Stock Units in a fast growing, well-funded technology company
- Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
- Employer contributions to HSA accounts
- Paid Parental Leave
- Paid life insurance, short-term and long-term disability
- Teladoc
- 401(k) with a 100% match up to 4% of salary
- Generous paid time off and holiday schedule
- Cell phone reimbursement
- Tuition reimbursement
- Subscription to the Calm app
- MetLife Legal
- Company paid commuter benefit; $300 per pay period