We're looking for a senior engineer to design and build the load balancer that will sit at the very front of our research inference stack, routing the world’s largest AI models with millisecond precision and bulletproof reliability.
Requirements
- 5+ years of experience, in both theory and practice, designing for and debugging the algorithmic and systems challenges of consistent hashing, sticky routing, and low-latency connection management
- 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability infrastructure
- Strong debugging mindset and experience with tracing, logs, and metrics to untangle distributed failures
- Experience with gateway or load balancing systems (e.g., Envoy, gRPC, custom LB implementations)
- Familiarity with inference workloads (e.g., reinforcement learning, streaming inference, KV cache management)
- Exposure to debugging and operational excellence practices in large production environments
Benefits
- Generous Paid Time Off
- 401(k) Matching
- Retirement Plan
- Visa Sponsorship
- Four Day Work Week
- Generous Parental Leave
- Tuition Reimbursement
- Relocation Assistance