Scale AI is pioneering the next era of enterprise AI, and we're looking for a Backend Engineer to help bring large-scale GenAI systems to production. This is a rare opportunity to be at the center of the GenAI revolution, solving hard backend and infrastructure challenges that make AI truly work at enterprise scale.
Responsibilities
- Design, build, and scale backend systems that power enterprise GenAI products, focusing on reliability, performance, and deployment across both Scale’s and customers’ infrastructure.
- Develop core services and APIs that integrate AI models and enterprise data sources securely and efficiently, enabling production-scale AI adoption.
- Architect scalable distributed systems for data processing, inference, and orchestration of large-scale GenAI workloads.
- Optimize backend performance for latency, throughput, and cost so that AI applications can operate at enterprise scale across hybrid and multi-cloud environments.
- Manage and evolve cloud infrastructure (AWS, Azure, or GCP), driving automation, observability, and security for large-scale AI deployments.
- Collaborate with ML and product teams to bring cutting-edge GenAI models into production through efficient APIs, model serving systems, and evaluation frameworks.
- Continuously improve reliability and scalability, applying strong engineering practices to make AI systems robust, maintainable, and enterprise-ready.
Benefits
- Generous Paid Time Off
- 401(k) Matching
- Retirement Plan
- Visa Sponsorship
- Four Day Work Week
- Generous Parental Leave
- Tuition Reimbursement
- Relocation Assistance