C3 AI is looking for a Site Reliability Engineer to join its team in London, with responsibilities including maximizing system uptime, establishing monitoring and alerting, and working cross-functionally with Services and Engineering teams.
Requirements
- Experience deploying, managing, and operating scalable and fault-tolerant Linux/Kubernetes/JVM-based infrastructure in AWS, GCP, and other public clouds
- Expertise in Linux Operating Systems, Networking, and Database concepts
- Experience with Cassandra (or another NoSQL alternative)
- Expertise in cloud providers, such as Amazon Web Services, Azure, and GCP
- Experience with configuration management systems such as Ansible or Puppet
- Experience using Ruby or Python to automate and monitor systems
- Excellent problem-solving, critical thinking, and communication skills
- Experience supporting commercial SaaS solutions as a DevOps engineer or system administrator
- BS or MS in Computer Science, related field, or equivalent professional experience
Benefits
- Excellent benefits
- Competitive compensation package