We're looking for a passionate and experienced Site Reliability Engineer to join our team and play a crucial role in ensuring our cloud platform's security, reliability, scalability, and operational excellence.
Requirements
- Assist in implementing and operating microservices on cloud-based Kubernetes platforms.
- Collaborate with the Cloud Technical Development and DevOps teams to deploy services to the Multi-Cloud Platform.
- Conduct load tests and chaos tests to verify the scalability and reliability of microservices.
- Build observability for microservices and cloud platforms such as AWS, OCI, Azure, and GCP.
- Contribute to writing and executing disaster recovery plans in collaboration with the Development and DevOps teams.
- Help analyze and resolve production risks caused by insufficient resources or capacity settings, such as node groups, CPU, memory, HPA scheduling, and JVM pre-warming.
- Write and maintain scripts for automation using languages like Python, Go, or Bash.
- Assist in defining and maintaining KPIs (SLA/SLO/SLI) for all cloud microservices, working with development teams to align them with business needs.
- Create and maintain technical documentation, including architecture diagrams, design documents, and standard operating procedures.
- Ensure adherence to security and compliance standards, including ISO 27001, SOC 2, and GDPR.
- Participate in incident response efforts to troubleshoot and resolve production issues quickly.
- Conduct post-incident analysis to identify root causes and potential workarounds/solutions.
- Contribute to product and technology selection, including implementing proofs of concept (POCs).
- Adapt to change, including evolving processes and tools.
- Participate in mentoring and training less senior members of the team.
- Join the on-call rotation and provide support outside working hours and on weekends.
Benefits
- Free snacks and drinks
- Lunch provided on Fridays
- Fully paid medical, dental, and vision insurance (partial coverage for dependents)
- 401(k) contributions
- Bi-annual reviews and annual pay increases
- Health and wellness benefits, including free gym membership
- Quarterly team-building events