ZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports & Cine Optics.
Requirements
- 8-12 years of relevant industry experience
- Minimum of 3 years as a Site Reliability Engineering Lead
- Minimum of 5 years' experience as a Site Reliability Engineer
- Minimum of 8 years' experience with cloud computing platforms like Azure and related services
- In-depth knowledge of system architecture, networking, and microservice based distributed systems
- Expertise in designing and implementing reliable, scalable, and fault-tolerant systems using container Orchestration Technologies like Docker and Kubernetes
- Proficiency in setting up and managing monitoring, alerting, and logging systems for early detection and resolution of issues for container orchestrators like Kubernetes using Tools like Prometheus, Grafana, Open Telemetry Collector or similar tools
- Hands-on experience in incident management, including incident response, troubleshooting, and post-mortem analysis
- Proficiency in coding/scripting languages commonly used in infrastructure automation and monitoring (such as Terraform)
- Knowledge of best practices in disaster recovery planning and execution for cloud based Systems
- Ability to lead and mentor a team of SREs, providing guidance, support, and coaching
- Capability to advocate for SRE best practices and principles within the organization and drive cultural changes as needed
- Willingness to stay updated with the latest trends, tools, and technologies in the field of site reliability engineering
- Strong communication skills to effectively collaborate with cross-functional teams, including Software Developers, Product Owners, and Cloud Platform Engineers