We are seeking a self-motivated and enthusiastic Site Reliability Engineer with hands-on experience supporting multiple connected cloud-based products. The ideal candidate will have a strong background in cloud platforms, infrastructure as code, and automation using programming and scripting languages.
Responsibilities
- Develop and maintain infrastructure as code (IaC) using Terraform
- Implement and enhance observability solutions using tools like New Relic, Datadog, Sumo Logic, and Splunk
- Perform code deployments and manage CI/CD pipelines using Jenkins, GitHub, and related tooling
- Automate routine tasks and workflows to increase operational efficiency and reduce manual intervention
- Lead incident response efforts, conduct root cause analysis, and implement long-term solutions for complex issues
- Collaborate with cross-functional teams to review and provide feedback on technical designs
- Participate in on-call rotations and handle critical incidents with confidence and expertise
- Continuously improve documentation for systems and services
Benefits
- Good work-life balance
- Opportunities for growth and professional development
- Collaborative and dynamic work environment