This role reports to the VP, Enterprise Architecture and requires a senior technical professional to design and maintain scalable, secure, and resilient hybrid environments (cloud and on-prem).
Requirements
- Maintain and patch systems, ensuring compliance and high availability.
- Implement and test disaster recovery strategies for hybrid environments.
- Optimize infrastructure resource allocation, performance, scalability, and fault tolerance across hybrid environments.
- Automate repetitive tasks to reduce manual intervention.
- Set up monitoring tools to track system health across cloud and on-prem environments.
- Define and measure SLIs/SLOs for critical services.
- Monitor application performance, latency, and resource utilization.
- Create actionable alerts and dashboards for system and cost performance.
- Automate incident response, monitoring, and alerting.
- Diagnose production incidents and develop fixes that can be implemented quickly and efficiently.
- Participate in a 24x7 on-call rotation for incident management.
- Implement security best practices across the infrastructure stack.
- Perform vulnerability assessments and manage access controls.
- Ensure compliance with industry standards (GDPR, SOC 1, SOC 2, etc.).
- Bridge gaps between development, infrastructure, IT support, operations, and other teams to ensure seamless delivery.
- Provide technical guidance and mentorship to other DevOps Engineers.
- Communicate technical insights effectively to stakeholders.
- Foster a culture of knowledge sharing and collaboration.
- Be an active contributor in the DevOps Community of Practice.
- Design, deploy, and maintain infrastructure using Infrastructure as Code (e.g., Terraform) for cloud environments.
- Manage identity and access solutions, including Microsoft Entra ID (Azure AD) and Okta.
- Build and maintain CI/CD pipelines for efficient software delivery.
- Collaborate with development teams to integrate automated testing and deployment.
- Champion a culture of continuous integration, delivery, and improvement.
- Monitor and analyze cloud usage and costs across AWS and Azure.
- Implement cost optimization strategies (rightsizing, reserved instances, autoscaling).
- Enforce mandatory tagging for cost allocation and governance.
- Develop dashboards and reports to provide visibility into cloud spend for stakeholders.
- Collaborate with finance and engineering teams to forecast budgets and track variances.
- Establish governance policies for cloud resource provisioning and cost accountability.
- Respond to cost anomalies and recommend corrective actions.
- Advocate for FinOps principles within engineering teams to balance performance and cost.
- Provision and manage on-premises infrastructure in data centers, including servers, storage, and networking.
- Build and maintain virtual machine templates for consistent deployments.
- Manage identity and access solutions, including Active Directory and Okta.
- Implement cost optimization strategies (rightsizing).
Benefits
- Paid Time Off
- 401(k) Matching
- Retirement Plan