Operative is seeking a highly motivated and detail-oriented NOC Engineer to join our Technology Operations Center (TOC) Monitoring Team with a focus on Cloud Platforms. The successful candidate will be responsible for monitoring and troubleshooting cloud infrastructure, ensuring optimal performance, and escalating issues as required.
Requirements
- Knowledge of monitoring and observability tools (Grafana, New Relic, Zabbix, ELK, AWS CloudWatch, etc.).
- Familiarity with cloud platforms (AWS, GCP, etc.) and the ability to monitor, manage, and troubleshoot cloud infrastructure and services.
- Working knowledge of AWS CloudWatch including creating monitors, setting up alerts, and analysing logs to detect and troubleshoot infrastructure issues.
- Familiarity with networking concepts (TCP/IP, DNS, DHCP, etc.) and cloud networking configurations.
- Understanding of virtual machines, cloud storage, and cloud databases.
- Working knowledge of Python or shell scripting is desirable.
- Good understanding of operating systems (Linux, Windows).
- Working knowledge of AWS Lambda and serverless architecture; ability to monitor Lambda function performance, detect failures, and identify issues in serverless workflows.
- Experience with REST APIs and HTTP concepts; ability to monitor and troubleshoot backend service connectivity and performance issues.
- Good understanding of AI-powered monitoring tools and their role in automating incident detection and remediation in cloud infrastructure.
- Experience with ticketing and workflow tools such as Jira for incident and task management.
Benefits
- Competitive salary
- Benefits package