We are seeking an experienced Lead ML+DevOps Engineer to play a crucial role in ensuring the scalability and reliability of our AI/ML infrastructure.
Requirements
- Extensive experience in deploying machine learning models to cloud environments
- Strong expertise in Docker and container orchestration
- Proficiency in Terraform for infrastructure as code (IaC) and cloud resource management
- Hands-on experience with streaming data platforms (e.g., Kafka, Kinesis)
- Solid understanding of data cleaning, transformation, and ETL processes
- Experience with CI/CD tools and pipelines (e.g., Jenkins, GitLab CI)
- Strong programming skills in Python
- Familiarity with ML frameworks (e.g., TensorFlow, PyTorch) is a plus
- Excellent problem-solving skills and the ability to think critically and creatively
- Strong communication skills, including the ability to explain technical concepts to non-technical stakeholders