Join Genpact as a Principal Consultant - GenAI LLM Ops Engineer and shape the future of work. Work on Large Language Model platforms and applications, partner with cross-functional teams, and drive change for global enterprises.
Requirements
- Architect secure, reusable, and modular infrastructure-as-code (IaC) frameworks for GenAI and LLM operations
- Design, deploy, and maintain LLM serving infrastructure (e.g., Azure OpenAI, self-hosted OSS models, vector databases).
- Implement model orchestration (routing, ensemble strategies, fallbacks, retries, cache layers).
- Build CI/CD pipelines for prompt catalogs, model configurations, guardrails, and evaluation suites.
- Define and track LLM-specific SLOs (latency, response quality, safety violations, hallucination rate).
- Implement telemetry (traces, logs, metrics, prompt/response analytics) and A/B experiments.
- Establish alerting & incident response playbooks.
- Lead the development and standardization of CI/CD pipelines for AI/ML model deployment
- Ensure security, privacy, and regulatory compliance (data residency, consent, auditability).
- Manage prompt governance (versioning, approval workflow, change logs, rollback).
- Define and enforce best practices for model versioning, governance and lifecycle management
- Troubleshoot and resolve issues related to LLM deployment, scaling, and performance
- Stay updated with advancements in MLOps, LLMs, and GenAI technologies
Benefits
- Competitive salary
- Benefits package
- Opportunities for growth and advancement