Platform Engineering
MLOps Lead
Own model deployment and observability blueprints.
Bengaluru / Pune, India
7–9 years
Full-time
Posted 11/25/2025
Kubernetes, Argo, Terraform, MLOps, CI/CD, Python, GPU Orchestration, Monitoring
About the Role
What You'll Do
Own model deployment, observability, and automation blueprints that keep AI workloads resilient in production.
Key Responsibilities
Your Impact
Here's what you'll be responsible for in this role:
- Design CI/CD templates for data pipelines, model training, and inference
- Define infrastructure reference stacks spanning containers, GPUs, and edge
- Implement monitoring for drift, bias, data freshness, and SLA adherence
- Collaborate with SRE and security on access control and compliance
- Document runbooks, release gates, and rollback mechanisms
Requirements
What We're Looking For
Required Qualifications
- 7+ years in DevOps/SRE, including 4+ years focused on MLOps
- Deep knowledge of Kubernetes, Argo, Terraform, and observability suites
- Experience orchestrating training pipelines on managed and self-hosted GPUs
- Familiarity with model registries, feature stores, and experiment tracking
- Strong scripting skills (Python, Go, or Bash) and an automation mindset
Required Skills & Technologies
Kubernetes, Argo, Terraform, MLOps, CI/CD, Python, GPU Orchestration, Monitoring
Benefits & Perks
What We Offer
We believe in taking care of our team members
Ready to Join Us?
Apply for This Position
Take the next step in your career. Fill out the form below and we'll be in touch soon.