Job Description
As a DevOps Lead Engineer, you will architect and operate a highly scalable, secure, and compliance-ready multi-cloud platform built on GCP, Kubernetes, GitOps, and Infrastructure-as-Code.
This is a high-impact role where your decisions directly influence platform reliability, deployment velocity, security posture, cost efficiency, and operational excellence.
Role Overview
We’re looking for a hands-on DevOps leader with deep expertise in:
● GCP cloud architecture
● Kubernetes platforms (GKE or equivalent)
● Terraform & IaC best practices
● GitOps workflows
● CI/CD pipelines at scale
● Security & compliance (PCI/SOX preferred)
● Production reliability & observability
You will work across engineering, security, SRE, and client architecture teams to shape the platform’s infrastructure direction and ensure operational excellence.
Key Responsibilities
Cloud Architecture (GCP + Multi-Cloud)
● Design and operate cloud infrastructure spanning multiple regions and environments.
● Define networking, identity, security, and infrastructure standards.
● Drive multi-cloud integration patterns (GCP primary, AWS/Azure optional).
Infrastructure-as-Code & Automation
● Own Terraform architecture, module standards, versioning, validation, and automation.
● Establish GitOps workflows for infrastructure and application deployments.
● Lead migration from legacy or manual setups to fully automated IaC-based systems.
Kubernetes Platform Engineering
● Architect and operate production-grade Kubernetes clusters at scale.
● Implement service mesh, observability, deployment strategies, and workload security.
● Define best practices for manifests, Helm/Kustomize, CI-integrated testing, and rollouts.
CI/CD & Deployment Automation
● Design enterprise CI/CD pipelines (GitHub Actions, Azure DevOps, etc.).
● Implement secure build pipelines, artifact management, and progressive delivery.
Security, Compliance & Governance
● Implement cloud security controls aligned with PCI/SOX and enterprise compliance.
● Manage secrets, encryption, IAM, policies, and audit posture.
● Collaborate with security teams on vulnerability management and incident prevention.
Observability & Reliability
● Build unified logging, metrics, tracing, SLO/SLI frameworks, and dashboards.
● Lead incident response, production readiness assessments, and capacity planning.
● Advocate for reliability engineering practices, including chaos testing.
Leadership & Collaboration
● Mentor DevOps and platform engineers.
● Lead technical reviews, RFCs, and architecture discussions.
● Enable engineering teams with self-service tooling and platform capabilities.
Required Skills & Qualifications
Technical
● 10+ years of experience across DevOps, SRE, and cloud platform roles.
● Strong expertise with GCP (VPC, IAM, GKE, Cloud SQL, security products).
● Deep Kubernetes experience (cluster design, service mesh, policies, deployment patterns).
● Expert-level Terraform (modules, state, multi-cloud providers).
● Strong experience with CI/CD and GitOps methodologies.
● Strong grounding in cloud security principles, including IAM, secrets management, and encryption.
● Experience with observability stacks (Prometheus, Datadog, OpenTelemetry, etc.).
● Solid scripting skills in Bash and Python; Go is a nice-to-have.
Leadership
● Ability to lead cloud modernization and DevOps transformation initiatives.
● Strong communication and cross-team collaboration skills.
● Experience working with architects, security, and product teams.
Bonus
● Fintech or regulated domain experience.
● AWS or Azure production experience.
● Platform engineering background building internal developer platforms.
