Hi, I'm Kanzal 👋
DevOps Engineer & Kubestronaut
I specialize in Kubernetes, automation, and cloud-native infrastructure.
About
I am a DevOps Engineer and certified Kubestronaut, having earned all five CNCF Kubernetes certifications (KCNA, KCSA, CKA, CKAD, CKS). I design, automate, and operate production-grade cloud-native infrastructure — scaling Kubernetes clusters with Rancher, RKE2, and K3s, building GitOps CI/CD pipelines, and managing multi-cloud environments on AWS and Azure with Terraform. I care deeply about reliability, security, and cost efficiency, with a track record of cutting storage costs and deployment times while keeping systems highly available.
Work Experience
Skills
Infrastructure & Automation
I've engineered a range of infrastructure solutions, from automated CI/CD pipelines to high-availability Kubernetes clusters. Here are some of the key projects I've built.
Production-Grade Kubernetes Platform
Built and operate production Kubernetes clusters on RKE2 and K3s with high availability, HAProxy load balancing, and Helm-based deployments. Manage the full cluster lifecycle—ingress, persistent storage, RBAC, and zero-downtime upgrades—for 20+ applications across multiple environments.
Rancher Multi-Cluster Orchestration
Deployed and managed 30+ downstream Kubernetes clusters using Rancher. Integrated centralized RBAC via Azure Entra ID and automated cluster provisioning with Terraform, reducing setup time by 60%.
AWS Multi-Tier Architecture & Security
Designed and deployed scalable VPC architectures across multiple Availability Zones. Implemented Application Load Balancers (ALB), Auto Scaling groups with Spot Instances for cost optimization, and layered network security with NACLs and security groups.
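As a rough illustration of the multi-AZ layout described above, the sketch below carves one VPC range into a public and a private subnet per Availability Zone using Python's standard ipaddress module. The CIDR block, the zone names, and the /20 subnet size are assumptions chosen for the example, not values from the actual environment.

```python
import ipaddress


def plan_subnets(vpc_cidr, zones, new_prefix=20):
    """Assign one public and one private subnet to each Availability Zone."""
    vpc = ipaddress.ip_network(vpc_cidr)
    # subnets() yields equal-sized, non-overlapping blocks in address order.
    blocks = vpc.subnets(new_prefix=new_prefix)
    plan = {}
    for zone in zones:
        plan[zone] = {
            "public": str(next(blocks)),
            "private": str(next(blocks)),
        }
    return plan


# Hypothetical inputs: a /16 VPC spread over three zones.
plan = plan_subnets("10.0.0.0/16", ["us-east-1a", "us-east-1b", "us-east-1c"])
for zone, subnets in plan.items():
    print(zone, subnets)
```

Taking the blocks in address order keeps the plan deterministic, so rerunning it with the same inputs always yields the same subnet assignments.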

EFK Stack & Logging Operator
Deployed ELK/EFK stacks using the ECK operator and CNCF Logging Operator for centralized log management. Integrated ElastAlert2 with Prometheus Alertmanager for real-time anomaly detection across multiple clusters.

Cost-Optimized Harbor & S3 Migration
Established an OCI-compliant Harbor registry for 1000+ images with Trivy scanning. Migrated 2TB+ of data from EBS to S3 with lifecycle policies, achieving an 86% reduction in storage costs.
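To make the lifecycle-policy idea concrete, here is a minimal sketch of a tiering rule written in the dictionary shape that boto3's put_bucket_lifecycle_configuration accepts, plus a small helper that reports which storage class an object of a given age would be in. The rule name, the day thresholds, the storage classes, and the expiration are all assumptions for illustration; the real policy may look quite different.

```python
# Hypothetical lifecycle configuration; thresholds and classes are illustrative.
LIFECYCLE = {
    "Rules": [
        {
            "ID": "tier-then-expire",      # hypothetical rule name
            "Status": "Enabled",
            "Filter": {"Prefix": ""},      # empty prefix applies to every object
            "Transitions": [
                {"Days": 30, "StorageClass": "STANDARD_IA"},
                {"Days": 90, "StorageClass": "GLACIER"},
            ],
            "Expiration": {"Days": 365},
        }
    ]
}


def storage_class_at(age_days, rule):
    """Return the storage class for an object of this age, or None once expired."""
    if age_days >= rule["Expiration"]["Days"]:
        return None
    current = "STANDARD"
    # Walk the transitions in ascending order; the last one reached wins.
    for step in sorted(rule["Transitions"], key=lambda t: t["Days"]):
        if age_days >= step["Days"]:
            current = step["StorageClass"]
    return current


rule = LIFECYCLE["Rules"][0]
for age in (10, 45, 120, 400):
    print(age, storage_class_at(age, rule))
```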

GitOps & CI/CD Pipelines
Engineered end-to-end CI/CD workflows using Drone CI and ArgoCD. Implemented declarative GitOps practices for 20+ applications, with zero-downtime upgrades and manual approval gates guarding production releases.
Infrastructure as Code with Terraform & Terragrunt
Orchestrated complex, multi-environment cloud infrastructure using Terraform and Terragrunt. Implemented modular design patterns, remote state management, and automated deployments to ensure consistency and reduce manual overhead across multi-cloud environments.
High-Availability Postgres with PGO & CloudNativePG
Engineered production-grade PostgreSQL clusters on Kubernetes using the Crunchy Data Postgres Operator (PGO) and CloudNativePG (CNPG). Implemented automated backups to S3, seamless failover, and comprehensive monitoring for critical data workloads.
Scalable MySQL Clusters with Percona Operator
Automated the lifecycle of MySQL databases on Kubernetes using the Percona Distribution for MySQL Operator. Configured high availability, point-in-time recovery, and seamless scaling to handle enterprise-level traffic and data growth.
Enterprise Centralized Logging with Graylog
Architected and deployed a robust centralized logging solution using Graylog. Configured complex processing streams, custom extractors, and real-time alerts to provide deep visibility into infrastructure and application health.

Scalable Observability with VictoriaMetrics
Implemented VictoriaMetrics as a high-performance, cost-effective long-term storage solution for Prometheus metrics. Optimized query performance and significantly reduced the storage footprint for large-scale monitoring data.

Cloud-Native Kafka Orchestration with Strimzi
Deployed and managed Apache Kafka clusters on Kubernetes using the Strimzi operator. Built scalable, event-driven data pipelines with Kafka Connect and integrated Schema Registry for robust message validation.

Event-Driven Autoscaling with KEDA
Implemented intelligent autoscaling for Kubernetes workloads using KEDA. Configured custom scalers for Kafka, RabbitMQ, and Prometheus metrics to optimize resource utilization and ensure performance during traffic spikes.
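The scaling decision behind a lag-based Kafka scaler can be sketched in a few lines. This is a simplified model, not KEDA's actual code: it assumes the target replica count is the total consumer lag divided by a per-replica lag threshold, rounded up, optionally capped at the partition count (extra consumers in a group would sit idle), and clamped to the configured minimum and maximum. The function and parameter names are hypothetical.

```python
import math


def desired_replicas(total_lag, lag_threshold, min_replicas, max_replicas, partitions=None):
    """Simplified model of lag-based scaling: one replica per lag_threshold messages."""
    if lag_threshold <= 0:
        raise ValueError("lag_threshold must be positive")
    wanted = math.ceil(total_lag / lag_threshold)
    if partitions is not None:
        # More consumers than partitions would not increase throughput.
        wanted = min(wanted, partitions)
    # Clamp to the configured replica bounds.
    return max(min_replicas, min(wanted, max_replicas))


print(desired_replicas(950, 100, min_replicas=1, max_replicas=20))
print(desired_replicas(950, 100, min_replicas=1, max_replicas=20, partitions=6))
```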
Secure Infrastructure Access with Hoop.dev
Integrated Hoop.dev for centralized, audited, and secure access to internal databases and servers. Enhanced security posture by enforcing least-privilege access and providing a unified entry point for administrative tasks.

Secret Management with OpenBao
Deployed OpenBao for secure secret storage and dynamic credential generation. Migrated legacy secret management workflows to this open-source solution, ensuring high security and compliance across environments.
Unified Observability Dashboards with Grafana
Designed and maintained complex Grafana dashboards for a 'single pane of glass' view of the entire infrastructure. Integrated Prometheus, Loki, and Tempo for comprehensive monitoring across metrics, logs, and traces.
Automated Image Lifecycle with Packer & SaltStack
Built Packer templates and SaltStack states inside CI pipelines to produce hardened, vulnerability-scanned AWS AMIs and Azure VM images. Enforced automated retention policies to retire outdated images, cutting image sprawl by 70%.
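One way a retention policy like this can be expressed is sketched below: always keep the newest few images, and retire anything else older than a cutoff. The Image type, the image IDs, and the keep_latest and max_age_days defaults are assumptions made for the example rather than the policy actually in use.

```python
from datetime import datetime, timedelta, timezone
from typing import NamedTuple


class Image(NamedTuple):
    image_id: str       # hypothetical identifier, e.g. an AMI ID
    created: datetime


def images_to_retire(images, now, keep_latest=3, max_age_days=90):
    """Return IDs of images past the age cutoff, never touching the newest few."""
    newest_first = sorted(images, key=lambda img: img.created, reverse=True)
    protected = {img.image_id for img in newest_first[:keep_latest]}
    cutoff = now - timedelta(days=max_age_days)
    return [
        img.image_id
        for img in newest_first
        if img.image_id not in protected and img.created < cutoff
    ]


now = datetime(2024, 6, 1, tzinfo=timezone.utc)
ages = {"ami-a": 10, "ami-b": 50, "ami-c": 100, "ami-d": 200, "ami-e": 400}
images = [Image(name, now - timedelta(days=days)) for name, days in ages.items()]
print(images_to_retire(images, now))
```

Protecting the newest images regardless of age guards against deleting every usable image if builds stop for a while.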
AI Chatbot Infrastructure on Kubernetes
Deployed and managed AI chatbot infrastructure on Kubernetes with automated scaling. Implemented monitoring and logging to track performance and user interactions, and tuned response times and reliability alongside the development team.
Education
Get in Touch
Feel free to reach out with any DevOps-related questions, or just to say hi! You can reach me by email or through my social media channels.





