Available for Projects

Hi, I'm Kanzal 👋

DevOps Engineer & Kubestronaut

I specialize in Kubernetes, automation, and cloud-native infrastructure.

About

I am a DevOps Engineer and certified Kubestronaut, having earned all five CNCF Kubernetes certifications (KCNA, KCSA, CKA, CKAD, CKS). I design, automate, and operate production-grade cloud-native infrastructure — scaling Kubernetes clusters with Rancher, RKE2, and K3s, building GitOps CI/CD pipelines, and managing multi-cloud environments on AWS and Azure with Terraform. I care deeply about reliability, security, and cost efficiency, with a track record of cutting storage costs and deployment times while keeping systems highly available.

Work Experience

Skills

Kubernetes
Terraform
Ansible
AWS
Azure
Grafana
Harbor
ArgoCD
Prometheus
Rancher
OpenTelemetry
Python
Go
Postgres
Docker
Claude
Nginx
Git
Ubuntu
YAML
Vault
Redis
Kubeflow
K3s
Helm
Kafka
Bash
Packer
Red Hat
LangChain
Ollama
Hugging Face
PyTorch
TensorFlow
GitHub Actions
GitLab
GCP
Cloudflare
Linux
MongoDB
Elasticsearch
Envoy
My Projects

Infrastructure & Automation

I've engineered various solutions, from automated CI/CD pipelines to high-availability Kubernetes clusters. Here are some of the key infrastructure projects I've built.

Production-Grade Kubernetes Platform

Built and operate production Kubernetes clusters on RKE2 and K3s with high availability, HAProxy load balancing, and Helm-based deployments. Manage the full cluster lifecycle—ingress, persistent storage, RBAC, and zero-downtime upgrades—for 20+ applications across multiple environments.

Kubernetes
RKE2
K3s
Helm
HAProxy
High Availability

Rancher Multi-Cluster Orchestration

Deployed and managed 30+ downstream Kubernetes clusters using Rancher. Integrated centralized RBAC via Azure Entra ID and automated cluster provisioning with Terraform, reducing setup time by 60%.

Rancher
Kubernetes
Terraform
Azure Entra ID
RBAC
Infrastructure as Code

AWS Multi-Tier Architecture & Security

Designed and deployed scalable VPC architectures across multiple Availability Zones. Implemented Application Load Balancers (ALB) and Auto Scaling groups with Spot Instances for cost optimization, and enforced network security with network ACLs (NACLs) and security groups.

AWS
VPC
EC2
ALB
Auto-scaling
IAM
CloudWatch

EFK Stack & Logging Operator

Deployed ELK/EFK stacks using the ECK operator and CNCF Logging Operator for centralized log management. Integrated ElastAlert2 with Prometheus Alertmanager for real-time anomaly detection across multiple clusters.

Elasticsearch
Fluentd
Kibana
ECK Operator
Prometheus
Alertmanager

Cost-Optimized Harbor & S3 Migration

Established an OCI-compliant Harbor registry for 1,000+ images with Trivy scanning. Migrated 2 TB+ of data from EBS to S3 with lifecycle policies, achieving an 86% reduction in storage costs.

Harbor
Trivy
AWS S3
Docker
Cost Optimization

GitOps & CI/CD Pipelines

Engineered end-to-end CI/CD workflows using Drone CI and ArgoCD. Implemented declarative GitOps practices for 20+ applications, enabling zero-downtime upgrades and manual approval gates for production safety.

ArgoCD
Drone CI
GitOps
Helm
Kubernetes

Infrastructure as Code with Terraform & Terragrunt

Orchestrated complex, multi-environment cloud infrastructure using Terraform and Terragrunt. Implemented modular design patterns, remote state management, and automated deployments to ensure consistency and reduce manual overhead across multi-cloud environments.

Terraform
Terragrunt
AWS
Azure
Infrastructure as Code
GitOps

High-Availability Postgres with PGO & CloudNativePG

Engineered production-grade PostgreSQL clusters on Kubernetes using CrunchyData Postgres Operator (PGO) and CloudNativePG (CNPG). Implemented automated backups to S3, seamless failover, and comprehensive monitoring for critical data workloads.

PostgreSQL
PGO
CloudNativePG
Kubernetes
Helm
S3

Scalable MySQL Clusters with Percona Operator

Automated the lifecycle of MySQL databases on Kubernetes using the Percona Distribution for MySQL Operator. Configured high availability, point-in-time recovery, and seamless scaling to handle enterprise-level traffic and data growth.

MySQL
Percona Operator
Kubernetes
High Availability
StorageClass

Enterprise Centralized Logging with Graylog

Architected and deployed a robust centralized logging solution using Graylog. Configured complex processing streams, custom extractors, and real-time alerts to provide deep visibility into infrastructure and application health.

Graylog
MongoDB
OpenSearch
GELF
Kubernetes

Scalable Observability with VictoriaMetrics

Implemented VictoriaMetrics as a high-performance, cost-effective long-term storage solution for Prometheus metrics. Optimized query performance and significantly reduced the storage footprint for large-scale monitoring data.

VictoriaMetrics
Prometheus
Grafana
Kubernetes
TSDB

Cloud-Native Kafka Orchestration with Strimzi

Deployed and managed Apache Kafka clusters on Kubernetes using the Strimzi operator. Built scalable, event-driven data pipelines with Kafka Connect and integrated Schema Registry for robust message validation.

Apache Kafka
Strimzi
Zookeeper
Kubernetes
Event-Driven Architecture

Event-Driven Autoscaling with KEDA

Implemented event-driven autoscaling for Kubernetes workloads using KEDA. Configured KEDA scalers for Kafka, RabbitMQ, and Prometheus metrics to optimize resource utilization and maintain performance during traffic spikes.

KEDA
Kubernetes
HPA
Prometheus
Autoscaling
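
For readers unfamiliar with queue-based autoscaling, here is a minimal sketch of the arithmetic behind it, assuming a scaler that targets an average backlog per replica (as a Kafka lag threshold does). The function and parameter names are hypothetical, and the sketch deliberately ignores details such as HPA tolerance, stabilization windows, and the Kafka scaler's partition-count cap.

```python
import math


def desired_replicas(total_lag: int, lag_threshold: int,
                     min_replicas: int = 0, max_replicas: int = 10) -> int:
    """Estimate the replica count for an average-value target.

    total_lag      -- backlog summed across the whole queue or topic
    lag_threshold  -- backlog each replica is expected to handle
    """
    if lag_threshold <= 0:
        raise ValueError("lag_threshold must be positive")
    # One replica per `lag_threshold` units of backlog, rounded up.
    wanted = math.ceil(total_lag / lag_threshold)
    # Clamp to the configured bounds.
    return max(min_replicas, min(wanted, max_replicas))
```

With a threshold of 100, a backlog of 450 messages asks for five replicas, and an empty queue scales down to the configured minimum.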

Secure Infrastructure Access with Hoop.dev

Integrated Hoop.dev for centralized, audited, and secure access to internal databases and servers. Enhanced security posture by enforcing least-privilege access and providing a unified entry point for administrative tasks.

Hoop.dev
Access Management
RBAC
Security
Auditing

Secret Management with OpenBao

Deployed OpenBao for secure secret storage and dynamic credential generation. Migrated legacy secret management workflows to this open-source solution, ensuring high security and compliance across environments.

OpenBao
Secrets Management
Security
Encryption
Kubernetes

Unified Observability Dashboards with Grafana

Designed and maintained complex Grafana dashboards for a 'single pane of glass' view of the entire infrastructure. Integrated Prometheus, Loki, and Tempo for comprehensive monitoring across metrics, logs, and traces.

Grafana
Prometheus
Loki
Tempo
Observability

Automated Image Lifecycle with Packer & SaltStack

Built Packer templates and SaltStack states inside CI pipelines to produce hardened, vulnerability-scanned AWS AMIs and Azure VM images. Enforced automated retention policies to retire outdated images, cutting image sprawl by 70%.

Packer
SaltStack
AWS AMI
Azure
CI/CD
Bash
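
To illustrate what an image retention rule can look like, here is a minimal sketch. It assumes a policy of "always keep the newest few images, and retire the rest once they pass an age limit"; the actual policy used in this project is not described above, and the `Image` type, function name, and defaults are all hypothetical. It only selects candidates and makes no cloud API calls.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class Image:
    image_id: str
    created_at: datetime


def images_to_retire(images, now, keep_latest=3, max_age_days=90):
    """Return images outside the newest `keep_latest` that are older than the age limit."""
    cutoff = now - timedelta(days=max_age_days)
    # Sort newest first so the protected images are at the front.
    newest_first = sorted(images, key=lambda img: img.created_at, reverse=True)
    candidates = newest_first[keep_latest:]
    return [img for img in candidates if img.created_at < cutoff]


# Usage with made-up image IDs and ages (in days).
now = datetime(2024, 1, 1, tzinfo=timezone.utc)
inventory = [Image(f"img-{age}", now - timedelta(days=age)) for age in (1, 10, 100, 200, 300)]
retired = images_to_retire(inventory, now)
```

Protecting the newest images regardless of age means a rollback target still exists even if no builds have run for a while.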

AI Chatbot Infrastructure on Kubernetes

Deployed and managed AI chatbot infrastructure on Kubernetes with automated scaling. Implemented monitoring and logging to track performance and user interactions, and tuned response times and reliability alongside the development team.

Kubernetes
AI/LLM
Autoscaling
Monitoring
Logging

Education

Bachelor of Science in Computer Science (BSCS)
Computer Science
2023 - Present
Contact

Get in Touch

Feel free to reach out with any DevOps-related questions, or just to say hi! You can reach me by email or through my social media channels.