/journey

Eight years of building and operating infrastructure at scale

From freelance DevOps to leading SRE teams and consulting on bare-metal Kubernetes, cloud architecture, and platform reliability — a path shaped by real production systems and operational challenges.

Present

SRE & Cloud Engineer — CoDeTech

SRE & Data Engineer Consultant — Namava

Building cloud infrastructure for decentralized platforms and consulting on Kubernetes, observability, and platform reliability for a major streaming service. Also available for independent consulting engagements.

2024

SRE & Data Engineer Consultant — Namava

Streaming platform · Contract

Brought in to redesign infrastructure from the ground up — bare-metal Kubernetes, observability, CI/CD, and cost optimization for a high-traffic streaming service.

Designed multi-cluster bare-metal Kubernetes environments across data centers with BGP-based load balancing on edge nodes
Built internal tooling in Go and Python for stream log processing, improving analytics and reducing manual debugging
Created a reusable GitLab CI catalog with dozens of pipeline components, cutting deployment setup from hours to minutes
Deployed a full observability stack with Grafana, Prometheus, VictoriaMetrics, and ClickHouse — measurable reliability gains
Tuned infrastructure and workloads, delivering significant capacity increases and meaningful cost reduction

2022

SRE & Cloud Engineer — CoDeTech

Decentralized platforms · Remote

Joined to build cloud infrastructure from scratch for decentralized products, working across multiple cross-functional teams on B2B and B2C solutions.

Designed and deployed multiple production environments from scratch with Terraform on GCP
Launched observability stack with Grafana Mimir, Tempo, and Prometheus — surfaced bottlenecks leading to order-of-magnitude throughput gains
Implemented Istio service mesh for microservices security, traffic management, and observability
Migrated several monolithic applications to cloud-native architectures

SRE & Cloud Team Lead — AloPeyk

On-demand delivery · Tehran

Promoted to lead the SRE team, responsible for platform availability, Kubernetes operations, and CI/CD standardization across the organization.

Drove platform availability to 99.98% through operational improvements and SRE practices
Managed large-scale on-premise Kubernetes clusters running thousands of production containers
Built multiple internal tools and automation scripts with Python, Flask, and FastAPI — significantly reducing manual work
Replaced standalone CI/CD configs with modular manifests across all projects
Deployed Proxmox virtualization cluster on bare metal with local and SAN storage

2019

SRE & Cloud Engineer — AloPeyk

On-demand delivery · Tehran

First SRE role at a high-traffic on-demand platform serving hundreds of thousands of requests per minute across dozens of microservices. Led the migration from VM-based infrastructure to Kubernetes.

Reduced release time by 70% and cut infrastructure costs by 25% through Kubernetes migration
Automated Docker builds and Helm-based deployments via GitLab CI/CD
Built Ansible roles for provisioning and maintenance, saving dozens of hours of manual work each month
Deployed ProxySQL for automated database failover and read/write query segregation

2017

DevOps Engineer — Freelance

Independent · Hybrid

Started building infrastructure for clients across different industries — CI/CD pipelines, infrastructure as code, and high-availability database clusters.

Automated CI/CD with GitLab CI and GitHub Actions for multiple teams and applications
Unified IaC with Ansible roles and playbooks, cutting dozens of hours from monthly provisioning work
Set up numerous HA database clusters — MongoDB, PostgreSQL with Patroni, MySQL Galera, Redis Sentinel

/places