Mohsen Mottaghi

Site Reliability Engineer & Cloud Architect


My journey

I've been working in SysAdmin, DevOps, SRE, and Cloud roles for more than 7 years.

Present

Currently working as a Site Reliability Engineer at Core Decentralized Technologies.
I also provide consulting services to companies across various fields of technology that are looking for reliable and scalable solutions.

Early 2022

Started working as a Site Reliability Engineer at Core Decentralized Technologies

At CoDeTech, I was responsible for the design, implementation, and maintenance of the company's infrastructure and services.

Designed and deployed 4 infrastructures from scratch with Terraform on Google Cloud Platform.

Launched a monitoring and distributed tracing stack consisting of Grafana Mimir, Tempo, Grafana, and Prometheus to find bottlenecks and speed up the ordering system by 30x.

Implemented Istio as a service mesh and ingress gateway to enhance microservices communication security, traffic management, and observability within the Kubernetes cluster.

Redesigned and restructured more than 4 monolithic application infrastructures into cloud-native ones.

Collaborated with three cross-functional teams to develop and deliver production-ready solutions for public and private clientele.

2022 - 2024

Promoted to SRE Team Lead at AloPeyk.

Led a 4-member SRE team, improving platform availability from 98% to 99.98%.

Managed an on-premises Kubernetes cluster running over 5,000 containers.

Replaced standalone GitLab CI/CD configurations with modular CI/CD manifests, enabling SRE teammates to update and patch all project pipelines 10x faster.

Developed Ansible roles for periodic checks and maintenance, reducing daily SysAdmin workload by 30%.

2019 - 2022

Started working as a Site Reliability Engineer at AloPeyk.

As a Site Reliability Engineer, I worked on multiple projects to enhance the company's infrastructure and services, including:

Reduced the release process time by 70% and achieved a 25% cost reduction by migrating from a VM-based infrastructure to a containerized environment using an on-premises Kubernetes cluster.


Developed GitLab CI/CD pipelines to automate Docker image creation, streamlined service deployment with Helm charts on Kubernetes, and optimized the release process to improve overall efficiency.


Prepared Ansible roles for routine tasks such as provisioning instances from scratch and periodic maintenance, eliminating 30 hours of monthly SysAdmin work.


Deployed a ProxySQL layer to automate database failover and optimize query distribution, improving read query performance and reducing application response times significantly.

2017 - 2019

Started working as a System Administrator at AloPeyk.

Created, tested, and deployed complete CI/CD pipelines for applications and teams using GitLab CI and GitHub Actions.


Unified infrastructure as code (IaC) with Ansible roles and playbooks to provision and manage virtual machines and applications, eliminating more than 25 hours of monthly work.


Built MongoDB replica sets and PostgreSQL clusters with Patroni, etcd, and HAProxy.


And more.

Featured Projects

I work on open-source projects and contribute to the community. My primary focus is on developing tools that help developers and engineers build better and more efficient systems. I release these projects on the InfraZ.io website to keep everything organized in one place.

Consultation

With years of experience in DevOps, SRE, and Cloud Architecture, I help businesses build scalable, reliable, and cost-efficient infrastructure. Whether you're a startup looking to establish a solid foundation or an enterprise seeking to optimize your cloud strategy, I can guide you through every step.

