Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

MUTHARASAN GOPALAKRISHNAN

DEVOPS ENGINEER | SRE
Chennai

Summary

Senior DevOps / SRE Engineer with 9+ years of experience building and operating scalable, reliable infrastructure across cloud and Kubernetes environments. Specialized in infrastructure-as-code, observability, and automation. Proven track record of reducing MTTR by 40%, improving uptime by 50%, and eliminating operational toil through SLI/SLO-driven reliability, error budget management, blameless incident reviews, and production-scale systems engineering.

Overview

9
9
years of professional experience
5
5
Certifications

Work History

System Administrator — DevOps / Cloud / SRE

Sellermania
03.2020 - Current
  • Owned SLI/SLO definitions and error budget tracking for 150+ Linux production servers; established blameless post-mortem process, reducing MTTR by 40% and improving overall uptime by 50%.
  • Built and owned a Monitoring & Reliability Platform using Prometheus, Grafana, ELK, Zabbix, and CloudWatch — delivering end-to-end observability, automated alerting pipelines, and capacity planning across all production environments.
  • Designed and operated a Kubernetes Container Platform on Amazon EKS — managing cluster lifecycle, RBAC, namespaces, Helm charts, ingress controllers, rolling updates, and auto-scaling for high-availability workloads.
  • Architected an End-to-End CI/CD platform on Kubernetes using Jenkins, GitLab CI, GitHub Actions, and Argo CD (GitOps) — covering automated build, test, SonarQube quality gates, Artifactory artifact management, and zero-downtime deployments.
  • Built an AWS Infrastructure Automation Platform using Terraform, Terraform Modules, and AWS CloudFormation — provisioning and managing EC2, VPC, IAM, S3, Route 53, Load Balancers, Auto Scaling, and CloudTrail with full IaC coverage.
  • Developed Bash and Python automation scripts for deployments, health checks, toil elimination, maintenance, and operational runbooks — significantly reducing manual intervention.
  • Performed Linux server patching, performance tuning, capacity planning, hardening, and zero-downtime lifecycle management across the production fleet.
  • Configured enterprise networking: DNS, DHCP, TCP/IP, VLANs, VPNs, subnetting, firewall rules, and pfSense policies for secure network segmentation.
  • Designed backup, disaster recovery, and storage solutions using AWS S3 and Linux-based systems.
  • Collaborated with Dev, QA, and Security teams in Agile/Scrum sprints to improve deployment reliability and release velocity; mentored junior engineers on DevOps best practices and incident management.

Production Support Engineer

Sellermania
04.2017 - 03.2020
  • Provided L2 production support for mission-critical Java, PHP, and trading applications — managing incidents, service recovery, hotfix deployments, RCA, and SLA compliance.
  • Monitored production environments using Zabbix, Grafana, and ITRS Geneos; proactively resolved performance, deployment, and application issues before SLA breach.
  • Executed application releases, post-release validation, defect fixes, and production deployments using UDeploy, Shell scripting, and SQL.
  • Supported MySQL troubleshooting, batch jobs, and on-prem to cloud migration activities with near-zero downtime; led stakeholder communication during major incidents.

Education

B.Tech - Computer Science & Engineering

Sri Ganesh College of Engineering and Technology
Puducherry, India
01-2014

Skills

  • Cloud & IaC: AWS (EC2, S3, IAM, VPC, EKS, Route 53, CloudWatch, CloudTrail, SQS, Auto Scaling) Terraform Terraform Modules AWS CloudFormation Ansible
  • Containers & Orchestration: Kubernetes Amazon EKS Docker Helm Argo CD GitOps RBAC Ingress Namespaces
  • CI/CD: Jenkins GitLab CI GitHub Actions SonarQube Artifactory Git
  • Observability & Monitoring: Prometheus Grafana ELK Stack Zabbix CloudWatch ITRS Geneos SLI SLO Alerting
  • Scripting & Automation: Bash Python Shell scripting SQL
  • Networking & Security: DNS DHCP TCP/IP VLANs VPNs pfSense Firewall IAM Policies Security Groups

Certification

Kubernetes Administration | Udemy, 2026

Timeline

System Administrator — DevOps / Cloud / SRE

Sellermania
03.2020 - Current

Production Support Engineer

Sellermania
04.2017 - 03.2020

B.Tech - Computer Science & Engineering

Sri Ganesh College of Engineering and Technology
MUTHARASAN GOPALAKRISHNANDEVOPS ENGINEER | SRE