Nagesh Kampati

Senior DevOps / Site Reliability Engineer
Hyderabad

Summary

Senior DevOps and Site Reliability Engineer with 10+ years of experience designing, automating, and operating highly available, scalable, and secure cloud-native platforms, including strong work experience in Agile development environments. CKA-certified, with deep hands-on expertise in Kubernetes cluster management, governance, and production-grade deployments. Strong background in site reliability engineering, focusing on resilience, performance, and fault tolerance in production environments.

Overview

11 years of professional experience
4 Certifications

Work History

DevOps Engineer

Tata Consultancy Services
Hyderabad
05.2021 - Current

CME Group (Site Reliability Engineer)

  • Strong Site Reliability Engineering (SRE) background, focused on resilience, performance, fault tolerance, and implementing liveness/readiness probes, auto-scaling, and load balancer–based high availability.
  • Led cloud migrations to GCP, delivering end-to-end observability using OpenTelemetry, Prometheus, and Grafana, and automating infrastructure with Terraform and Ansible to support resilient systems at scale.
  • Deep hands-on expertise in Kubernetes cluster management, governance, and production-grade deployments, managing both stateless and stateful workloads, with data replication and failover strategies.
  • Designed UC4-based start and stop schedules for upstream and downstream workloads following GCP migration, enabling controlled environment availability, reduced operational risk, and optimized cloud resource usage.

PayPal (Container Platform Team)

  • Setting up DevOps CI/CD pipelines from scratch to automate deployments in hybrid environments (Dev, QA, and Prod) using Git, Maven, Docker, Ansible, and Jenkins.
  • Writing Ansible playbooks and Puppet manifests to automate deployments and day-to-day operational tasks to reduce or eliminate manual efforts.
  • Operating and managing large-scale Kubernetes and Apache Mesos clusters: managing control-plane and data-plane components, enabling vertical and horizontal pod autoscaling, and administering clusters with kubectl.
  • Implementing monitoring tools such as Splunk, HashiCorp Consul, GCP monitors, and PagerDuty to monitor application and infrastructure metrics.
  • Managing 30k Unix-based VMs in public and private cloud environments, including troubleshooting, debugging, and alert handling for system-level and service-level (systemd and Docker container) issues.
  • Performing 12/7 on-call rotations.

DevOps Engineer

JDA Software
05.2019 - 05.2021
  • Created end-to-end build and release pipelines for Android and iOS mobile apps built on the Xamarin platform.
  • Integrated static code analysis, coverage, quality, security, and vulnerability assessment tools such as SonarQube, Black Duck, and Checkmarx into the build pipeline.
  • Contributed to aligning the product with the OWASP Mobile Application Security Verification Standard (MASVS).
  • Implemented continuous delivery with Azure DevOps (pipelines) and Microsoft App Center.
  • Integrated a central repository manager into CI/CD using JFrog Artifactory to effectively manage project dependencies.
  • Automated CI/CD workflows with GitHub Actions, and wrote custom actions for reusability and better productivity.
  • Worked closely with software development and testing team members to design and develop robust solutions to meet client requirements for functionality, scalability, and performance.

Site Reliability Engineer

Copart India Technology Center
11.2017 - 07.2018
  • Developed a CI/CD system with Jenkins running on Kubernetes, using Kubernetes and Docker to build, test, and deploy.
  • Provisioned infrastructure using Terraform.
  • Created and administered Docker containers as requested by the Dev teams.
  • Optimized AWS resources for better security and lower cost.
  • Created a Slack bot to give quick insight into critical resources.
  • Used Ansible playbooks to deploy services.
  • Created Docker images for the appropriate environment and pushed them to the repository.
  • Analyzed security logs through Sumo Logic.
  • Ran maintenance scripts and sanity checks on Prod/Dev servers.
  • Built and deployed microservices through Jenkins.
  • Monitored the environment using Nagios and ManageEngine.
  • Automated internal website logins using Selenium with Python.
  • Maintained batch jobs on AS400 servers.
  • Handled build and release management using Git and Maven, along with Jenkins administration.

Linux Admin

Netenrich Technologies
11.2014 - 11.2017
  • File system administration: worked extensively on file system creation and extension.
  • Responsible for all OS issues, security, performance, upgrades, and patching.
  • User management: set up and maintained groups and user accounts.
  • Built new Red Hat/CentOS Linux systems in multiple environments, including physical, virtual, and AWS cloud environments.
  • Kernel/OS upgrades, patches, configuration changes, and package updates on Red Hat/CentOS servers using RPM, local YUM repositories, and RHN.
  • Disk management: created, modified, mounted, and unmounted file systems locally or remotely (NFS, iSCSI, and Samba).
  • Added new disks based on LUN IDs, extended/reduced LVM logical volumes online, and managed swap issues.
  • Performance monitoring: monitored Linux servers for CPU, memory, and disk utilization.
  • Password management: applied policies such as expiry dates and unsuccessful-login limits, locked/unlocked users, and configured the LDAP client.
  • Cron jobs: automated tasks such as backups (incremental/full) and performance reports using scripts.
  • Followed ITIL processes, such as incident, problem, and change management.

Education

Bachelor of Science - Computer Science

JNTU
Kakinada
03.2014

Skills

Kubernetes, Docker

Accomplishments

  • Received the Best Performer of the Quarter award for implementing CI/CD automation and touchless deployments.
  • Pillar Team Award, H1-2019 (Sep), JDA Software Pvt Ltd: presented to the Luminate Mobility Team in recognition of the team's valuable contributions.


Certification

Certified Kubernetes Administrator (CKA)
