Results-driven AWS DevOps and Observability Engineer with 6.2 years of IT experience, including 5 years in AWS DevOps and Observability, and 1.2 years in Technical Support. Proficient in cloud automation, CI/CD, monitoring, and infrastructure optimization, with hands-on expertise in AWS, Terraform, Kubernetes, Jenkins, Splunk, and Python scripting. Adept at improving system reliability, automating deployments, and optimizing cloud costs. Seeking a role to leverage expertise in DevOps, automation, and observability to drive operational excellence.
Overview
6 years of professional experience
Work History
Senior DevOps Engineer & SRE
Technumen Systems Pvt Ltd (Client: Liberty Mutual Insurance)
04.2024 - 11.2024
Led application observability using Splunk and Datadog, analyzing logs and performance metrics.
Worked extensively with Splunk for log analysis, identifying patterns and anomalies across microservices. Collaborated with developers to optimize log formats, improve search queries, and enhance troubleshooting efficiency.
Designed and optimized Splunk dashboards for real-time monitoring.
Improved log analysis processes, reducing issue resolution time by 50%.
Conducted root cause analysis (RCA) for critical errors (500, 404, timeout issues, etc.).
Performed postmortem analysis using Splunk logs, identifying trends to prevent recurring incidents.
Developed a Python script to fetch CPU usage metrics from all services by making API calls to Datadog host URLs, enabling real-time performance monitoring and automated analysis.
Implemented SRE best practices by defining SLIs, SLOs, and error budgets, supporting data-driven decisions on service reliability and on-call management.
Assisted in CI/CD pipeline enhancements and infrastructure monitoring.
Senior DevOps Engineer
Speroware Technologies (Client: CGI)
09.2021 - 04.2024
Developed CI/CD pipelines in Jenkins with Git, SonarQube, and Docker, reducing deployment time by 60%.
Built and deployed containerized applications using Docker and Kubernetes on AWS EKS, improving application scalability and reliability.
Coordinated with other teams for successful rollouts of new features or bug fixes.
Configured, managed, and monitored cloud-based services such as AWS EC2, S3, EBS, ELB, and RDS using Terraform.
Implemented modular Terraform configurations with reusable modules for networking, IAM, and compute resources, improving consistency, reusability, and scalability across multi-environment deployments.
Enhanced the security of Terraform state files by storing them in Amazon S3 with versioning and encryption, ensuring data integrity, recoverability, and compliance with security policies.
Developed Python scripts for automating tasks such as EBS snapshot backups and CloudWatch log analysis.
Optimized costs through Reserved Instances, right-sizing of EC2 instance types based on resource needs, S3 storage classes and lifecycle policies, and Auto Scaling.
Enhanced log analysis capabilities by integrating Splunk with AWS CloudWatch logs, enabling faster root cause analysis for production issues.
Implemented automated alerts and incident response workflows using CloudWatch Alarms, SNS, and AWS Lambda, reducing incident resolution time.
Deployed applications on AWS EKS clusters, optimizing auto-scaling policies to handle more traffic during peak loads.
Worked with Kubernetes resources such as Pods, Deployments, ReplicaSets, ConfigMaps, Secrets (HashiCorp Vault), Namespaces, Services, Ingress Controllers, Limits, Roles, and RoleBindings.
Implemented an ingress controller to manage ingress routing rules and services for the Kubernetes cluster, and configured TLS certificates on the ingress.
Optimized AWS costs by automating idle EC2 instance termination, EBS volume cleanup, and S3 lifecycle policies, achieving a 25% cost reduction.
Monitored system health using Splunk for early detection of issues.
Designed and optimized Splunk dashboards for real-time monitoring of critical applications.
Created priority-based incident management workflows (P1, P2, P3) with automated alerting via email.
DevOps Engineer
Speroware Technologies (Client: Infosys)
01.2020 - 08.2021
Created CI/CD pipelines with Jenkins and Docker to automate application builds.
Used Git and GitHub for version control and source code management.
Integrated SonarQube for static code analysis and security scanning.
Used Maven with Jenkins to build deployable artifacts (WAR and JAR) from source code.
Built optimized container images using Docker and multi-stage Dockerfiles, reducing image sizes by 40% and improving build efficiency for microservices.
Designed a multi-microservices architecture, deployed on Amazon EKS, for containerized workloads.
Used Terraform for infrastructure as code (IaC) to automate AWS resource provisioning.
Configured Terraform remote state management using the S3 backend, with DynamoDB for state locking, preventing race conditions in collaborative infrastructure deployments.
Designed and configured VPC architectures with public/private subnets, NAT Gateway, and security groups, ensuring secure communication and high availability.
Set up Prometheus and Grafana dashboards to monitor EKS workloads and trigger alerts for high CPU/memory usage.
Set up AWS SNS and SQS for asynchronous, event-driven communication between services.
Technical Support Engineer
Webroas
09.2018 - 12.2019
Applied day-to-day changes published or announced in client notifications.
Proposed bug fixes and design enhancements based on client feedback.
Performed data modifications and document updates.
Addressed and resolved technical issues.
Monitored the application daily and updated the database based on client notifications.
Provided prompt and accurate feedback to clients.
Education
Bachelor of Engineering - Electronics and Communications Engineering