Summary
Overview
Work History
Education
Skills
Timeline
Generic

Abhay Rastogi

Bangalore

Summary

Results-driven professional with extensive experience in automating deployment and system configurations, ensuring seamless integration and delivery processes. Advanced expertise in cloud platforms and scripting enhances operational efficiency and drives innovation. Proven ability to collaborate effectively with cross-functional teams while maintaining high standards and consistently achieving successful project outcomes. Committed to leveraging technical skills and industry knowledge to contribute to organizational success.

Overview

11
11
years of professional experience

Work History

DevOps Engineer

Juniper Networks
01.2022 - Current
  • Managed 11+ clusters, each comprising 300+ servers, overseeing the complete infrastructure build, deployment, and release management lifecycle.
  • Architected and integrated security and observability tools (Sumo Logic, Qualys, Wazuh) into AWS from the ground up using Ansible, Packer, and Terraform.
  • Provisioned and deployed complete new environments from scratch to production for customers on AWS and GCP.
  • Led Proof of Concept (POC) initiatives for new services; coordinated with development teams to deploy successfully from staging to production.
  • Managed and optimized core AWS services including VPC, EC2, S3, IAM, Route 53, CloudWatch, and Auto Scaling groups.
  • Managed and optimized core GCP services including VPC, Instance Groups, Cloud Storage Buckets, IAM, Load Balancers, Cloud DNS, Secret Manager, and Auto Scaling.
  • Created and updated AMIs using Packer to implement new features and promptly remediate vulnerabilities.
  • Automated configuration management and operational tasks using Ansible playbooks.
  • Designed and maintained robust CI/CD pipelines to automate and streamline the software delivery process.
  • Executed zero-downtime upgrades for critical infrastructure including Kubernetes, PostgreSQL, Storm, and Elasticsearch.
  • Designed and implemented containerization strategies using Docker and Kubernetes, leading to improved resource utilization and operational efficiency.
  • Reduced system downtime for critical applications by implementing advanced monitoring and proactive alerting systems.
  • Provided 24/7 on-call support for production systems, ensuring high availability and enabling rapid incident resolution.
  • Collaborated with development teams to streamline and automate software releases, significantly reducing time-to-market.
  • Developed and executed disaster recovery plans to ensure business continuity and minimize risk.
  • Orchestrated automated CI/CD pipelines, dramatically improving release frequency and reliability.
  • Coordinated and executed deployments for new software, feature updates, and critical patches.
  • Automated tasks and developed tools using scripting languages (Python, Bash).

Member of Technical Staff II

VMware
05.2019 - 01.2022
  • Owned end-to-end responsibility for maintaining high service availability, consistently meeting and exceeding defined SLA objectives.
  • Served in a 24/7 on-call rotation to rapidly diagnose and resolve outages, manage escalations, and perform upgrades and patches on Isilon storage systems.
  • Developed innovative solutions to complex technical problems, improving overall efficiency and productivity of the team.
  • Developed comprehensive documentation for various projects that facilitated knowledge sharing among teammates.
  • Proactively managed and optimized infrastructure to eliminate single points of failure, addressing issues related to resource constraints (RAM, CPU), system health, and user requests.
  • Leveraged AWS to build and manage a SaaS platform, utilizing key services including IAM, S3, EC2, Elastic Load Balancer, Route 53, and CloudWatch.
  • Automated operational and configuration tasks using Shell, Python, and Ansible, with full integration into Jenkins CI/CD pipelines.
  • Engineered and managed Kubernetes clusters and Docker containers for efficient image creation and application deployment.
  • Orchestrated automated CI/CD pipelines, which dramatically improved release frequency and reliability.
  • Coordinated and executed zero-downtime deployments for new software releases, feature updates, and critical security patches.
  • Developed custom automation scripts and tools using Python and Bash to streamline operations.
  • Managed DNS, DHCP, and IP address management (IPAM) through Infoblox for network configuration and automation.

Site Reliability Engineer

Yahoo! Inc.
11.2018 - 05.2019
  • Environment: Docker, GIT, sensu, Linux, hbase Apache, AWS, load balancing, nginx, chef, Ansible, Elastic search, Kafka, Azure, Jetty, Jenkins, Python
  • Handling 3000+ Linux servers component provisioning & handling configuration & build and deployment of infrastructure & services
  • Responsibility to maintain mail service availability with in the Sla.
  • Improved incident management workflows by creating comprehensive documentation on troubleshooting procedures and common issues resolution steps.
  • Developed custom scripts/tools as needed to automate routine tasks, increasing overall team productivity and efficiency.
  • Conducted root-cause analyses after major incidents to identify areas for process improvement or technical enhancement opportunities.
  • Improved deployment efficiency, automating processes using CI/CD pipelines.

Operation Engineer

Sprinklr
03.2017 - 09.2018
  • Environment: Docker, GIT, sensu, Linux, Apache, AWS, load balancing, haproxy, MongoDB, Mysql, Ansible, Elastic search, Kafka, Azure, glassfish, jenkins, Python
  • Worked on 4000+ Linux servers & component provisioning & handling configuration & build and deployment of infrastructure & services
  • Managed the infrastructure, avoided single point of failure, monitored RAM and CPU space and adding alerts as required.
  • Virtualisation & Load Balancing
  • Created server & components by application of Virtualisation Concept using AWS.
  • Employed HAProxy and Elb (aws) to conduct load balancing for handling a constantly increasing user base & ensuring scalability & reliability
  • Cloud Based Distributed Computing
  • Employed complex concepts & technologies including git, glass fish, Docker, apache, nodes, networking, etc.
  • Working on mesos , kafka and elastic search clusters.
  • Setup/Managing Linux servers on Amazon (EC2, EBS, ELB, ASG, SSL, Security Groups, RDS and IAM)
  • Launching server for asg group adding in elb and making entry in route53.
  • Working with tools like Sensu, New Relic, Kibana, Graylog.
  • Deployed the applications using WAR on multiple WebLogic Server and Maintained Load balancing, high availability and Failover functionality
  • Configuring jobs in MongoDB and MySQL and perform curd operations.

Associate Consultant

Capgemini
05.2014 - 03.2017
  • Environment: Jenkins, Atlassian Stash, Docker, Vagrant, Red Hat Satellite, AWS and vmware and physical server, ITSM
  • Launching server in Aws console adding security groups, disk
  • Create ELB for server and adding ELB in route53
  • Launching auto scaling group as per requirement.
  • Task automation, service management and application deployment using Bash scripting, ansible modules and Jenkins.
  • Developed environment provisioning solutions using Red Hat Satellite and vmware
  • Server virtualization management and basic storage management with vSphere.
  • Performing day to day activities as Unix admin like patching, adding disk, creating volume and LVM, adding route on server.
  • Taking snapshot of server in AWS and VSphere.
  • ITIL Process definition, documentation, implementation and management for the Release, Change, Knowledge, Configuration, Incident and Problem management processes of the project.
  • Creating NFS server, samba server in production server and mounting on client.
  • Systems Administrator (Red hat , HP-UX, CentOS and Windows 8) of +9000 virtual and physical servers

Education

B.Tech - Information Technology

SRMCEM
Uttar Pradesh
06.2013

Skills

  • Kubernetes
  • Terraform
  • Terragrunt
  • Storm
  • Flink
  • Kafka
  • RDS
  • Cloudsql
  • Maintenance and troubleshooting
  • Performance optimization
  • Project planning
  • Task prioritization
  • Incident management
  • Amazon web services
  • Google Cloud
  • Source and version control: Git
  • Continuous integration systems
  • Build releases
  • Cross-functional teamwork
  • Shell script
  • Python
  • Microservices architecture

Timeline

DevOps Engineer

Juniper Networks
01.2022 - Current

Member of Technical Staff II

VMware
05.2019 - 01.2022

Site Reliability Engineer

Yahoo! Inc.
11.2018 - 05.2019

Operation Engineer

Sprinklr
03.2017 - 09.2018

Associate Consultant

Capgemini
05.2014 - 03.2017

B.Tech - Information Technology

SRMCEM
Abhay Rastogi