Overall 10+ years of experience in the IT Industry, currently working as a Principal Engineer.
Hands-on experience managing production-level AWS and GCP infrastructure, with expertise in services such as EC2, EKS, S3, load balancing, IAM, EBS, MSK, CloudWatch, RDS, Lambda, AWS Glue and Cost Explorer.
Expertise in managing infrastructure as code (IaC) using tools like Terraform, Packer, Vault and Helm.
Experience building highly scalable and reliable Kubernetes platforms on EKS and GKE.
Experience implementing HA, blue/green architecture, HPA, VPA and geo-based routing in production microservices platforms.
Build disaster recovery plans for various components in the SaaS platform by managing backups, monitoring, and building automation for cross-region or cross-zone restoration.
Experienced with CI/CD, DevOps, security, compliance, monitoring, logging and infrastructure optimization to build a highly reliable platform for microservices on AWS and Google Cloud Platform.
Build monitoring, alerting and logging with metrics and dashboards for various infrastructure components to identify issues early and fix them before they turn into full-blown production outages, using tools such as Prometheus, CloudWatch, Grafana, Runscope, PagerDuty, custom metrics exporters, Splunk and automation jobs.
Experience automating and managing large-scale production event streaming platforms such as MSK and Confluent Kafka.
Performed migrations from Confluent Kafka to AWS MSK with zero incidents for live customers of Prisma Cloud.
Experience automating and managing production-scale distributed database platforms such as SingleStore (MemSQL) on Kubernetes, with regular upgrades, security patching and incident management.
Experience setting up organization-wide standards for onboarding onto messaging platforms such as Kafka.
Maintain the best possible collaboration and communication with the team to share knowledge, train and mentor team members, and see them succeed.
Build HA for all components, continuously assess for deviations, and build roadmaps to improve resiliency.
Regular security patching for production and non-production to keep environments secure.
Experience managing stateless and stateful applications on the Kubernetes platform.
Create new, secure and efficient microservices-oriented processes and tools for Prisma Cloud.
Containerize applications with Docker and orchestrate them with Kubernetes.
Build pipelines for Continuous Integration and Delivery (GitLab CI, Jenkins and Spinnaker).
Well versed in configuration management with Git, Jenkins and Maven, and in managing application servers.
Hands-on experience in Python, shell, PowerShell and Golang.
Expert in Problem solving and Incident Management.
Experience managing microservice platforms for production, dev and QA environments.
Worked with Git, GitLab, Bitbucket and GitHub to maintain and secure code.
Strong communication and analytical skills and a demonstrated ability to handle multiple tasks as well as work independently or in a team.
Adoption of DevOps mindset and culture in the company.
Overview
10 years of professional experience
3 Certifications
4 Languages
Work History
Principal Engineer
Palo Alto Networks India
05.2021 - Current
Design, build, test, maintain and enhance the Event Streaming Platform product in Prisma Cloud (Palo Alto Networks) with 100% automation.
Build and maintain disaster recovery plans for various components in the SaaS platform, with many successful cross-region and cross-zone restores.
Build HA for all components, continuously assess for deviations, and build roadmaps to improve resiliency.
Regular security patching for production and non-production to keep environments secure.
Take ownership and drive projects towards completion without incidents.
Build and maintain cloud deployments in multiple cloud platform environments through infrastructure-as-code.
Handle incident response and troubleshooting with developers, QA, performance engineers and team members, and practice a blameless culture.
Maintain the best possible collaboration and communication with the team to share knowledge, train and mentor team members, and see them succeed.
Drive platform performance, scalability, security and reliability through continuous deployment, monitoring, logging, alerting and automation across the entire lifecycle of the platform (from inception and design through development and operationalization).
Operate the platform by defining metrics to quantify the health of the platform and its consumers.
Own the operational aspects of the platform by defining and building remediation steps in case of an incident.
Take part in on-call rotations with the team and take the lead in preventing incidents and maintaining platform SLAs, through automation and blameless postmortems.
Keep up with the cutting edge in relevant technologies, and drive implementation of new solutions.
Continuously learn and improve, reskilling when necessary, always looking to learn from mistakes and helping maintain a great learning culture across the team.
Document and evangelize the platform across other teams within Prisma Cloud (Palo Alto Networks).
Help platform consumers integrate their applications with the platform and get into production quickly and securely with fast, agile iterations.
Experience with large-scale event streaming platform migration from Confluent Kafka to AWS MSK using tools such as MirrorMaker 2.
Experience with monitoring, logging and alerting products like Prometheus, ELK, Grafana.
Senior Software Engineer
Lowe’s India Services Private Limited
09.2017 - 05.2021
Build a platform for microservices on Google Cloud Platform (GCP) with primary tools such as GKE, Terraform, Docker, Jenkins, Spinnaker, GCS and Istio.
Develop Jenkins pipelines for immutable infrastructure, which involves the automated build of a GKE cluster from scratch with production-ready configuration and workloads, followed by a blue/green cutover.
Use Terraform to convert all component configuration to code.
Build the process and automated pipelines that let consumers easily build and deploy applications onto the microservices platform, called Carbon, using Jenkins and Spinnaker.
Upgrade and Maintain the GKE Platform. Support and resolve issues for consumers in Dev/Stage/Prod.
Manage networking components on GCP such as Cloud Armor, Cloud NAT, master authorized networks, Google Load Balancer, VPC, subnets and routes.
Manage Istio service mesh components such as Virtual Services, Authorization Policies, canary deployments, Destination Rules, Istio Ingress Gateway, Egress Gateway and Service Entries using Terraform, shell scripts and Jenkins pipelines.
Integrate unit testing and automated testing of code for reliable infrastructure.
Work with development and testing teams during sprints to develop build and deployment pipelines.
Build CI/CD pipelines for projects and convert Jenkins job configuration to code.
CI/CD implementation using Jenkins Declarative or Scripted Pipelines.
Build disposable Jenkins instances and help customers run their own Jenkins as and when required.
Help customers develop automation for manual tasks using tools such as Jenkins.
Build and maintain end-to-end deployment pipelines for projects across various environments, including production.
Upgrade and maintain automation tools.
Work with agile teams on builds, tests, deployments and enhancements for CI/CD pipelines.