Summary
Overview
Work History
Education
Skills
Accomplishments
PERSONAL STRENGTHS
Projects
Certification
Timeline
Generic

SHASHI KUMAR

New Delhi

Summary

With 3+ years of experience in Site Reliability Engineering and a proven track record in enhancing cloud services and optimizing application performance, I have significantly contributed to the uptime and efficiency of large-scale systems. My expertise in incident response and security makes me an asset in maintaining high-performance standards and ensuring customer satisfaction.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Site Reliability Engineer

FarEye
Noida
08.2021 - Current
  • Experience with Docker and Kubernetes for deploying containerized applications.
  • Handling the monitoring and helping all the teams with alerting to keep the production up and running.
  • Migrated multiple EKS environments from version 1.29 to 1.31.
  • Created AWS resources as per the team requirement using Terraform.
  • Improved the product's availability through Horizontal Pod Autoscaling, toleration, topologySpreadConstraints, and PodDisruptionBudget on Kubernetes, ensuring uninterrupted service and enhancing user experience.
  • Established robust Kubernetes cluster monitoring through Prometheus, Thanos, and Grafana and introduced container log aggregation using Grafana Loki and Promtail.
  • Working with different teams to establish tools/processes to increase the quality and reliability of the test automation and reduce the manual processes.
  • I have set up the monitoring on microservices running on k8s and VMs.
  • Troubleshoot production issues on databases and applications hosted across VMs and microservices running in k8s, as well as owning the RCAs and solutions.
  • Managing and Streamlining Deployment.
  • Creates documentation on Confluence to troubleshoot many of the alerts and system admin tasks.
  • Good knowledge of Debezium, Kafka, Elasticsearch, and shell scripting.
  • Helped the team in infra-cost reduction.
  • Provided on-call technical support, ensuring rapid incident resolution and maintaining SLAs for continuous high availability and performance.
  • Worked on automation to take the thread dump of the running microservices and store it in S3.
  • Used JIRA for bug tracking, ad hoc tasks, and creating the dashboard for issues.

Education

Bachelor of Engineering - Electronics and Communications

Acharya Institutes
Bangalore
08.2020

Skills

  • Cloud Services Optimization
  • AWS
  • Linux
  • Github
  • CI/CD
  • Kubernetes
  • Docker
  • Terraform
  • Helmchart
  • Prometheus
  • Thanos
  • Grafana
  • NewRelic
  • Shell Scripting
  • Python
  • Postgresql
  • ELK Stack
  • Kafka
  • Performance Engineering
  • Monitoring Solutions
  • System Scalability
  • Incident Response

Accomplishments

  • Set Up Monitoring on k8s Cluster In Multiple Environments
  • Cost Reduction through Optimization

PERSONAL STRENGTHS

  • Analytical and critical thinker
  • Quick learner
  • Self Motivated
  • Flexibility and adaptability
  • Hard working

Projects

Wireless Spy camera using raspberry pi, To detect any obstacles or objects which comes in the specified range using ultrasonic sensor and detects the image of that objects and stores it in the hard drive which is present in the raspberry Pi.

Certification

  • Linux for Beginners
  • Learning Kubernetes

Timeline

Site Reliability Engineer

FarEye
08.2021 - Current

Bachelor of Engineering - Electronics and Communications

Acharya Institutes
SHASHI KUMAR