MUKESH KUMAR SAHOO

Bangalore

Summary

Site Reliability Engineer with 2 years of hands-on experience supporting production systems and optimizing critical administration tasks on Linux servers. Experienced with SRE technologies such as Linux, Windows, shell scripting, ServiceNow, Jira, Confluence, Prometheus, and Grafana. Experienced in creating dashboards, metrics, alarms, notifications, and observability for servers using Grafana and Prometheus. Established infrastructure and service monitoring using Prometheus and Grafana on Windows and Linux clusters. Experienced in writing automation scripts in Bash. Experienced in Linux administration and in troubleshooting Linux environments. Has worked in areas spanning Cloud Ops engineering, production support, and maintenance. Manages user accounts and develops scripts for system performance monitoring and troubleshooting. Strong knowledge of Linux-based infrastructure, Linux/Unix administration, and scripting (e.g., Bash). Good experience with Linux commands such as grep, sed, awk, top, free, df, du, ps, uptime, nslookup, and chmod, and with file handling. Knowledge of the SDLC (Software Development Life Cycle) and the ITIL V3 process. Good knowledge of Amazon cloud services such as EC2, VPC, IAM, Route 53, Lambda, Load Balancing, and CloudWatch. Good understanding of and exposure to the project development life cycle in Agile methodology. Excellent team player, able to meet deadlines and work under pressure.

Overview

2 years of professional experience

Work History

Site Reliability Engineer

EDUNEWRON Services Pvt Ltd
04.2022 - 08.2024
  • Provided 24/7 on-call support and worked in an Agile methodology using Jira
  • Handled ServiceNow incidents and supported L2 tickets
  • Troubleshot issues related to application deployment, pods, and infrastructure
  • Created dashboards in different accounts and imported/exported dashboards as JSON
  • Built visualizations and dashboards using Grafana
  • Monitored and troubleshot infrastructure and application issues
  • Created and maintained monitoring alerts in tools such as Prometheus and Grafana
  • Conducted initial diagnosis and troubleshooting of technical issues, aiming for swift resolution or escalation to the appropriate teams
  • Collaborated closely with cross-functional teams to facilitate the resolution of incidents and minimize downtime
  • Used incident management tools to log, track, and prioritize incidents according to established procedures
  • Implemented preventive measures and contributed to root cause analysis to avoid recurring incidents
  • Coordinated server maintenance, security patching, and user management
  • Maintained UNIX/Linux operating systems to provide optimum performance and system availability
  • Monitored system activity such as CPU, memory, disk, and swap space usage to avoid performance issues
  • Technologies: Linux, Grafana, Bash, Jira, Confluence, Terraform, AWS, Docker, and ServiceNow
  • Improved incident management workflows by creating comprehensive documentation on troubleshooting procedures and resolution steps for common issues
  • Developed custom scripts and tools as needed to automate routine tasks, increasing overall team productivity and efficiency
  • Conducted root-cause analyses after major incidents to identify opportunities for process improvement or technical enhancement
  • Developed custom software tools to automate routine tasks, boosting team efficiency and accuracy.

Education

B.Tech - Mechanical Engineering

IGIT

Skills

  • Shell Scripting
  • Python
  • Prometheus
  • Grafana
  • Linux
  • Windows
  • Jira
  • Confluence
  • ServiceNow
  • Amazon Web Services (AWS)
  • MySQL
  • Git
  • Docker
  • Terraform
  • Jenkins
