Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Trainings
Trainings
Trainings
Generic

Rajesh Kumar

Site Reliability Engineer
Bengaluru,India

Summary

  • Total 15 years of IT experience: 13 years in Software Configuration Management, Infrastructure & DevOps production Support and Administration. 2 years experience in Backend Operations
  • 8 years of experience in Team-management, managing multiple teams covering 24x7 support across Geographies
  • Extensive experience and good knowledge on Devops tools (Git,Gerrit,Jenkins,Artifactory, Nagios, Docker, Terraform, Ansible, Prometheus, Kubernetes, Grafana)
  • Experience with server-side technologies such as Apache, nginix, HAProxy, Networking concepts.
  • Good Knowledge in cloud service provider AWS(EC2, S3, IAM, VPC, Cloudwatch, LoadBalancer, Auto-scaling)
  • Proficient in Team Management, Project Management, Client interaction, Operations Management, Incident & Change Management, Service Delivery Operations, Knowledge Transfer Planning, Runbook and SOW & SOP Preparation

Overview

1
1
Certificate
15
15
years of professional experience

Work History

Site Reliability Engineer

HCL Technologies Ltd
BANGALORE, Karnataka
06.2020 - Current
  • Administer and trouble-shoot Git & Gerrit Infrastructure that includes Master/slave issues, replication issues between regions, load-balancing, server health-checks, planned maintenance/outage activities. Assist developers in their SCM tasks, Security certificate updates
  • Migration of Nagios monitoring to Prometheus and Grafana based monitoring.
  • Have extensive experience in Jenkins Administration activities, and good knowledge on CI/CD pipelines. Review build results, debug build issues, and discuss technical issues with developers, architects, and managers. Perform Jenkins upgrades as and when required, Node management, job management, overall capacity assessments. Job creation, Git Gc job on Jenkins slaves, Managing of Jenkins storage in NFS shares.
  • Ensure docker containers are up and running for Jenkins builds and analyze and fix any issues that hampers Jenkins build on Docker containers. Initial docker setup on Linux machines, Configuring and adding docker instances to Jenkins. Good at docker commands, writing docker files.
  • Administer Nagios to constantly monitor DevOps infra and add/remove monitoring items and drive alert reduction process. Nagios Alert setup, threshold amendments and Alert reduction.
  • Migration of Nagios Monitoring setup to Prometheus-Grafana, custom alert setup on the Devops applications.
  • KB Documentation creation/updation
  • Good knowledge on Linux and Shell Scripting. Good understanding of Cloud Technologies like Amazon Web Services (AWS) VPC, EC2, S3, IAM, Cloud Watch, Load Balancer, Auto-Scaling.
  • Good understanding of fundamental technologies like TCP/IP, HTTP, Nginx, HAProxy and DNS & best practices related to security, performance, web server configuration, monitoring, trending, and high availability.

Senior DevOps Engineer

HCL Technologies Ltd
Bengaluru, Karnataka
05.2017 - 05.2020
  • Managing team of 7 employees, overseeing hiring, training, and professional growth of employees & providing 24x5 support.
  • Experience in T&M, Managed Services and Outcome based Delivery model, customer service industry, technical support, product support, end-user support, Consulting, and developer support.
  • Administer and trouble-shoot Version control and code-review tools such as Git & Gerrit Infrastructure that includes Master/slave issues, replication issues between regions, load-balancing, server health-checks, planned maintenance/outage activities. Assist developers in their SCM tasks, Security certificate updates.
  • Have extensive experience in Jenkins Administration activities, and good knowledge on CI/CD pipelines. Review build results, debug build issues, and discuss technical issues with developers, architects, and managers. Perform Jenkins upgrades as and when required, Node management, job management, overall capacity assessments. Job creation, Git Gc job on Jenkins slaves, Managing of Jenkins storage in NFS shares.
  • Ensure docker containers are up and running for Jenkins builds and analyze and fix any issues that hampers Jenkins build on Docker containers. Initial docker setup on Linux machines, Configuring and adding docker instances to Jenkins. Good at docker commands, writing docker files. Proof of concept for Nagios containerized solutions.
  • Administer Nagios to constantly monitor DevOps infra and add/remove monitoring items and drive alert reduction process. Nagios Alert setup, threshold amendments and Alert reduction.
  • Good knowledge on Linux and Shell Scripting. Good understanding of Cloud Technologies like Amazon Web Services (AWS) VPC, EC2, S3, IAM, Cloud Watch, Load Balancer, Auto-Scaling.
  • Good understanding of fundamental technologies like TCP/IP, HTTP, Nginx, HAProxy and DNS & best practices related to security, performance, web server configuration, monitoring, trending, and high availability.
  • KB Documentation creation/updation
  • Wrote bash scripts for daily maintenance activities to reduce manual efforts, manage CI/CD Infra downtimes

Technical Specialist

HCL Technologies Ltd
Bengaluru, Karnataka
05.2010 - 04.2017
  • Worked as IBM Rational tools administration for a UK Banking client
  • Team Lead managing 24x7 support & Performing L2 Infrastructure support for a UK Banking client adhering to ITIL process.
  • Troubleshooting of ClearCase issues following strict Incident Management process such as installation of ClearCase on Windows client, solving ClearCase view issues, Solving Rebase/ Deliver issue, baselines configurations, including labeling, branching/merging, versioned files and stream creation.
  • ClearQuest: Installation of ClearQuest on windows client, user access management, Team Menu items creations, People record management, work request template creation. Report generation.
  • Performing Build via Rapid Deploy and Troubleshooting Build failures.
  • Raising pre-approved change records and standard changes for ClearCase server activities.
  • Participate in CAB and Incident Management calls pertaining to infrastructure.

Senior Analyst(Datacenter Operations)

HCL Technologies Ltd
Bengaluru, Karnataka
11.2008 - 05.2010
  • SCOM, MOM & Nagios: Proactive monitoring of Windows, Exchange and UNIX Servers, Oracle database & applications, provide application support for SCOM platform and create and maintain monitoring and notification rules, create console roles and reviews &create custom reports on results of SCOM monitoring
  • Raising incidents against ticketing tools Remedy and CA to respective support (Windows, UNIX & Messaging) groups
  • Increased customer satisfaction by resolving issues.

Technical Support Engineer

HCL Technologies Ltd
Chennai, Tamil Nadu
07.2006 - 10.2008
  • Troubleshot issues with broadband connection related issues via e-mail & chat and E-mail clients (Microsoft Outlook, Outlook Express), Antivirus/Firewall (Norton)
  • Supported customer with remote assistance when required
  • Provided outbound call/callback support for broadband/ADSL line issues
  • Responded to support requests from end users and patiently walked individuals through basic troubleshooting tasks.


Education

Bachelor of Engineering - Electronics And Communications Engineering

Vellore Institute Of Technology
Vellore
Aug 2000 - Apr 2004

Skills

    Nagios

undefined

Accomplishments

  • Optimized Infrastructure monitoring by reducing 40 % of unwanted alerts.
  • Enabled Autofix for the alerts we receive in production.
  • Drove Incident reduction program resulting in saving time of support team doing repeated tasks.
  • Driving Runbook automation by automating tasks by scripting where possible.

Certification

ITIL V3 Foundation

Timeline

Site Reliability Engineer

HCL Technologies Ltd
06.2020 - Current

Senior DevOps Engineer

HCL Technologies Ltd
05.2017 - 05.2020

ITIL V3 Foundation

08-2013

Technical Specialist

HCL Technologies Ltd
05.2010 - 04.2017

Senior Analyst(Datacenter Operations)

HCL Technologies Ltd
11.2008 - 05.2010

Technical Support Engineer

HCL Technologies Ltd
07.2006 - 10.2008

Bachelor of Engineering - Electronics And Communications Engineering

Vellore Institute Of Technology
Aug 2000 - Apr 2004

Trainings

  • Completed External trainings on Gerrit Administration in 2017.
  • Completed Internal trainings in Project Management and ITIL in 2013.

Trainings

  • Completed External trainings on Gerrit Administration in 2017.
  • Completed Internal trainings in Project Management and ITIL in 2013.

Trainings

  • Completed External trainings on Gerrit Administration in 2017.
  • Completed Internal trainings in Project Management and ITIL in 2013.
Rajesh KumarSite Reliability Engineer