Summary
Overview
Work History
Education
Skills
Websites, Portfolios and Profiles
Timeline
Generic

Sudhir kumar

Gurgaon

Summary

Results-driven DevOps Engineer skilled in AWS, Kubernetes, and CI/CD pipelines. Proven ability to optimize costs and enhance cloud security, ensuring robust and scalable solutions.

Overview

14
14
years of professional experience

Work History

DevOps Engineer

TBO.COM
Gurugram, HR
11.2023 - Current
  • Worked on AWS Cloud services for examples: - Ec2, Lamda, VPC, S3,AWS Autoscaling, EKS, RDS, WAF, Amazon Manged Prometheus, Directory Service, Grafana, Cloudformation, Cloudwatch, SNS, SES etc.
  • Implemented automated deployment scripts using AWS Devops, reducing deployment time.
  • Designed and deployed a scalable Amazon Kubernetes service (EKS) cluster, improving application uptime.
  • Optimized Ec2 and S3, leading to a reduction in operational costs.
  • Migrated on-premises applications to AWS.
  • Developed disaster recovery strategies within Azure, decreasing RTO (Recovery Time Objective).
  • Configured and maintained CI/CD pipelines using Jenkins, Github Actions Runner, achieving faster software release cycle.
  • Implemented AWS security Hub & optimized AWS cost management strategies.
  • Designed and deployed virtual networks and subnets, increasing network performance.
  • Developed scripts for automated backups and monitoring, ensuring a 99.9% data availability.
  • Monitored network and security system protocols, reducing unauthorized access incidents.
  • Provided technical support and troubleshooting for AWS-related issues, achieving an 80% resolution rate within the SLA..
  • Experience in provisioning infra using Terraform & Proficient in IAAS, PAAS & SAAS environments.
  • Managing customers' Linux Infrastructure on AWS clouds & Managing 2000 + Linux EC2 instances and VM's from various public clouds like AWS.
  • Experienced in Set up monitoring and logging solutions (e.g cloudwatch, Amazon Managed Prometheus, Amazon Managed Grafana) to ensure the reliability and performance of applications and infrastructure.
  • Proficient in managing and optimizing source code repositories using Git
  • Familiarity with Git commands, branching strategies (such as Git Flow), and integrating Git/GitHub with CI/CD pipelines.
  • Proficient in creating or writing Ansible playbook as per requirement & troubleshooting ansible playbook errors and implement solution for them.
  • Proficient in creating pipelines for build test, test and release using YAML pipeline.
  • Proficiency in Kubernetes, including cluster management, deployment of applications using YAML manifests, creating and managing deployments, services, and ingress controllers.
  • Proficient in developing and maintaining a Kubernetes-based autoscaling solution, resulting in reduction in infrastructure costs and increase in application performance during peak traffic periods.
  • Design, build, and maintain efficient, reusable, and reliable Docker containers, creating Docker files, managing Docker images, and optimizing Docker performance and implementing automated processes for deployment, scaling, and management of containerized applications.
  • Proficient in configuring, customizing and application deployment or micro services deployments on docker, docker images, docker containers, docker file, docker compose, docker swarm etc.
  • Collaborate with cross-functional teams to integrate Docker into our development and deployment pipeline.
  • Proficient in troubleshooting and resolving issues related to Docker containers and orchestration.
  • Experience in continuous integration and continuous delivery/deployment using Jenkins & automated build & deployments process using shell scripts, MAVEN, ANT, MS BUILD as a build tool for building and deploying artifacts (JAR, WAR) from source code.
  • Proficient in troubleshooting boot process issues. Configuring the Crontab and scheduling the jobs. Installing software packages and patches & server handing and patching of Linux servers in Cloud and on-Prem environment.
  • Proficient user administration creating, deleting, modifying, locking, unlocking, and managing user accounts, groups management, monitoring System performance of virtual memory, managing swap space, disk utilization, and CPU utilization & creating and increasing file systems.
  • Proficient in Configuring and managing YUM, SAMBA, SED, FTP, NFS, SSH, HTTP servers.
  • Proficient in setting up clustering on AIX and LINUX environments.
  • Managing LVM Partitioning in Linux servers. (Creating, Extending and Reducing) & Proficient in SAN configuration.
  • While working on Disk space alerts and incidents, performing disk clean up whenever there is scope for disk clean-up by removing old logs files from respective partition by creating and scheduling shell script in crontab for the same.
  • Proficient in SSL certificates integration and domain hosting & ensuring secure communication channels by installing SSL certificates on ALBs and IIS applications.
  • Enhancing domain management efficiency by configuring subdomains in Route 53 & GoDaddy.
  • Installation, Configuration and Troubleshooting of Apache Tomcat Web Servers related issues & load balancing using Nginx and other tools.
  • Making documentation or SOP's and MOPs of critical tasks & participated in on-call rotations to provide 24/7 support for critical production systems.
  • Proficient in checking ticketing tools (clients on daily basis and need to work on tickets which are assigns to me & Troubleshoot issues and give support to clients when working in On Call shifts.

Cloud Operations Engineer

Taskus india pvt Ltd
06.2021 - 10.2022
  • Assisted in managing cloud resources including virtual machines, storage, networking, and load balancers.
  • Supported collaboration with cloud platforms such as Amazon Web Services, Microsoft Azure, and Google Cloud Platform.
  • Configured VPCs, subnets, security groups, and routing tables.
  • Monitored infrastructure and applications using tools like Prometheus, Grafana, ELK Stack, and Amazon CloudWatch.
  • Analysed logs and metrics.
  • Handled alerts, outages, and production incidents.
  • Performed root cause analysis (RCA).

Windows Administrator

Singicent Information Solutions LLP
Mohali
08.2018 - 06.2021
  • Participated in on-call rotation, providing round-the-clock support to address critical system issues.
  • Managed Active Directory services, overseeing user accounts, groups, and permissions to enhance system security.
  • Resolved technical issues and provided end-user support, offering guidance and troubleshooting assistance for Windows-related problems.
  • Installed and configured various software and hardware for smooth-running systems operation.
  • Configured and maintained DNS, DHCP, and IPAM services to support network infrastructure and improve connectivity.
  • Administered Windows Server environments, including deployment, configuration, and maintenance of server systems to optimise performance.
  • Installed and maintained IT equipment and software to meet workflow requirements.

Technical Support Engineer

CMS IT service Pvt Ltd
02.2012 - 05.2016
  • Provided clear and concise step-by-step technical support to guide clients.
  • Helped customers set up new systems, applications and software.
  • Resolved service user requests within target timeframes.
  • Used support tickets to track and speed up incidents.
  • Used remote access to navigate and link to customer computers.

Education

BSC IT - INFORMATION TECHNOLOGY

Lovely Professional University
Phagwara, IN-PB
2016

Skills

  • Containerisation
  • Python/bash Scripting
  • Version Control
  • Database Management
  • Kubernetes management
  • Management Cloud Technology
  • CI/CD pipelines
  • Docker orchestration
  • Infrastructure as code
  • Cloud security
  • Incident management
  • Technical troubleshooting
  • Cost optimization
  • AWS
  • Kubernetes
  • Cloud Identity and Access Management (IAM)
  • Terraform
  • Microservices architecture
  • Agile methodology
  • Cloud Computing
  • Server Maintenance
  • Infrastructure Monitoring
  • Security & User Administration
  • Shell Scripting
  • Network Configuration
  • Deployment Provisioning
  • Operating System: Linux
  • Operating System: Unix
  • Operating System: Mac
  • Operating System: Windows
  • Version Control System: SVN
  • Version Control System: GIT
  • Version Control System: GITHUB
  • Version Control System: GITLAB
  • Automation/Build Tools: Jenkins
  • Automation/Build Tools: Maven
  • Automation/Build Tools: Ansible
  • Monitoring Tools: Zabbix
  • Monitoring Tools: Nagios
  • Monitoring Tools: Cloudera
  • Monitoring Tools: Grafana
  • Network protocol: TCP/IP
  • Network protocol: UDP
  • Network protocol: DHCP
  • Network protocol: HTTP
  • Network protocol: HTTPS
  • Network protocol: NTP
  • Network protocol: FTP
  • Network protocol: SSH
  • Network protocol: TELNET
  • DBMS: SQL
  • DBMS: Postgres SOL
  • Languages: C
  • Languages: KQL
  • Languages: Shell Script
  • App & Web Servers: Tomcat
  • App & Web Servers: JBoss
  • App & Web Servers: Nginx
  • Cloud & Virtualization: Azure
  • Cloud & Virtualization: AWS
  • Cloud & Virtualization: VMware
  • Cloud & Virtualization: Docker

Websites, Portfolios and Profiles

https://www.linkedin.com/in/sudhir-kumar-830a58127/

Timeline

DevOps Engineer

TBO.COM
11.2023 - Current

Cloud Operations Engineer

Taskus india pvt Ltd
06.2021 - 10.2022

Windows Administrator

Singicent Information Solutions LLP
08.2018 - 06.2021

Technical Support Engineer

CMS IT service Pvt Ltd
02.2012 - 05.2016

BSC IT - INFORMATION TECHNOLOGY

Lovely Professional University
Sudhir kumar