Summary
Overview
Work History
Education
Skills
Certifications
Knowledge Purview
Timeline
Generic
UJJWAL KUMAR

UJJWAL KUMAR

Senior Site Reliability Engineer
Bengaluru

Summary

Site Reliability Engineer with expertise in AWS container services (EKS, ECS, ECR) and cloud-native architectures. Skilled in ensuring high availability, fault tolerance, and security for large-scale applications. Proficient in Terraform, Ansible, and Python scripting for automation, infrastructure provisioning, and reliability improvements. Experienced in managing CI/CD pipelines with Git and GitHub Actions, enabling seamless deployments and operational efficiency. Strong track record of optimizing systems, reducing downtime, and driving DevOps best practices across teams.

Overview

10
10
years of professional experience
4
4
years of post-secondary education

Work History

Senior Site Reliability Engineer

Akamai Technologies India Pvt Ltd
12.2024 - Current
  • Design, automate, and manage cloud infrastructure using Terraform, Ansible, and other Infrastructure as Code (IaC) tools to ensure scalable and reliable deployments.
  • Develop automation scripts and tools in Python to improve system reliability, monitoring, and operational efficiency.
  • Manage source code, infrastructure changes, and workflows using Git with GitHub Actions for CI/CD pipelines.
  • Implement and maintain observability, monitoring, and alerting practices to ensure high availability and performance of critical services.
  • Collaborate with development and operations teams to improve system resilience, optimize deployments, and streamline incident response.

Cloud Support Engineer

Amazon Web Services (AWS)
04.2022 - Current
  • Provide comprehensive support to global customers of AWS, leveraging in-depth knowledge of AWS services to resolve complex issues and enhance user satisfaction.
  • Specialize in container services, including Amazon Elastic Kubernetes Service (EKS), Amazon Elastic Container Service (ECS), and Amazon Elastic Container Registry (ECR), ensuring optimal deployment, management, and operation of containerized applications.
  • Demonstrate expertise in a broad range of AWS services that integrate with container solutions, including Elastic Compute Cloud (Amazon EC2), Virtual Private Cloud (VPC), Amazon Simple Storage Service (S3), AWS Identity and Access Management (IAM), Amazon Elastic Block Store (EBS), and Amazon Elastic File System (EFS).
  • Accredited as a Subject Matter Expert (SME) for Amazon Elastic Kubernetes Service (EKS), recognized for deep technical expertise and tasked with handling high-level escalated tickets. This role involves troubleshooting and resolving complex, high-impact issues that customer-facing engineers are not able to resolve, ensuring customer satisfaction and service reliability.
  • Lead troubleshooting and technical support efforts, significantly reducing resolution time for critical issues and contributing to a improvement in customer satisfaction ratings within the container services domain.
  • Design and implement effective solutions for customers, enabling them to maximize their AWS service usage, improve application performance, and ensure security compliance.
  • Collaborate with cross-functional teams to update and maintain internal knowledge bases, documentation, and customer-facing FAQs, enhancing the overall support process and reducing repeat inquiries.
  • Participate in continuous learning and training sessions on the latest AWS technologies and updates, ensuring the provision of up-to-date and knowledgeable support to AWS customers.

Senior Software Engineer

HCL Technologies
08.2021 - 01.2022
  • Design and implement Kubernetes clusters to ensure high availability, reliability, and scalability of applications across multiple environments.
  • Implement and manage container networking, storage, and security policies to comply with organizational and regulatory standards.
  • Monitor and troubleshoot Kubernetes environments, utilizing logging and monitoring tools to ensure optimal performance and quick resolution of issues.
  • Collaborate with development teams to containerize and migrate legacy applications to Kubernetes, ensuring seamless integration and minimal downtime.
  • Provide guidance and support for application development teams on containerization best practices, Kubernetes usage, and cloud-native architectures.
  • Continuously update and maintain documentation related to Kubernetes infrastructure, deployments, and best practices to ensure knowledge sharing and team collaboration.
  • Participate in on-call rotations to ensure 24/7 availability and reliability of critical applications and services.

System Engineer

Omnesys NEST Trading Platform, Tata Consultancy Services
09.2020 - 04.2021
  • Rendering technical support for multiple Institutional Clients working on Omnesys Trading Platform. Worked as L2 engineer.
  • Migrating application running on physical servers to Kubernetes Environment.
  • Working parallel with Kubernetes Development Engineers for different applications deployment.
  • Monitoring and Managing application running on Docker.
  • Daily BOD activities and supporting CTCL trading platform.
  • Contributing in implementation and support for Trading applications.

Associate System Engineer

Omnesys NEST Trading Platform, Refinitiv India Shared Services Pvt. Ltd
08.2019 - 08.2020
  • Acting as a part of the Operations Team. Worked as an L2 engineer.
  • Contributing in implementation and support for Trading applications.
  • Daily BOD activities and supporting the CTCL trading platform.
  • Rendering technical support for multiple Institutional Clients working on Omnesys Trading Platform.
  • Facilitating the maintenance of all trading servers.
  • Deploying new releases for Application on RHEL Server. Automating deployment task by using Ansible and Bash Scripting.
  • Part of Kubernetes Team for migration of application running on physical server to Kubernetes Cluster.

Linux System Administrator

High Frequency Trading Platform, Triad Square Infosec Pvt. Ltd
07.2018 - 05.2019
  • Played a key part of the Operations Team. Worked as L1 Engineer.
  • Monitored HFT (High Frequency Trading) apps on different RedHat and Ubuntu servers.
  • Engaged in creating and modifying monitors on Zabbix for the Monitoring teams.
  • Maintained smooth operation of multi-user computer systems through collaboration with hardware and network engineers.
  • Implemented, developed and tested installation and update application servers.

System Engineer

Sazpin Software Pvt. Ltd
02.2016 - 08.2017
  • Managed more than 100 servers with various versions of Linux Operating system (Ubuntu, CentOS and RedHat).
  • Engaged in troubleshooting & fixing of server wide errors and issues.
  • Monitored Live Streaming TV channels through Linux servers.
  • Managed and monitored installed systems for highest level of availability.
  • Provided 1st level technical support and troubleshooting to internal and external clients.
  • Managed installation, upgrade and deployment projects and provided on-site direction for network engineers.
  • Wrote and maintained custom scripts to increase system efficiency and performance time.

Education

Bachelor of Technology - Electronics And Communications Engineering

NM Institute of Engineering And Technology
Bhubaneswar, Odisa
09.2011 - 05.2015

Skills

Kubernetes

Certifications

  • SME - AWS Elastic Kubernetes Services
  • Certified Kubernetes Administrator (CKA)
  • AWS Certified Solutions Architect - Associate

Knowledge Purview

  • Experience in containerizing and migrating application to Kubernetes.
  • Deployed various application on Kubernetes Cluster hosted on AWS (EKS).
  • Have written Manifest files for different Kubernetes objects.
  • Worked with HELM Package Manger in creating customs chart for various application and deployed them to Kubernetes Cluster.
  • Used Kubernetes to orchestrate the deployment, scaling and management of Docker Containers.
  • Created Docker Images using a Dockerfile, worked on Docker Container snapshots, removing images and managing Docker Volumes.
  • Experience in scripting languages including Shell Scripting and Ansible.
  • Working knowledge on versioning tools like Git, GitHub.
  • Working knowledge in CI (Continuous Integration) and CD (Continuous Deployment) methodologies using Jenkins. Continuous Delivery of Applications to App Servers like Tomcat.
  • Knowledge on AWS services like EC2, S3, VPC, ELB, Auto Scaling Groups, Route53, IAM.
  • Skilled in Cloud Computing (Linux, AWS).
  • Daily maintenance/administration of multiple RedHat/Linux systems.
  • Worked on different change requests and incidents for production servers.
  • Provided primary system administration, configuration, and troubleshooting of the RedHat/Linux environment and performance issues.
    Scheduling tasks using Crontab.

Timeline

Senior Site Reliability Engineer

Akamai Technologies India Pvt Ltd
12.2024 - Current

Cloud Support Engineer

Amazon Web Services (AWS)
04.2022 - Current

Senior Software Engineer

HCL Technologies
08.2021 - 01.2022

System Engineer

Omnesys NEST Trading Platform, Tata Consultancy Services
09.2020 - 04.2021

Associate System Engineer

Omnesys NEST Trading Platform, Refinitiv India Shared Services Pvt. Ltd
08.2019 - 08.2020

Linux System Administrator

High Frequency Trading Platform, Triad Square Infosec Pvt. Ltd
07.2018 - 05.2019

System Engineer

Sazpin Software Pvt. Ltd
02.2016 - 08.2017

Bachelor of Technology - Electronics And Communications Engineering

NM Institute of Engineering And Technology
09.2011 - 05.2015
UJJWAL KUMARSenior Site Reliability Engineer