Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Personal Information
Languages
Timeline
SoftwareDeveloper
MALKINDER SINGH

MALKINDER SINGH

Staff, Cloud Engineer
Pune,MH

Summary

AWS Certified SysOps Administrator and Red Hat Certified System Administrator with over 7 years of experience in cloud operations and DevOps engineering. Expertise in designing scalable AWS infrastructures and implementing CI/CD pipelines, resulting in enhanced deployment efficiency. Demonstrated success in cost optimization and high-availability architecture, contributing to operational excellence. Proven ability to improve system reliability and drive cross-functional team collaboration to elevate service delivery and customer satisfaction.

Overview

9
9
years of professional experience
3
3
Certification

Work History

Staff, Cloud Operation Engineer,

Druva Data Solutions Pvt. Ltd.
PUNE
11.2023 - Current
  • Developed comprehensive Grafana and CloudWatch monitoring framework for 360-degree application visibility.
  • Led incident management as second-level escalation, applying RCA techniques to enhance response times.
  • Established SLIs and SLOs using APM tools, improving service reliability and uptime.
  • Automated recurring Jira tasks with Python scripts, decreasing manual effort by 30%.
  • Optimized CU/CP instructions to expedite delivery timelines and streamline deployment procedures.
  • Drove cloud cost optimization through automated AWS Cost Explorer monitoring, aligning expenses with budget objectives.
  • Conducted documentation audits and facilitated training sessions to enhance CloudOps operational readiness.
  • Collaborated with DevOps, Security, and Engineering teams to integrate CloudOps practices, enhancing overall efficiency.

Cloud OPS Engineer,

Druva Data Solutions Pvt. LTD
PUNE
03.2022 - 11.2023
  • Conducted daily system monitoring to verify integrity and availability of server resources, systems, and key processes.
  • Reviewed application logs to identify potential issues and maintain operational efficiency.
  • Monitored and reported alerts to cross-functional teams, including Infra, DB, and Network.
  • Actioned application alerts such as service outages and disk issues for prompt resolutions.
  • Implemented automated monitoring and deployment scripts, reducing deployment time significantly.
  • Managed customer data backup and restoration through PagerDuty, ensuring adherence to SLA requirements.
  • Generated weekly and monthly COGS reports, identifying and reducing underutilized resources by 10%.
  • Oversaw infrastructure monitoring using Terraform, Grafana, and Coralogix to enhance visibility.

L2, Cloud Operation Engineer,

Saba Software Pvt. LTD
PUNE
11.2018 - 06.2021
  • Conducted environment audits to stabilize application performance across platforms.
  • Managed Tomcat application servers on Linux and Windows, ensuring optimal functionality.
  • Executed service ticket tasks to deliver solutions that improved customer satisfaction.
  • Monitored AWS infrastructure with Docker and Kubernetes for effective application management.
  • Handled Linux-related incidents to ensure compliance with Service Level Agreements.
  • Wrote shell scripts to automate operational tasks, including data loads and exports.
  • Analyzed application logs to troubleshoot issues and enhance configurations.
  • Managed user accounts and access rights on servers to maintain security protocols.

Cloud OPS Consultant,

Saba Software Pvt. LTD
PUNE
01.2018 - 10.2018
  • Enhanced operational efficiency by managing team performance and creating monthly rosters.
  • Escalated critical issues to level 2 support to uphold service level agreements.
  • Decommissioned customer accounts in accordance with established protocols.
  • Scheduled weekend maintenance activities, including application restarts, to reduce downtime.
  • Developed and maintained technical documentation for streamlined planning and execution.

Junior Cloud Operation Engineer,

Saba Software Pvt. LTD
PUNE
06.2016 - 12.2017
  • Conducted daily system monitoring to ensure integrity and availability of server resources.
  • Reviewed application logs to identify issues and optimize performance.
  • Monitored alerts, reporting findings to Infra, DB, and Network teams.
  • Addressed application alerts related to service outages, disk issues, and load problems.
  • Notified customers of service interruptions to maintain transparency.
  • Managed restarts while performing thorough sanity checks on features.
  • Created JIRA tickets for L2 team to enhance application stability based on alerts from NewRelic and Check_MK.

Education

Master in Computer Applications (MCA) - Computer and Information Systems

Punjab Technical University
Punjab
01.2013

Master of Science in Information Technology - Computer And Information Systems

Punjab Technical University
Punjab
01.2011

A Level - Computer and Information Systems

NIELIT (DOEACC)
Punjab
01.2009

Skills

Monitoring & Incident Management

  • Defined SLIs/SLOs and error budgets; ensured service reliability using APM tools
  • Led incident response, RCA, and post-incident reviews; used tools like PagerDuty
  • Proficient with Grafana, Coralogix, InfluxDB, New Relic, Nagios, and AWS CloudWatch

Cloud & Infrastructure Management

  • Hands-on with AWS: EC2, ECS, RDS, S3, VPC, ECR, Lambda, IAM, CloudFormation, CloudWatch, QuickSight
  • Implemented cost optimization via Cost Explorer, Savings Plans, and Reserved Instances
  • Familiar with Azure and GCP for hybrid cloud and migration use cases

Automation & Scripting

  • Automated infrastructure with Terraform and CloudFormation (IaC)
  • Developed Bash and Python scripts for operational automation and system tasks

DevOps & CI/CD

  • Built CI/CD pipelines with Jenkins, Git, Docker, AWS CodePipeline/CodeBuild/CodeDeploy
  • Deployed containerized microservices via Docker and ECS

Security & Compliance

  • Applied AWS security best practices: IAM policies, TLS/SSL, and vulnerability remediation

Operating Systems

  • Advanced Linux administration (CentOS, Ubuntu, Amazon Linux)
  • Managed Windows Server in cloud and hybrid environments

Networking

  • Configured VPCs, subnets, NAT, SGs, routing; solid grasp of DNS, TCP/IP, VPN, firewalls

Leadership & Collaboration

  • Mentored engineers, led CloudOps/DevOps initiatives, improved reliability and automation
  • Used Jira/Confluence for documentation, SOPs, and team collaboration

Accomplishments

  • Outstanding Achievement Award Q4 FY 2023-2024 at Druva Data Solutions, Pvt Ltd
  • All Stars Award Q2 2023-2024 at Druva Data Solutions, Pvt Ltd
  • Star performance award for quarter 3 at Saba Software, Pvt. Ltd.

Certification

  • AWS certified Sysops -Administrator -Associate
  • Red Hat Certified System Administrator (RHCSA), 160-157-673
  • Red Hat Certified Engineer (RHCE), 160-157-673

Personal Information

  • Date of Birth: 07/11/89
  • Marital Status: Single

Languages

Punjabi
First Language
English
Advanced (C1)
C1
Hindi
Advanced (C1)
C1

Timeline

Staff, Cloud Operation Engineer,

Druva Data Solutions Pvt. Ltd.
11.2023 - Current

Cloud OPS Engineer,

Druva Data Solutions Pvt. LTD
03.2022 - 11.2023

L2, Cloud Operation Engineer,

Saba Software Pvt. LTD
11.2018 - 06.2021

Cloud OPS Consultant,

Saba Software Pvt. LTD
01.2018 - 10.2018

Junior Cloud Operation Engineer,

Saba Software Pvt. LTD
06.2016 - 12.2017

Master in Computer Applications (MCA) - Computer and Information Systems

Punjab Technical University

Master of Science in Information Technology - Computer And Information Systems

Punjab Technical University

A Level - Computer and Information Systems

NIELIT (DOEACC)
MALKINDER SINGHStaff, Cloud Engineer