Summary
Overview
Work History
Education
Skills
Certification
Leadership Team Collaboration Skills
Languages
Timeline
Generic
Mausam Singh

Mausam Singh

SRE DevOps Engineer
Gurugram,HR

Summary

Experienced and self-motivated DevOps and Systems Professional with nearly 9.5 years of industry expertise in Unix/Linux, AWS, Azure, Ansible, and PostgreSQL. Skilled in diagnosing and resolving complex technical issues, creating clear operational runbooks, and optimizing systems for scalability, security, and efficiency. Proficient in automation, CI/CD practices, and collaborating with cross-functional teams to adapt systems to evolving business needs. Strong problem-solving abilities combined with a proven track record of enhancing operational efficiency and delivering high-quality solutions aligned with organizational goals. Holds a Bachelor of Technology (B.Tech) with extensive experience in Linux, cloud, and DevOps technologies.

Overview

10
10
years of professional experience
2
2
Certifications
3
3
Languages

Work History

SRE Technical Analyst 2

Adobe Systems
03.2022 - Current
  • Managed AWS instances across multiple regions, ensuring high availability and scalability for Adobe Campaign's marketing automation platform.
  • Troubleshooting & Resolving issues (escalated) related to Adobe's campaign product hosted on AWS/Azure environment.
  • Provided exceptional support for end-users by addressing technical concerns, reducing downtime, and improving user experience.
  • Configured and maintained AWS components such as EC2, ELB, Route53, S3, AMI, Security Groups, and CloudFront to support Adobe Campaign's infrastructure.
  • Collaborated with cross-functional teams to complete projects on time, ensuring client satisfaction and meeting business objectives.
  • Provisioned and decommissioned new and existing customer infrastructures, working closely with customers to understand their requirements and provide effective solutions.
  • Troubleshot application and infrastructure-level issues across production and other environments, using tools such as Splunk and Nagios to identify and resolve issues quickly.
  • Configured SSL certificates on EC2 instances and ELB to ensure secure communication between Adobe Campaign and customers' systems.
  • Configured SFTP to enable secure file transfer between Adobe Campaign and customers' systems.
  • Managed product version and build upgrades, ensuring smooth deployment and minimizing downtime for customers.
  • Configured GPG to enable encryption and decryption of sensitive data in Adobe Campaign's platform.
  • Monitor application performance and infrastructure health using New Relic, identifying and resolving bottlenecks to improve system reliability.
  • Actively use Grafana dashboards to visualize key metrics and provide actionable insights into system performance and resource utilization.
  • Manage and resolve customer escalations by prioritizing critical issues and driving them to resolution with faster turnaround time resulting in improved Average Resolution Duration (ARD) and overall customer satisfaction.
  • Working with various teams (development/upgrade/security) for planning and execution of maintenance activities for the stage and production application.
  • Executed IP rotations and deliverability audits to protect sender reputation and email performance.
  • Resolved DNS-related blockers and led holiday readiness planning for peak traffic events.
  • Conducted Cassini/Europa audits and contributed to ACMC customer service reviews to improve reliability and customer experience.
  • Automated SSL certificate reimport in AWS ACM using python.
  • Designed and implemented an automated system to schedule recurring operational and maintenance tasks and proactively notify teams via Slack, reducing missed activities and operational risk.
  • Designed and led CertShield, a real-time certificate monitoring and governance system to prevent unauthorized or accidental SSL certificate modifications
  • Managed DB Refresh and DB vacuum to ensuring smooth restoration from stage to production and minimizing downtime for customers.
  • Led the implementation of tech stack upgrades for various customers, including the migration to newer software versions, cloud technologies, and infrastructure enhancements. Ensured seamless transitions with minimal disruption to business operations.
  • Automating repeated task using ansible, shell scripts, python & integrating the same on Rundeck portal for others to use it.
  • On-Call Escalation Management: Acted as the primary point of contact for on-call escalations, ensuring that critical issues and incidents were addressed promptly and effectively to minimize service downtime.
  • API Automation with Python: Developed Python scripts to automate API interactions, including data retrieval, submission, and updates across multiple systems. Streamlined workflows and reduced manual effort by automating repetitive API-based tasks.
  • Job Orchestration & Scheduling: Configured and managed Rundeck & UCO jobs to automate the execution of Python scripts for various tasks, ensuring timely and reliable execution without manual intervention.
  • CLB to ALB Migration: Led the migration of Classic Load Balancers (CLB) to Application Load Balancers (ALB) in AWS, ensuring better scalability, security, and performance for web applications.
  • Automation with Python & Bash: Developed and executed Python and Bash scripts to automate the migration process, including the configuration of new ALB instances, updating DNS records, and validating the application's functionality post-migration.
  • Credential Rotation Automation: Utilized Ansible to automate the credential rotation process for admin users on campaigns and databases, ensuring secure management of sensitive information across environments.
  • Vault Integration: Integrated Ansible with HashiCorp Vault to securely store and manage rotated credentials, ensuring that secrets were dynamically retrieved and updated without exposing sensitive data.
  • Performed root cause analysis for recurring incidents to develop long-term resolutions that prevented future occurrences.
  • Streamlined processes through the development and implementation of automation tools, resulting in improved efficiency.

IT Security Expert & System Administrator

Techblue Software
08.2018 - 03.2022
  • Patching and Application Management: Responsible for the regular patching and maintenance of all running tools, applications, and services across the infrastructure, ensuring systems are up to date and protected against known vulnerabilities. Supporting Linux OS up-gradation & Application upgrade, System patching, Application patching.
  • Email Server Management: Managed and configured email servers for multiple organizations, ensuring secure, reliable, and efficient email communication systems. Administered user accounts, handled server configurations, and optimized performance.
  • LDAP Administration: Oversaw the management and configuration of LDAP (Lightweight Directory Access Protocol) systems, ensuring secure user authentication, access control, and seamless integration with email and other enterprise services.
  • VOIP Configuration: Successfully implemented and configured VOIP systems for UK clients, ensuring high-quality voice communications, optimizing network configurations, and resolving any connectivity or service-related issues.
  • Jitsi Video Calling Implementation: Deployed and managed Jitsi video conferencing solution using Docker, providing clients with a scalable, secure, and cost-effective video calling platform.
  • Strengthened the organization's network security posture by configuring and optimizing the firewall, reducing the risk of external attacks and unauthorized access.
  • Day to day support of the production Infrastructure and Applications.
  • Incident Response & Resolution: Responded to incoming support tickets via chat and phone, addressing client concerns related to system performance, security incidents, and infrastructure issues. Took appropriate actions to resolve technical problems promptly.
  • Day-to-Day Backup Management: Ensured regular backups of critical systems and data, implementing automated backup processes to guarantee data integrity and recovery readiness. Managed backup schedules, monitored backup completion, and resolved any backup failures.
  • Cluster Design & Operations: Designed, deployed, and managed production-grade Kubernetes clusters (self-managed and EKS) for large-scale applications with high availability, resilience, and compliance requirements.
  • Workload Management: Automated deployment, scaling, and monitoring of microservices using Kubernetes Deployments, StatefulSets, and DaemonSets.
  • CI/CD Integration: Integrated Kubernetes with GitLab CI/Jenkins/ArgoCD for GitOps-driven deployments, enabling blue/green and canary release strategies.
  • Infrastructure Monitoring with Zabbix: Utilized Zabbix to monitor the health, performance, and availability of all infrastructure components, including servers, network devices, and applications. Configured monitoring templates, thresholds, and alerts to proactively detect and address issues before they impact operations.
  • Responsible to build deployments on respective servers.
  • Automated Application Setup: Utilized Ansible to automate the deployment and configuration of applications across multiple environments, ensuring consistent and error-free setup for development, testing, and production systems.
  • Containerization of Applications: Containerized applications by defining appropriate Docker images and optimizing their build process to ensure efficient deployment and scalability across environments.
  • Implementation of Open-source GVM Tool: Successfully deployed and configured Greenbone Vulnerability Management (GVM) tool to enhance security risk assessments, ensuring comprehensive identification and remediation of vulnerabilities.
  • Security Risk Reduction: Applied security best practices and threat intelligence to reduce the organization's attack surface, ensuring a robust and secure IT environment.
  • Enhanced the overall security posture by implementing an automated vulnerability management system, reducing manual efforts and increasing efficiency in risk mitigation.
  • Penetration Testing with Burp Suite: Led comprehensive penetration testing engagements using Burp Suite to identify and exploit vulnerabilities in web applications and network infrastructures.
  • Collaboration with Development Teams: Worked closely with developers and IT teams to ensure the proper remediation of vulnerabilities, assisting with patching and secure code practices.
  • Infrastructure Security Management: Actively identified and addressed infrastructure security vulnerabilities, ensuring systems were secure and compliant with organizational policies. Applied patches, implemented security configurations, and mitigated potential risks.
  • Zabbix Monitoring Setup: Implemented and maintained Zabbix for comprehensive infrastructure monitoring, tracking the health, availability, and performance of servers, network devices, and applications. Configured custom templates and thresholds to detect issues proactively.
  • Grafana Visualization: Integrated Zabbix with Grafana to create interactive, real-time dashboards that visualized key system metrics, providing clear insights into the infrastructure's health and performance.
  • Worked with various features of the Version Control System.
  • Managed daily activities to include user support and system administration tasks.
  • IT Security Expert & System Administrator

DevOps Engineer

ScaleMonks Technologies LLP
11.2017 - 07.2018
  • Maintaining Dev environment based on Virtualization (KVM).
  • Creating Instances (Ubuntu/Rhel) on dev environment as per requirement.
  • System Monitoring and Optimization: Regularly monitored disk usage and user activity, providing proactive management to ensure optimal system performance and prevent potential bottlenecks or security risks.
  • Firewall Management (PFSense): Administered and maintained PFSense firewalls to protect organizational networks from external and internal threats. Configured rules and policies to control traffic flow, ensuring secure network access while optimizing performance.
  • Configuration & Customization: Configured instances with the necessary software, tools, and settings, tailoring each environment to meet the specific needs of development teams. Installed required packages, set up system parameters, and ensured proper network configurations.
  • Collaboration with Development Teams: Worked closely with development teams to ensure that environments were correctly set up, troubleshooting and resolving any issues related to instance deployment or configuration.
  • Provided technical support to Linux users.
  • AWS Cloud Management: Managed and optimized cloud resources for clients on AWS, ensuring cost-efficient, secure, and scalable infrastructure tailored to client needs.
  • Automation & Scripting: Leveraged automation tools and scripting (e.g., Ansible, Bash scripts) to streamline the creation and setup of instances, reducing manual intervention and ensuring consistency across environments.
  • CI/CD Integration: Integrated Kubernetes with GitLab CI/Jenkins/ArgoCD for GitOps-driven deployments, enabling blue/green and canary release strategies.
  • Monitoring & Logging: Implemented Prometheus + Grafana dashboards, Alertmanager, and EFK/ELK stack for proactive monitoring, troubleshooting, and SLA compliance.
  • Secrets & Config Management: Managed sensitive configurations using Kubernetes Secrets, ConfigMaps, and external secret managers (Vault, AWS Secrets Manager).
  • DevOps Engineer

Linux System Administrator

Komal Engineers PVT Ltd
09.2015 - 08.2017
  • Installing & managing NFS and samba servers.
  • Installing and maintaining FTP Server along with the bandwidth throttle configured for the regulated control over the bandwidth.
  • LVM Administration: Managed and optimized LVM setups to provide flexible and scalable disk management solutions. Responsible for creating, resizing, and maintaining logical volumes to meet the dynamic storage needs of the organization.
  • User Management: Administered user accounts, permissions, and security policies, ensuring proper access control and compliance with organizational standards. Configured and managed user groups, access rights, and authentication protocols for seamless system access.
  • Configuring SSH for secure connection to the servers.
  • Installation, Configuration, Administration and troubleshooting.
  • Manage Remote Servers.
  • Providing 24
  • 7 supports from VPN.
  • Linux System Administrator

Education

Bachelors of Technology (B.Tech) - Electronics & Communication

Avanthi Institute of Research & Technology
05.2014

12th Grade - undefined

Sri Chaitanya
01.2010

10th Grade - undefined

High school BSEB
01.2005

Skills

Cloud Infrastructure: AWS, Azure

Certification

Red Hat Certified Engineer (RHCE), Fostering Linux, 09/17, Training on RHEL7 (Certificate No.170094379)

Leadership Team Collaboration Skills

  • Commitment & Accountability: Consistently demonstrated a high level of commitment and responsibility for all tasks, ensuring quality results and timely delivery. Took ownership of projects and drove them to completion while maintaining a strong focus on both team and organizational goals.
  • Team Collaboration & Support: Actively collaborated with cross-functional teams to provide technical solutions and valuable suggestions. Fostered a collaborative environment by sharing knowledge, offering insights, and supporting team members in achieving their objectives.

Languages

English
Hindi
Telugu

Timeline

SRE Technical Analyst 2

Adobe Systems
03.2022 - Current

IT Security Expert & System Administrator

Techblue Software
08.2018 - 03.2022

DevOps Engineer

ScaleMonks Technologies LLP
11.2017 - 07.2018

Linux System Administrator

Komal Engineers PVT Ltd
09.2015 - 08.2017

12th Grade - undefined

Sri Chaitanya

10th Grade - undefined

High school BSEB

Bachelors of Technology (B.Tech) - Electronics & Communication

Avanthi Institute of Research & Technology
Mausam SinghSRE DevOps Engineer