Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic
Pramod Kumar

Pramod Kumar

Site Reliability Engineer / DevOps Engineer
S-120/44, Vivekanand Camp Part-2, Chanakya Puri, New Delhi

Summary

Adept at driving operational excellence, I leveraged Kubernetes and cross-team collaboration to enhance system efficiency at Adobe Inc. Specializing in Docker and incident management, my proactive approach resulted in scalable solutions and significant cost savings, showcasing a blend of technical prowess and team synergy.

Overview

11
11
years of professional experience
4
4
Certifications

Work History

Site Reliability Engineer /DevOps Engineer

Adobe Inc.
06.2022 - Current
  • Migrated legacy systems to modern platforms for increased efficiency and reduced maintenance costs.
  • Streamlined infrastructure management through automation using industry-leading tools such as Ansible, Kubernetes, and Terraform.
  • Contributed to architectural decisions, ensuring scalable and maintainable designs for future expansions of application services.
  • Standardized development environments using containerization technologies like Docker, resulting in consistent deployments across various platforms.
  • Conducted regular performance assessments of application infrastructure, identifying bottlenecks and making necessary optimizations to enhance service quality.
  • Maintained metrics visibility using Datadog and Prometheus/Grafana to create useful dashboards and monitors.
  • Managed AWS assets and integrated multiple AWS resources into solutions appropriate for company projects.
  • Automated daily backups of critical data using AWS S3, reducing the risk of data loss in case of system failures.
  • Developed and maintained CI/CD pipelines using Jenkins, increasing deployment speed and reliability.
  • Automated repetitive tasks using Python scripts, increasing efficiency within the team''s workflow processes.
  • Evaluated new technologies and tools to enhance overall system performance, stability, and security.
  • Developed custom scripts/tools as needed to automate routine tasks, increasing overall team productivity and efficiency.

Site Reliability Engineer

VARITE (Adobe Third Party)
05.2019 - 05.2022
  • Evaluated new technologies and tools to enhance overall system performance, stability, and security.
  • Developed custom scripts/tools as needed to automate routine tasks, increasing overall team productivity and efficiency.
  • Collaborated with cross-functional teams to develop, test, and deploy scalable software solutions.
  • Improved incident management workflows by creating comprehensive documentation on troubleshooting procedures and common issues resolution steps.
  • Conducted root-cause analyses after major incidents to identify areas for process improvement or technical enhancement opportunities.
  • Implemented cost-saving measures by optimizing resource utilization across cloud-based infrastructure environments.
  • Contributed to the ongoing refinement of internal processes and procedures within the site reliability engineering discipline through regular reviews, updates, and knowledge sharing activities.
  • Ensured compliance with relevant industry regulations regarding data privacy standards by actively participating in audits assessments.
  • Fostered collaboration between development and operations teams through effective communication strategies during project lifecycles.
  • Managed capacity planning efforts to ensure optimal resource allocation based on current demand projections and future growth expectations.
  • Streamlined incident management procedures, reducing resolution times and improving customer satisfaction.
  • Enhanced system reliability by developing and implementing comprehensive monitoring solutions across all platforms.
  • Reduced system downtime, troubleshooting critical issues and applying timely fixes.
  • Improved deployment efficiency, automating processes using CI/CD pipelines.
  • Improved service reliability, meticulously documenting system architectures and maintenance procedures.

Cloud Operation Engineer

To The New
11.2018 - 05.2019
  • Enhanced cloud infrastructure performance by implementing effective monitoring and alerting systems.
  • Provided timely technical support to users experiencing issues with cloud-based applications or infrastructure components.
  • Improved network security by configuring and managing firewalls, access controls, and encryption protocols.
  • Implemented robust backup strategies to minimize data loss risk in case of system failure or accidental deletion incidents.
  • Participated in on-call rotations as required, providing after-hours support to ensure system stability and swift resolution of critical incidents.
  • Contributed to budget planning efforts by analyzing resource usage trends and recommending cost-effective solutions for scaling up or down resources as needed.
  • Educated non-technical team members about cloud operations concepts, best practices, and the benefits of adopting a cloud-first approach to IT services delivery.
  • Assisted in the development of comprehensive documentation for cloud infrastructure design, implementation procedures, and best practices.
  • Maintained high availability of critical systems through proactive health checks, redundancy solutions, and disaster recovery planning.
  • Assisted in capacity planning efforts by regularly monitoring resource utilization and forecasting future requirements based on historical trends and projected growth rates.
  • Evaluated new technologies relevant to cloud operations engineering in order to recommend potential enhancements or upgrades that would improve overall system performance or costefficiency.
  • Used metrics to monitor application and infrastructure performance.
  • Identified, analyzed and resolved infrastructure vulnerabilities and application deployment issues.
  • Coordinated deployments of new software, feature updates and fixes.
  • Corrected, modified and upgraded software to improve performance.

Linux Specialist

HCL Technologies
11.2016 - 11.2018
  • Optimized network connectivity between geographically distributed data centers using load balancing technologies such as HAProxy and Nginx.
  • Enhanced server performance by optimizing Linux system configurations and conducting regular maintenance.
  • Evaluated and recommended emerging technologies to stay current with industry best practices, maintaining a competitive edge.
  • Reduced downtime by proactively identifying and resolving potential issues through monitoring tools and log analysis.
  • Increased infrastructure scalability with the implementation of virtualization technologies such as VMware and KVM.
  • Conducted comprehensive capacity planning to support business growth, ensuring adequate resources were available when needed.
  • Contributed to cost savings by evaluating vendor offerings and negotiating favorable contract terms for hardware purchases.
  • Collaborated with cross-functional teams to ensure seamless integration of new servers into existing networks.
  • Facilitated successful migrations from legacy systems to modern Linux-based platforms, minimizing disruption to business operations.
  • Safeguarded company information by implementing advanced security measures, including intrusion detection systems and firewalls.
  • Ensured data integrity by implementing robust backup strategies, including offsite storage solutions.
  • Improved overall system stability with thorough testing of software patches before deployment on production environments.
  • Streamlined workflow for the IT team by automating routine tasks using shell scripting and Python.
  • Developed custom monitoring dashboards using Grafana, providing real-time insights into system health for informed decisionmaking.
  • Standardized server provisioning processes using configuration management tools like Ansible, Puppet or Chef.
  • Assisted in the planning and execution of disaster recovery exercises, ensuring teams were prepared for unforeseen events.
  • Collaborated with developers to troubleshoot application issues that stemmed from underlying system problems.
  • Maintained high availability of web applications by configuring and managing Apache HTTP Server and Nginx reverse proxy servers.
  • Installed important security and functionality patches to maintain optimal protections against intrusion and system reliability.
  • Worked with users to determine areas of technology in need of improved usability.
  • Resolved issues and escalated problems with knowledgeable support and quality service.
  • Diagnosed and executed resolution for network and server issues.
  • Maintained flexible schedule and responded to after-hours and weekend emergencies.

Linux Administrator

Stunner IT Solution Pvt. Ltd.
03.2014 - 10.2016
  • Installed system-wide hardware components, confirming interoperation and compatibility with Linux-based software distros.
  • Contributed to the successful completion of complex projects, providing expert Linux administration support throughout project lifecycles.
  • Maintained high availability of critical systems by proactively addressing issues and conducting regular backups.
  • Collaborated with cross-functional teams to design and implement infrastructure improvements, leading to enhanced system stability.
  • Developed organization-wide administration policies to encourage continuity across multiple systems and facilities.
  • Worked closely with development teams to optimize application performance through proper tuning of underlying Linux systems.
  • Evaluated new technologies and made recommendations for their inclusion in future infrastructure upgrades or replacements.
  • Supported virtualization platforms, enabling efficient resource utilization within the IT infrastructure.
  • Implemented effective monitoring solutions, allowing for rapid identification and resolution of potential issues before they impacted endusers.
  • Enhanced system security through regular vulnerability assessments and timely patch management.
  • Provided technical guidance to junior team members, fostering a collaborative learning environment within the department.
  • Created detailed documentation on system configuration and troubleshooting guides for reference by both internal staff and external vendors.
  • Coordinated cross-site installation of networked systems, confirming post-install connectivity.
  • Simplified user access management by implementing centralized authentication mechanisms using LDAP or Active Directory integration.
  • Optimized server performance by implementing efficient system configurations and monitoring tools.
  • Developed custom automation scripts, significantly reducing time spent on routine administrative tasks.

Education

Bachelor of Arts - Arts

University of Delhi
New Delhi, India
04.2001 -

Skills

Kubernetes (Cluster Management, Helm Charts, Operators)

Docker (Container Creation, Multi-stage Builds, Security Best Practices)

Terraform (State Management, Modules, Cloud Integrations)

Ansible (Playbooks, Inventory Management, Roles)

CloudFormation

AWS (EC2, S3, RDS, Lambda, CloudWatch,route53,RDS)

Python (Automation Scripts)

Go (Microservices, Kubernetes Plugins)

Jenkins (Pipeline as Code, Plugins,GitLab CI/CD)

Incident Management and Postmortems

Adobe Campaign

Certification

Red Hat Certified System Admin (RHEL 7)

Timeline

Site Reliability Engineer /DevOps Engineer

Adobe Inc.
06.2022 - Current

Site Reliability Engineer

VARITE (Adobe Third Party)
05.2019 - 05.2022

Cloud Operation Engineer

To The New
11.2018 - 05.2019

Linux Specialist

HCL Technologies
11.2016 - 11.2018

Linux Administrator

Stunner IT Solution Pvt. Ltd.
03.2014 - 10.2016

Bachelor of Arts - Arts

University of Delhi
04.2001 -
Pramod KumarSite Reliability Engineer / DevOps Engineer