PUTTURU RAJA

Tirupati

Summary

Accomplished DevOps Engineer with expertise in Azure DevOps and Kubernetes, previously at EPAM. Enhanced CI/CD processes, reducing deployment time by 50%. Proficient in Docker and cloud platforms, with strong problem-solving skills and effective communication. Adept at driving infrastructure optimization and fostering collaboration in agile environments.

Overview

8 years of professional experience

Work History

DevOps Engineer (L1)

EPAM
Bangalore
09.2023 - 05.2025
  • Built and deployed containerized web applications using Docker and Azure Container Registry (ACR), including writing custom Dockerfiles and managing private image repositories.
  • Implemented end-to-end CI/CD pipelines using Azure Pipelines and GitHub Actions with YAML, improving delivery speed and deployment consistency.
  • Configured monitoring and alerting with Azure Monitor and integrated incident response workflows to support high-availability production systems.
  • Automated container build and deployment processes to Azure Kubernetes Service (AKS), managing Helm charts and YAML manifests for scalable microservices.
  • Used Docker Compose to streamline local development environments and multi-container orchestration.
  • Designed and implemented CI/CD automation using Azure DevOps, resulting in a 20% reduction in code integration and testing time.
  • Scripted routine maintenance and monitoring tasks to reduce manual efforts, saving senior engineers over 10 hours weekly.
  • Documented infrastructure and deployment runbooks, accelerating the onboarding process by 10%.
  • Managed Kubernetes clusters (masters and nodes), handling upgrades, autoscaling, node pooling, and workload deployments.
  • Analyzed and optimized infrastructure performance, resulting in a 20% boost in overall system throughput.
  • Reduced network downtime by 15% through proactive system health checks and automated remediation scripts.
  • Developed internal technical documentation and knowledge bases, increasing intra-team knowledge sharing by 10%.
  • Supported teams with tooling for build pipelines, infrastructure automation, and cloud environment provisioning.
  • Handled L1 support for Azure DevOps services, automation workflows, and Azure Portal management.
  • Conducted knowledge transfer (KT) sessions for new joiners and facilitated internal training workshops.
  • Responded to high-priority ad-hoc requests and ensured disaster recovery and backup strategies were enforced.
  • Created optimized CI/CD pipelines, cutting deployment times by up to 50% through parallelization and caching.
  • Streamlined Docker-based environments to improve deployment reliability and consistency by 30%.
  • Continuously tuned system and application performance, achieving 99.7% application uptime.
  • Deployed automated Azure DevOps pipelines, reducing manual intervention and achieving 25% faster release cycles.
  • Leveraged Azure Monitor for real-time telemetry, significantly reducing mean time to detect (MTTD) and resolve (MTTR) incidents.
  • Collaborated with senior DevOps engineers to deliver infrastructure as code (IaC) projects, contributing to a 15% increase in overall team efficiency.

Site Reliability Engineer (L1)

Encora Innovations Labs Pvt Ltd
Hyderabad
01.2023 - 09.2023
  • Monitored CI/CD pipelines and alerted on build or deployment failures in tools like Jenkins, Azure DevOps, or GitHub Actions.
  • Provided first-level support for deployment issues, build errors, and minor infrastructure incidents.
  • Handled user access management for DevOps tools (Git, Jenkins, SonarQube, Azure DevOps, etc.), including permission requests and role-based access.
  • Performed basic health checks on systems, services, and application logs; escalated critical issues to L2/L3 teams.
  • Restarted failed services, pods, or jobs on Linux servers or Kubernetes clusters as per predefined SOPs.
  • Ran daily monitoring checks on CPU, memory, disk usage, and basic application metrics using tools like Nagios, Prometheus, or Grafana.
  • Assisted in CI/CD job execution by running pipelines on request or troubleshooting script-level issues under supervision.
  • Participated in release and patching windows, performing post-deployment sanity checks.
  • Maintained and updated documentation, including runbooks, incident logs, and knowledge base articles.
  • Provided L1 support for Azure/AWS services (e.g., checking instance status, monitoring storage alerts, simple reboots).
  • Performed routine log analysis using tools like journalctl, kubectl logs, or ELK stack for known issues.
  • Responded to tickets through ITSM tools like ServiceNow, Jira, or Freshservice with timely updates and basic resolutions.
  • Ran and monitored scheduled jobs and cron jobs, checking for failures and reporting anomalies.
  • Executed basic shell or PowerShell scripts to collect system diagnostics or automate simple tasks.
  • Coordinated with L2/L3 engineers during incidents or deployments and participated in war rooms as a support resource.
  • Automated server provisioning and configuration management using Ansible playbooks and roles.
  • Wrote and maintained Shell scripts for daily operational tasks like backups, log rotation, monitoring, and health checks.
  • Designed and executed idempotent Ansible playbooks to ensure consistent configuration across Linux environments.
  • Created custom Ansible roles to modularize infrastructure tasks such as user management, package installation, and service management.
  • Integrated Ansible with CI/CD pipelines (GitHub Actions, Jenkins, Azure DevOps) to automate deployment processes.
  • Developed Shell scripts to automate file transfers, cron job management, and service restarts across multiple servers.
  • Managed inventory files and variable files in Ansible for different environments (dev/stage/prod).
  • Used Ansible Vault to secure sensitive data such as passwords, tokens, and API keys.
  • Performed automated patching and updates across hundreds of Linux servers using Ansible.
  • Created Ansible scripts to automate Docker container creation, network setup, and image pulling.
  • Scheduled Shell scripts using cron for tasks like log cleanup, service pings, and DB backups.
  • Debugged and enhanced existing Shell and Ansible scripts for improved error handling and performance.
  • Created dynamic Ansible inventory scripts for cloud infrastructure (AWS, Azure, etc.).
  • Developed automation to detect anomalies via Shell scripts integrated with monitoring tools (e.g., Prometheus, Splunk).
  • Documented all Ansible playbooks and Shell automation tasks for team collaboration and audit readiness.

DevOps/Sr. Linux System Engineer

Teamware Solutions Pvt Ltd
Hyderabad
04.2022 - 01.2023
  • Managed and administered Linux servers (RHEL, CentOS, SUSE, Ubuntu) in production, staging, and development environments.
  • Performed server provisioning, including OS installation, disk partitioning, LVM setup, and network configurations.
  • Monitored and troubleshot system performance issues using tools like top, htop, iostat, vmstat, netstat, and system logs.
  • Configured and managed file systems, RAID, LVM, and NFS/SMB shares to support scalable storage solutions.
  • Implemented system hardening and security policies (firewalls, SELinux, SSH, sudoers), including patching and vulnerability remediation.
  • Managed user and group access control, LDAP/Active Directory integration, and PAM configurations.
  • Scheduled and managed cron jobs, system backups, and disaster recovery planning using tools like rsync, tar, and bacula.
  • Handled software package management using YUM, DNF, APT, and custom RPM/DEB repositories.
  • Configured and maintained web servers (Apache, NGINX), database servers (MySQL/PostgreSQL), and app servers (Tomcat, WildFly).
  • Led root cause analysis (RCA) for critical system outages and incidents, providing detailed reports and long-term fixes.
  • Performed kernel tuning, OS upgrades, and patch rollouts with minimal downtime across clustered and standalone systems.
  • Designed and implemented high availability (HA) and load-balanced solutions using Pacemaker, Corosync, and keepalived.
  • Deployed and managed automation tools (Ansible, Puppet, or Shell scripts) for configuration management and server provisioning.
  • Collaborated with DevOps/SRE teams to support CI/CD pipeline infrastructure and monitoring tool integrations.
  • Documented all infrastructure procedures, system architectures, and troubleshooting guides for audit and continuity purposes.
  • Performed daily health checks on disk usage, services, logs, and user accounts.
  • Assisted in patch management and routine server updates under senior supervision.
  • Executed user and group management, password resets, and permission changes.
  • Monitored and responded to alerts related to disk, memory, CPU, and application services.
  • Performed routine backup checks and simple restore tasks using rsync, tar, or backup tools.
  • Maintained basic documentation like IP mappings, service logs, and patch reports.
  • Helped configure crontabs, manage scheduled jobs, and monitor failures.
  • Escalated unresolved incidents to senior admins after performing initial triage.
  • Assisted in log analysis and troubleshooting using journalctl, dmesg, or /var/log/ files.
  • Deployed and configured basic services like FTP, SSH, or Apache under guidance.
  • Handled hardware tickets, server reboots, or console checks in physical datacenters.
  • Provided L1 support during application rollouts, monitoring jobs, or infrastructure changes.
  • Participated in KT sessions and cross-functional coordination during maintenance windows.
  • Carried out script execution tasks and learned automation basics from senior engineers.
  • Ensured compliance with ITSM processes (ticket updates, closure, incident tracking).

Sr. Linux System Engineer

Team Lease Pvt Ltd
Hyderabad
05.2021 - 01.2022
  • Managed and administered Linux servers (RHEL, CentOS, SUSE, Ubuntu) in production, staging, and development environments.
  • Performed server provisioning, including OS installation, disk partitioning, LVM setup, and network configurations.
  • Monitored and troubleshot system performance issues using tools like top, htop, iostat, vmstat, netstat, and system logs.
  • Configured and managed file systems, RAID, LVM, and NFS/SMB shares to support scalable storage solutions.
  • Implemented system hardening and security policies (firewalls, SELinux, SSH, sudoers), including patching and vulnerability remediation.
  • Managed user and group access control, LDAP/Active Directory integration, and PAM configurations.
  • Scheduled and managed cron jobs, system backups, and disaster recovery planning using tools like rsync, tar, and bacula.
  • Handled software package management using YUM, DNF, APT, and custom RPM/DEB repositories.
  • Configured and maintained web servers (Apache, NGINX), database servers (MySQL/PostgreSQL), and app servers (Tomcat, WildFly).
  • Led root cause analysis (RCA) for critical system outages and incidents, providing detailed reports and long-term fixes.
  • Performed kernel tuning, OS upgrades, and patch rollouts with minimal downtime across clustered and standalone systems.
  • Designed and implemented high availability (HA) and load-balanced solutions using Pacemaker, Corosync, and keepalived.
  • Deployed and managed automation tools (Ansible, Puppet, or Shell scripts) for configuration management and server provisioning.
  • Collaborated with DevOps/SRE teams to support CI/CD pipeline infrastructure and monitoring tool integrations.
  • Documented all infrastructure procedures, system architectures, and troubleshooting guides for audit and continuity purposes.

Linux Administrator

Steric Info Tech Pvt Ltd
Hyderabad
12.2016 - 05.2021
  • Monitored application and system health using Splunk, setting up dashboards and alerts for log patterns, exceptions, and performance issues.
  • Investigated and resolved L1/L2 incidents related to application availability, API failures, and infrastructure alarms.
  • Wrote and maintained shell scripts for log rotation, service restarts, and health checks across environments.
  • Used Ansible playbooks to automate configuration changes, service restarts, patch deployments, and user provisioning tasks.
  • Performed routine Linux server maintenance, including disk cleanup, log file management, and service status checks.
  • Assisted in CI/CD deployment activities, ensuring pre/post-deployment validation and rollback support if needed.
  • Analyzed Splunk logs to triage application failures, trace backend issues, and identify performance degradation.
  • Managed user access controls, file permissions, and SSH key deployments on Linux systems.
  • Responded to and resolved tickets through ITSM tools like ServiceNow or Jira, escalating when needed.
  • Scheduled and monitored cron jobs, troubleshooting failures and ensuring proper logging.
  • Assisted with package installations, patch updates, and dependency fixes using yum, apt, or Ansible.
  • Maintained and updated knowledge base documents, SOPs, and incident response playbooks.
  • Supported deployment verification, including URL checks, service responses, and log monitoring post-release.
  • Participated in on-call rotations, handling alerts from monitoring systems (Splunk, Grafana, Prometheus).
  • Performed basic networking tasks like DNS resolution, ping/traceroute, and checking firewall or port accessibility.
  • Automated server provisioning and configuration management using Ansible playbooks and roles.
  • Wrote and maintained Shell scripts for daily operational tasks like backups, log rotation, monitoring, and health checks.
  • Designed and executed idempotent Ansible playbooks to ensure consistent configuration across Linux environments.
  • Created custom Ansible roles to modularize infrastructure tasks such as user management, package installation, and service management.
  • Integrated Ansible with CI/CD pipelines (GitHub Actions, Jenkins, Azure DevOps) to automate deployment processes.
  • Developed Shell scripts to automate file transfers, cron job management, and service restarts across multiple servers.
  • Managed inventory files and variable files in Ansible for different environments (dev/stage/prod).
  • Used Ansible Vault to secure sensitive data such as passwords, tokens, and API keys.
  • Performed automated patching and updates across hundreds of Linux servers using Ansible.
  • Created Ansible scripts to automate Docker container creation, network setup, and image pulling.
  • Scheduled Shell scripts using cron for tasks like log cleanup, service pings, and DB backups.
  • Debugged and enhanced existing Shell and Ansible scripts for improved error handling and performance.
  • Created dynamic Ansible inventory scripts for cloud infrastructure (AWS, Azure, etc.).
  • Developed automation to detect anomalies via Shell scripts integrated with monitoring tools (e.g., Prometheus, Splunk).
  • Documented all Ansible playbooks and Shell automation tasks for team collaboration and audit readiness.

Education

B. Tech

JNTU
Anantapur, India
06.2013

Skills

  • Containerization and Orchestration
  • Docker
  • Kubernetes
  • Infrastructure as Code
  • Terraform
  • Ansible
  • CI/CD Pipeline Implementation
  • Linux
  • GitLab CI
  • Cloud Platforms
  • AWS
  • Azure
  • GCP
  • Scripting and Automation
  • Python
  • Bash
  • Version Control Systems
  • Git
  • GitHub
  • Monitoring and Logging
  • Prometheus
  • ELK Stack
  • Agile Methodologies and DevOps Practices
  • Effective Communication and Collaboration
  • Problem-solving and Troubleshooting
  • Continuous Learning and Adaptability
  • Time Management and Prioritization
  • AI-Driven Infrastructure Optimization
  • Quantum-Safe Cryptography Implementation
  • Azure DevOps
  • ITIL
  • Application Support
  • Log Analysis
