Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

SivaRama Prasad Pilla

Cloud Architect
Dallas

Summary

With over 16 years of experience in IT infrastructure and system administration, I specialize in managing and automating cloud and on-premises environments. I have hands-on expertise in AWS (EC2, VPC, IAM, S3), Terraform automation, Linux systems (RHEL, CentOS, Ubuntu), VMware virtualization, and CI/CD pipelines using GitLab, Docker, and Kubernetes. My background includes kernel tuning, system optimization, disaster recovery, storage (SAN, NAS, NetApp, EBS), and network configuration. I’m also skilled in infrastructure monitoring (Nagios, CloudWatch), and have a strong track record of collaborating with cross-functional teams to streamline deployments and enhance system reliability. Known for my adaptability, documentation practices, and problem-solving approach, I ensure stable, secure, and scalable IT operations. Experienced with designing and implementing scalable cloud architectures. Utilizes advanced cloud technologies to streamline operations and improve efficiency. Strong understanding of multi-cloud environments and infrastructure automation.

Overview

18
18
years of professional experience
5
5
Certifications

Work History

Enterprise Tech – Cloud Architect

LiveMindz
09.2024 - Current
  • Automated AWS infrastructure provisioning (EC2, RDS, IAM, VPC, S3, Security Groups) using reusable Terraform modules integrated with GitLab CI/CD and CodePipeline, reducing manual effort and increasing deployment speed.
  • Managed secure infrastructure by integrating AWS KMS and Secrets Manager for managing sensitive data and secrets securely across environments.
  • Provisioned and maintained EKS clusters using Terraform and CloudFormation, including non-disruptive version upgrades and patching, ensuring continuous application availability.
  • Configured Kubernetes RBAC, service accounts, and network policies to enforce least-privilege access controls and improve workload security across namespaces.
  • Implemented GitLab CI pipelines for end-to-end multi-environment deployments (dev to prod) with dynamic variables, approval gates, and reusable YAML templates, streamlining release cycles.
  • Enforced CI/CD governance by implementing best practices in branching strategies, merge request approvals, and code quality checks, improving consistency and reducing deployment failures across teams.
  • Built and deployed containerized microservices on EKS using Helm and Kustomize, ensuring consistent and repeatable environment provisioning across dev, staging, and production.
  • Monitored Kubernetes clusters using Prometheus and Grafana, fine-tuning alerts and dashboards to ensure optimal performance and availability.
  • Troubleshot pod failures, crash loops, and network issues using kubectl, container logs, and events, reducing mean time to recovery (MTTR) during production outages.
  • Spearheaded large-scale server migrations from on-premises (BareMetal/OpenStack) to AWS cloud, optimizing for high availability, cost-efficiency, and scalability.
  • Deployed and maintained AWS resources via Terraform integrated with GitLab CI/CD, achieving full IaC adoption and infrastructure versioning.
  • Managed AWS Secrets Manager configurations for RDS and implemented disaster recovery setups, including replicas in a secondary region for fault tolerance.
  • Performed Kubernetes resource optimization by tuning resource requests/limits, HPA (Horizontal Pod Autoscaler), and pod disruption budgets, improving cluster efficiency and workload stability.
  • Led a DevOps team, providing technical direction, defining objectives, and mentoring team members to align efforts with business goals.
  • Planned and prioritized team workload, conducted performance evaluations, and facilitated professional development initiatives to strengthen team capabilities.
  • Collaborated with project managers, developers, and QA leads to synchronize release planning, resource allocation, and risk mitigation.
  • Established operational best practices, standardized processes, and conducted regular reviews to ensure SLAs and KPIs were consistently met or exceeded.
  • Acted as a primary point of contact for clients during high-priority incidents, ensuring timely updates, stakeholder coordination, and post-incident follow-ups.
  • Led incident response and root cause analysis (RCA) activities, documenting findings in Confluence and driving long-term remediation strategies to improve system resilience.

Lead Infrastructure Engineer

CloudBerg Tec
09.2023 - 06.2024
  • Designed and implemented CI/CD pipelines integrated with Kubernetes (EKS/OpenShift) and Jenkins, achieving a 40% reduction in deployment times and a 25% improvement in release stability through automation and standardization.
  • Developed and deployed Helm charts for Kubernetes-based applications, enabling environment-specific configurations and version-controlled rollouts.
  • Built custom Kubernetes operators using CRDs (Custom Resource Definitions) to automate application lifecycle management and enforce consistency across environments.
  • Managed Kubernetes cluster upgrades, including drain and cordon operations, workload rescheduling, and disruption budget tuning to ensure zero-downtime rollouts.
  • Deployed and maintained observability solutions using Prometheus, Grafana, and metrics-server, providing real-time cluster health visibility and proactive alerting.
  • Implemented fine-grained RBAC policies, PodSecurityPolicies, and Kubernetes network policies to meet compliance standards like ISO 27001 and NIST.
  • Enabled secret management by integrating Kubernetes with external secret stores (e.g., AWS Secrets Manager, HashiCorp Vault) for secure credentials handling.
  • Monitored and optimized resource usage across pods and nodes by adjusting CPU/memory limits, autoscaling (HPA), and identifying overprovisioned workloads.
  • Troubleshot pod failures, CrashLoopBackOffs, and DNS/networking issues using kubectl, container logs, and Kubernetes events, reducing incident MTTR.
  • Facilitated multi-tenant cluster architecture with separate namespaces, network isolation, and resource quotas to support parallel dev/test/staging workloads.
  • Led a team of engineers in managing Kubernetes-based deployments, ensuring adherence to deployment patterns, rollback procedures, and best practices.
  • Provided Kubernetes training sessions, hands-on labs, and mentoring to upskill junior engineers and operational teams on containerization and orchestration.
  • Maintained detailed Kubernetes runbooks, SOPs, Helm deployment instructions, and RCA documentation to ensure knowledge continuity and audit readiness.
  • Integrated GitOps workflows using ArgoCD and FluxCD to streamline Kubernetes deployments and improve traceability and rollback capabilities.
  • Supported disaster recovery strategies by designing cross-region EKS backup plans, restoring etcd snapshots, and validating failover procedures.
  • Acted as the SME for Kubernetes infrastructure, providing architectural guidance and production support across dev, QA, and ops teams.

Senior Infrastructure Engineer

ENEA AdaptiveMobile Security
04.2020 - 08.2023
  • Administered and maintained Kubernetes clusters in production environments, ensuring uptime, performance, and security through regular patching, upgrades, and log analysis.
  • Managed Kubernetes resources like Deployments, Services, Ingress, ConfigMaps, and Secrets, enabling microservices architecture and deployment strategies such as canary and blue-green rollouts.
  • Maintained cloud-native applications using Docker containers and deployed them on Amazon EKS, integrating with services like RDS, S3, IAM, CloudTrail, and Route53.
  • Designed and implemented CI/CD pipelines using GitLab and Jenkins, automating application builds, tests, and deployments across multi-environment workflows (Dev, QA, Prod).
  • Built and managed Helm charts for consistent, reusable Kubernetes deployments and version control of applications.
  • Implemented GitOps practices using tools like ArgoCD for declarative Kubernetes deployments and automatic syncing from Git repositories.
  • Created and maintained Kubernetes CronJobs, InitContainers, and Lifecycle Hooks for backup, cleanup, and readiness operations.
  • Defined and enforced RBAC policies, Network Policies, and Pod Security Standards to meet compliance frameworks such as ISO 27001 and NIST.
  • Troubleshot and resolved production issues involving Pods, PVCs, CNI plugins, and evicted containers, reducing service disruption and improving MTTR.
  • Enabled Horizontal Pod Autoscaling (HPA) and optimized container performance using appropriate resource limits and requests.
  • Optimized Docker images with multi-stage builds, caching layers, and vulnerability scanning using tools like Trivy and Clair.
  • Managed container image repositories (ECR, Harbor), implementing tagging, retention, and access control policies.
  • Integrated observability solutions using Prometheus, Grafana, CloudWatch, and Elasticsearch, enabling real-time monitoring, alerting, and SLA compliance.
  • Automated AWS infrastructure provisioning using Terraform, covering EC2, VPCs, Load Balancers, IAM, S3, and Auto Scaling Groups, while eliminating configuration drift.
  • Used Terraform modules and Git version control to build scalable, reusable infrastructure code across multiple environments.
  • Managed incident response, change management, and root cause analysis (RCA) processes, including postmortem reviews and continuous improvement.
  • Documented deployment, rollback, DR procedures, and SOPs in Confluence, and handled change and incident tracking via JIRA.
  • Led a DevOps and Infrastructure team, assigning responsibilities, tracking tasks, and delivering infrastructure solutions in line with business timelines.
  • Mentored junior engineers, performed peer reviews, and established automation and infrastructure quality standards.
  • Facilitated Agile ceremonies (stand-ups, sprint planning, retrospectives) to coordinate work across engineering and operations teams.
  • Worked with InfoSec teams to audit and harden Kubernetes clusters, applying CIS Benchmarks and enforcing node-level security policies.
  • Coordinated with vendors and internal teams for licensing, procurement, and capacity planning, aligning with IT budgets and project requirements.
  • Fostered a culture of transparency, accountability, and continuous learning, encouraging knowledge sharing and innovation within the team.
  • Supported hybrid environments, including on-prem VMs and remote data centers, integrated Veeam backup, and enforced disaster recovery (DR) strategies.
  • Administered network devices (DELL N2048P), managed VLANs and port configurations, and maintained Office 365 and 2FA solutions (SafeNet, FortiToken).
  • Contributed to cost optimization efforts by negotiating hardware/software contracts and consolidating vendor services.

Mar 2016 – Senior IT System Administrator
Mar 2020

  • Acted as the primary IT point of contact, handling system administration, infrastructure support, and vendor coordination.
  • Performed Linux OS installation, upgrades, patching, and troubleshooting; managed LVM-based storage and documented system configurations.
  • Supported engineering and dev teams across OpenStack, Linux, Windows, macOS, and Office 365 platforms.
  • Managed virtualization platforms (ESXi, KVM, Hyper-V) and VM backups; coordinated with global datacenters for hosted server support.
  • Provisioned and managed AWS services (EC2, RDS, S3, VPC, IAM, ELB, Route53); implemented automation and monitoring via CloudWatch.
  • Configured IAM roles, security groups, and firewall/VLAN policies; experience with Checkpoint firewalls and NetApp Ontap storage.
  • Participated in ISO 27001 audits: handled policy exceptions, security hardening, and compliance practices.
  • Used tools like JIRA, Remedy, Qualys Guard, Nagios, and Tenable for support, ticketing, and security monitoring.
  • Led system upgrades and migrations, ensuring high availability with minimal downtime.
  • Managed hybrid infrastructure across on-prem and AWS for scalability and resilience.
  • Automated admin tasks using Bash, Python, and Ansible to improve efficiency.
  • Administered AD, DNS/DHCP, and Group Policies in multi-site environments.
  • Oversaw patching, vulnerability fixes, and OS lifecycle management.
  • Implemented centralized logging and monitoring using ELK stack.
  • Conducted RCA for incidents and enforced preventive measures.
  • Maintained endpoint security with antivirus, DLP, and encryption tools.
  • Mentored junior staff and created detailed SOPs and runbooks.

System Administrator Unix and Storage

Polycom Technologies (R&D) Center Pvt. Ltd
02.2015 - 02.2016
  • Designed and implemented reliable backup and disaster recovery strategies across hybrid environments.
  • Optimized system performance by enhancing hardware, software, and network configurations.
  • Automated routine system administration tasks using shell scripts, improving operational efficiency.
  • Administered, configured, and fine-tuned Linux and Solaris systems for high availability and performance.
  • Integrated kernel-level security controls to ensure system integrity and mitigate vulnerabilities.
  • Deployed and managed OpenStack environments for scalable private and public cloud infrastructure.
  • Proficient in VMware virtualization — including VM provisioning, templating, and snapshot lifecycle management.
  • Managed secure FTP services and automated file transfers to streamline workflows.
  • Worked with Solaris LDAP for centralized authentication and directory services.
  • Collaborated with hardware vendors for incident resolution, upgrades, and replacements.

Senior Systems Engineer/ Linux Engineer

Nisum Technologies
03.2013 - 01.2015
  • Delivered efficient IT support via email and ticketing systems (JIRA), ensuring timely issue resolution and high user satisfaction.
  • Designed and implemented Nagios monitoring to proactively track system and network health, improving uptime and reliability.
  • Automated build and deployment processes using Git, Docker, Kubernetes, and CI/CD pipelines in AWS, enabling standardized multi-environment delivery.
  • Developed and integrated custom kernel modules and device drivers, ensuring compatibility and performance across Linux systems.
  • Diagnosed and resolved kernel-level issues, delivering quick fixes and maintaining OS stability.
  • Acted as the SME for Linux administration (RedHat, CentOS, Ubuntu), handling system configuration, patching, and troubleshooting.
  • Managed VMware vSphere 5.1 environments for efficient server virtualization and resource allocation.
  • Created Bash automation scripts for routine tasks like database backups, enhancing reliability and reducing manual effort.
  • Performed application and system deployments in Linux environments, ensuring consistent and error-free releases.

Linux Engineer (L2)

RealPage India Pvt Ltd
05.2012 - 09.2012
  • Extensive hands-on experience in installing, configuring, and administering Red Hat Linux systems, managing daily system operations and maintenance tasks.
  • Managed user accounts, permissions, disk quotas, and monitored processes to ensure system security and performance.
  • Performed system upgrades, kernel tuning, HBA driver installations, SAN configuration, multi-pathing, and LVM setup in Red Hat environments.
  • Deep expertise in RHEL 5.x and above, including kernel optimization for performance, resource utilization, and security hardening.
  • Deployed and maintained Ansible for configuration management across multiple environments within the AWS cloud.
  • Proficient in VMware infrastructure management—creating, cloning, and maintaining VMs for scalable and reliable virtualization.
  • Experienced in troubleshooting and optimizing Apache and Tomcat web servers to ensure web application availability and stability.
  • Strong background in Logical Volume Management (LVM), enabling flexible and efficient storage allocation andresizing.
  • Developed and debugged Shell scripts to automate routine tasks, improving system efficiency and reducing manual intervention.
  • Utilized enterprise-grade monitoring tools like Nagios for proactive system monitoring and issue resolution.
  • Collaborated with DB engineers to support code deployments, highlighting cross-functional teamwork in application lifecycle management.
  • Managed OS patching and updates via RHN Satellite Server, maintaining system security and compliance.

Associate Software Engineer

Capgemini India Pvt Ltd
12.2010 - 02.2012
  • Installed and configured numerous Red Hat Enterprise Linux (5.x/6.x) and Windows Server (2003/2008 R2) systems across physical blade servers and VMware ESXi (4.1.2 & 5) virtual environments.
  • Extensive experience with Logical Volume Manager (LVM) for creating and managing volume groups, logical volumes, and disk mirroring.
  • Deployed and managed patches, upgrades, and bug fixes on physical and virtual Red Hat Linux servers using Satellite Server and YUM repositories.
  • Enhanced endpoint security by implementing Sophos antivirus solutions on Dell and Lenovo laptops.
  • Managed and troubleshot Avaya video conferencing systems via IP and ISDN protocols to ensure seamless communication.
  • Skilled in configuring and troubleshooting email clients including Microsoft Outlook and Lotus Notes.
  • Provided hardware support for desktops, laptops (Dell, IBM, Lenovo), and HP printers, ensuring quick resolution of technical issues.
  • Performed Linux OS installations, package management, and system troubleshooting to maintain system stability and performance.
  • Resolved remote VPN connectivity issues on Linux platforms, with hands-on experience in Samba and NFS network file sharing.
  • Administered Active Directory tasks including user account management, distribution list creation, and desktop/laptop provisioning to maintain access control.
  • Facilitated software deployment and updates through Corporate Directory, streamlining software distribution processes.

L2 Support Engineer

Texas Instruments (3I Infotech)
11.2009 - 12.2010
  • Experienced in Active Directory user authentication, providing support for email clients such as Outlook and Thunderbird, and troubleshooting basic network connectivity issues to ensure seamless user operations.
  • Skilled in diagnosing and resolving Windows OS issues, including password resets and managing Symantec Antivirus for endpoint security.
  • Established and enforced standardized baseline configurations for desktops and laptops to maintain security and consistency across all devices.
  • Led kernel development initiatives to enhance system performance, stability, and functionality, collaborating with cross-functional teams to optimize core OS components—resulting in faster boot times, improved data throughput, and better overall responsiveness.
  • Coordinated warranty verifications and managed service requests for Dell and Lenovo hardware, ensuring prompt resolution of technical issues.
  • Administered patch management, installed and supported both licensed and unlicensed software, ensuring compliance and smooth operation.
  • Proficient in backup and restore operations, including resolving encryption and decryption challenges with Pointsec, maintaining data protection standards.

GIS Engineer

Satyam BPO(Expera)
09.2007 - 10.2009
  • Optimized workflows in AutoCAD and Nipuna CAD enhance digitization efficiency, leading to significant time and cost savings in infrastructure projects.
  • Enforced strict quality control measures to ensure high accuracy of digitized road and building data, reducing errors and inconsistencies in GIS datasets.
  • Collaborated closely with project teams, architects, and engineers to convert their designs into precise digital models, ensuring the digital data accurately reflected real-world infrastructure.
  • Improved project documentation by integrating detailed digital maps and drawings, providing clearer insights for stakeholders and aiding decision-making.
  • Adapted to diverse project scales by successfully digitizing a range of infrastructure elements, from small structures to large road networks, ensuring flexibility and efficiency across projects.

Education

Bachelor’s - Information Technology

Jawaharlal Nehru Technological University

Skills

DevOps & Automation

CI/CD Strategy & Implementation

Data Center Administration

Infrastructure Management

Cloud Migration & Transformation

Leadership & Collaboration

Backup, Disaster Recovery & High Availability

API Gateway & Service Integration

Infrastructure Lifecycle Management

Documentation and Knowledge Sharing

Networking and Firewalls

Tooling & Platform Expertise

Change and Release Management

Incident Management

System Administrator

Service Reliability and Performance Optimization

Cloud Services: AWS, Azure

Containerization Knowledge: Docker and Kubernetes

Operating Systems: RedHat(6/7/8/9), Ubuntu, Windows2012R2/2019, 10/11, Mac

Cloud and Collaboration Tools: AWS, Azure, IBM SoftLayer, Office 365, NetApp Ontap

Support tools: JIRA, Remedy, Confluence

Virtualization: VMware, Hyper-V, KVM

Monitoring and Tools: Nagios, Tenable, LAN Sweeper, Xymon, Nessus Scan, Splunk

Authentication Management: Safe Net, Forti token, Microsoft and Google Authentication

Hardware: HP ProLiant servers, DELL PowerEdge Servers, DELL Switches

Cloud architecture design

DevOps methodologies

Infrastructure as Code

Containerization technologies

Continuous integration, continuous deployment

Network configuration

High availability design

Cost management

Automation tools

Team Training

Excellent communication

Adaptability and flexibility

Code development

Certification

AWS Certified Solutions Architect - Associate

Timeline

Enterprise Tech – Cloud Architect

LiveMindz
09.2024 - Current

Lead Infrastructure Engineer

CloudBerg Tec
09.2023 - 06.2024

Senior Infrastructure Engineer

ENEA AdaptiveMobile Security
04.2020 - 08.2023

System Administrator Unix and Storage

Polycom Technologies (R&D) Center Pvt. Ltd
02.2015 - 02.2016

Senior Systems Engineer/ Linux Engineer

Nisum Technologies
03.2013 - 01.2015

Linux Engineer (L2)

RealPage India Pvt Ltd
05.2012 - 09.2012

Associate Software Engineer

Capgemini India Pvt Ltd
12.2010 - 02.2012

L2 Support Engineer

Texas Instruments (3I Infotech)
11.2009 - 12.2010

GIS Engineer

Satyam BPO(Expera)
09.2007 - 10.2009

Bachelor’s - Information Technology

Jawaharlal Nehru Technological University
SivaRama Prasad PillaCloud Architect