Siva Billa

Summary

Site reliability engineer with over 3 years of experience in CI/CD, cloud infrastructure, and production support across development, testing, staging, and production environments. Expertise in automating build, deployment, and release processes using GitHub Actions and Azure DevOps. Proficient in cloud platforms such as Azure and GCP, with hands-on experience in container orchestration using Kubernetes and Rancher. Strong skills in infrastructure automation with Terraform, Ansible, Chef, and Puppet, alongside scripting in Bash/Shell.

Overview

3 years of professional experience

Work History

Site Reliability Engineer (SRE)

Quant Cloud Solutions Pvt. Ltd. (Sonata Software)
03.2025 - 10.2025
  • Monitored and optimized cloud workloads using IBM Turbonomic to enhance resource utilization and reduce infrastructure costs.
  • Implemented proactive monitoring and alerting strategies to ensure high availability of production systems.
  • Conducted root cause analysis for incidents, instituting preventive measures to mitigate recurrence.
  • Troubleshot performance bottlenecks in Azure and GCP cloud infrastructure, ensuring system stability.
  • Contributed to capacity planning and performance tuning efforts for optimal system efficiency.
  • Maintained system uptime while adhering to SRE principles including reliability, observability, and automation.
  • Documented runbooks, SOPs, and incident resolutions to promote operational excellence and knowledge sharing.

Site Reliability Engineer (SRE)

NAYAGARA TECHNOLOGIES LIMITED
11.2023 - 02.2025
  • Monitored production systems using Datadog, ensuring high availability, performance, and reliability
  • Managed incidents and alerts, and performed root cause analysis (RCA) to minimize downtime
  • Supported Kubernetes environments (AKS/GKE), handling deployments, scaling, and troubleshooting
  • Implemented and maintained CI/CD pipelines using GitHub Actions and Azure DevOps
  • Automated infrastructure provisioning and configuration using Terraform and Ansible
  • Performed system performance tuning and capacity planning to improve resource utilization
  • Managed cloud infrastructure on Azure and GCP, including networking, security, and compute resources
  • Configured monitoring, logging, and alerting using Datadog, Azure Log Analytics, and GCP tools
  • Handled Linux/Windows server administration and automated routine tasks using Bash/Shell scripting
  • Collaborated with cross-functional teams to ensure SLA compliance and continuous service improvement

Skills: SRE, Kubernetes, Datadog, Azure, GCP, Terraform, CI/CD, Incident Management, Monitoring, Automation

Site Reliability Engineer

IDC Technologies Pvt. Ltd. (TCS)
06.2022 - 10.2023
  • Designed and implemented scalable, secure, and high-availability cloud infrastructure using Azure, GCP, and Kubernetes (AKS/EKS)
  • Built and maintained CI/CD pipelines using Azure DevOps, ensuring reliable and automated deployments across environments
  • Integrated SonarCloud for code quality checks and improved release standards
  • Automated infrastructure provisioning using Terraform and CloudFormation
  • Optimized cloud resources and reduced costs using Turbonomic and Kubernetes scaling strategies
  • Monitored systems using Datadog, ensuring performance, availability, and proactive issue detection
  • Managed containerization using Docker and deployed applications using Kubernetes and Helm
  • Installed and configured Sealed Secrets for secure Kubernetes secret management
  • Managed JFrog Artifactory for container image storage and deployment pipelines
  • Troubleshot build and deployment failures, ensuring smooth release processes across DEV/TEST/STAGE/PROD
  • Performed cluster health checks and maintained Kubernetes environments using Azure Portal and GCP Console
  • Managed Rancher for centralized Kubernetes cluster orchestration
  • Handled incidents and service requests via ServiceNow, ensuring SLA compliance
  • Collaborated with cross-functional teams to resolve issues and improve system reliability
  • Supported Agile/Scrum processes, participating in sprints, stand-ups, and continuous improvement initiatives

Skills: SRE, Kubernetes, Azure, GCP, Terraform, CI/CD, Datadog, Docker, Rancher, Turbonomic, ServiceNow

Education

Bachelor's - Mechanical Engineering

Rise Krishna Sai Gandhi Groups of Institutions
03.2019

Skills

  • GitHub
  • GitLab
  • CI/CD
  • Azure DevOps
  • Linux
  • Confluence
  • Datadog
  • Splunk
  • JFrog Artifactory
  • SonarQube
  • Veracode
  • Azure
  • GCP
  • MongoDB
  • Docker
  • Kubernetes
  • Rancher
  • AKS
  • GKE
  • Terraform
