Experienced CloudOps/DevOps Engineer with 8+ years in optimizing IT infrastructure for both Cloud and On-Premise environments. Proven track record in managing successful SaaS product cloud Operations, specializing in Observability and Proactive Mitigation using advanced DevOps Tools. Committed to operational excellence, continuous improvement, and delivering impactful results by staying continuously learning.
Overview
9
9
years of professional experience
1
1
Certification
Work History
Senior Software Engineer ( SRE )
Arctera ( Formerly Veritas ) | CloudOps Team
12.2024 - Current
Working in the Global SRE Team to ensure reliability & availability of the SaaS Product - Arctera Insights supporting 4,000+ customers for their data protection, compliance, and lifecycle management for legal and analytical operations.
Azure Infrastructure Optimization and Leveraging Infrastructure as Code (IaC), CD pipelines, and Azure DevOps services to enhance platform scalability, performance, and cost-efficiency.
Improving monitoring, observability of infrastructure, application & platform using multiple Monitoring Tools tools and proactively monitoring, fixing the issues related to Infrastructure, Platform and product.
Collaborating with engineering and QA teams to address highly escalated customer issues and deployed hotfixes.
Cost Optimization by working with cost center and platform engineering to analyze, research, and plan the required changes in Azure infrastructure, and platform to reduce Azure spending by optimizing resource usage across services.
Identifying and resolving complex technical issues to enhance software functionality and platform reliability.
Maintaining detailed product documentation, issue runbooks, and a knowledge base, enabling automation and streamlining troubleshooting processes.
Senior Software Engineer
Veritas Technologies | DevOps Team
09.2022 - 12.2024
As a part of the DevOps team working on supporting Azure based SaaS Protection platform which serves 1000+ customers to backup, secure and E-Discover their Data from M365, Slack, salesforce, Google workspace
Designed and Deployed new infrastructures and integration using Terraform, supported dev team and worked closely with engineering to fix the product issues that are internal and customer facing, Developed and maintained Terraform scripts and ARM templates for automated Azure resource provisioning and scaling
Applied FinOps practices to minimize cloud costs in prod/dev environments, right-size customer resources based on usage patterns and licensing, and optimize resource allocation using Azure scale sets, logic apps to balance customer experience with Veritas's profit margins
Collaborate cross-functionally with Engineering, Product Management, QA, Onboarding, and Provisioning teams to resolve customer and product incidents
Monitored infrastructure health via Grafana and internal tools, managing Azure Web App Service, SQL Databases, VMs, Storage Accounts, and ElasticSearch clusters
Implemented PagerDuty for setting up a 24/7 alert work process, to resolve Azure and product alerts proactively
Configured Azure load balancing (VM Auto Scaling policies), network policies, and infrastructure to ensure optimal performance and security
Setting up Elasticsearch indexing servers in Azure, with Elasticsearch nodes running in Azure Kubernetes Service (AKS) and using Azure Container Instances (ACI), Leveraging Azure Monitor and the Elastic Stack's Kibana for real-time monitoring, alerting, and visualization
Led high touch customer and WhiteGlove service escalations independently, ensuring prompt communication/fixes
Performed monthly Azure SaaS product upgrades and resolved build and release issues for smooth deployments in azure DevOps CI/CD pipelines
Enhanced product documentation in Confluence for accuracy and resourcefulness for Support team enablement
Contributed product insights to Engineering for improvements focused on UX and operational efficiency
Tested new features in Dev environments, providing feedback to engineering for refinements
Troubleshooted web apps, SQL databases, Windows VMs, and ElasticSearch clusters using Veritas internal tools
Audited Veritas-hosted cloud resources to identify over-provisioned assets; created Azure policies for scaling down
Developed strategies for loss-making customers to minimize expenses while ensuring essential services
Documented resource assessments and reported on cost-saving strategies, enabling a new Team to work on this
Senior Analyst
Accenture India | Cloud Infrastructure Team
04.2021 - 08.2022
Implemented on-prem and AWS cloud servers, SQL, and Oracle DB ( On-prem & cloud ) backups with multi-region tiering/replication using AWS backup service in Veritas protection platform
Worked in AWS Infrastructure as a Service (IaaS) Team to provision resources in AWS using Terraform
Developed disaster recovery (DR) strategies using AWS backups, optimizing RPO and RTO; led DR drills
Remotely managed data center operations, troubleshooting issues, Partnered with vendors to ensure infrastructure integrity, testing and deploying automation in AWS backup service
Built and deployed servers with LUNs, NFS shares for backup infrastructure across remote on-prem and AWS cloud using Ansible and Terraform
Delivered L4 support for Accenture customers on NetBackup, NetApp Storage, Tape Libraries, and AWS services
Automated error code fixes using AWS system manager, batch scripts, report generation & perform monthly patching
Managed server installation, resource groups, monitoring, and patching for Linux and Windows environments
Led project to save cost & operational efficiency by migrating legacy physical servers from on-premises to AWS
Administered AWS services, including Compute, EBS, S3, VPC, IAM, SSH, and HTTP/HTTPS with DataDog
Handled capacity reporting, tape usability, and optimized backup infrastructure costs
Developed SOPs and runbook documentation for Automation planning
Acted as ITIL standards representative in CAB for all infrastructure changes
System Engineer
Tata Consultancy Services | Cloud Support Team
08.2018 - 03.2021
Provision, configure, & manage AWS/Azure resources, LUNs, and storage accounts for On-prem and cloud servers, Monitor and optimize server performance, identifying & address bottlenecks to enhance IOPS, latency
Integrate on-premises SAN solutions with cloud infrastructure (Azure, AWS) we used NetApp Hyper converged system for this implementation, supporting hybrid cloud environment, Monitor/report tasks using Bash scripting
Develop and manage Cloud DR strategies, performing regular tests to ensure data protection and compliance
Investigate and resolve cloud server issues, collaborating with cross-functional teams and escalating to vendors when necessary and Maintain infrastructure, perform on prem servers firmware updates, patches, Maintenance
Implement data security measures in line with industry standards, access controls & encryption for systems
Document configurations, operational procedures, and troubleshooting guides; prepare performance reports
System Admin
Microland Ltd | Server Team
08.2017 - 07.2018
Supported Microland customers by managing and resolving Infrastructure support tickets
Supported Linux server core issues, installation, configuration, failures, restoration, troubleshooting and log analysis
Conducted media and library management to ensure efficient data handling and storage operations
DataCenter Engineer
Softenger India/IBM DataCenter
01.2016 - 07.2017
Installation, configuration of servers, storage devices, & network equipment
Conducted basic troubleshooting for hardware issues, including component replacements
Monitored data center environment for temperature, humidity, and power status, ensuring optimal conditions