Summary
Overview
Work History
Education
Skills
Websites
Accomplishments
Locations
Summary
Certification
Languages
Timeline
Generic

SUMIT SRIVASTAVA

Bangalore

Summary

  • I am a seasoned professional with over 20 years of expertise in Site Reliability Engineering (SRE), DevSecOps, Cloud Operations, and Performance Engineering, having successfully held leadership roles including Head, Delivery Manager, Director, and VP.
  • I excel at establishing and managing enterprise-wide SRE and DevOps programs, mastering Production Performance Monitoring, observability dashboards, CI/CD pipelines, and Infrastructure as Code.
  • My deep understanding of security vulnerability assessments, leveraging tools like Qualys and Nessus, ensures strict compliance with OWASP standards.
  • My collaborative approach with sales and delivery teams equips me to fully comprehend customer needs, develop compelling proposals, and deliver impactful SRE and cloud solutions.
  • I have a proven track record of onboarding large clients to leading public clouds such as OCI, AWS, and Azure, contributing significantly to revenue growth of $50 million.
  • As a prominent representative at major industry events like Cloud World and KubeCon, I confidently showcase our capabilities and elevate our brand profile.
  • I manage service and incident operations with precision, conduct insightful root cause analyses, and implement top-tier tools like ServiceNow and PagerDuty for efficient SLA management.
  • My international experience spans across Australia, the UK, Singapore, India, Mexico, and the USA, giving me a global perspective on delivering value.
  • I automate SRE and DevOps tool onboarding and CI/CD pipelines using industry-leading technologies such as Jenkins, Azure DevOps, and Ansible. Additionally, I implement robust CI/CD governance and lead Change Advisory Boards for effective and compliant release management.
  • I guarantee production reliability through meticulously defined SRE KPIs and strategically build roadmaps to drive operational success for AWS, OCI, and Azure services.
  • My influence in stakeholder engagement and ability to foster strong relationships position me as a trusted leader in rapidly hiring and scaling high-performing teams.
  • With a powerful blend of statistical data analysis and machine learning expertise, alongside practical experience in high availability and disaster recovery, I consistently drive operational excellence and deliver outstanding results in high-pressure environments.

Overview

20
20
years of professional experience
1
1
Certification

Work History

Associate Director- SRE

ORACLE CORPORATION
03.2022 - Current
  • Managed the reliability of over 150 enterprise applications in Oracle Cloud Infrastructure (OCI). - Implemented full-stack observability, including metrics, tracing, logging, and event correlation.
  • Onboarded 30 customers to PaaS and SaaS for 75+ applications on OCI, totaling $150 million.
  • Established SRE dashboards for availability, SLOs, error budgets, and large-scale events.
  • Addressed security vulnerabilities across Lines of Business (LoB) for the CIO Scorecard.
  • Led problem management, including corrective actions, postmortems, and root cause analysis (RCA) for outages.
  • Achieved a 98% improvement in resiliency and availability for critical applications through resilience engineering.
  • Implemented SRE processes such as toil reduction, change management, and automated release pipelines.
  • Managed disaster recovery and high availability for critical applications.
  • Served as a core member of Oracle's global hiring panel for Cloud and led campus hiring efforts.
  • Led the SRE Community of Practice within the Oracle Cloud Team and mentored over 20 individuals in SRE, DevOps, and cybersecurity.

General Manager- SRE

HCL Technologies
09.2021 - 03.2022
  • I managed SRE Consulting, an advisory division for Cloud Native Labs, where I oversaw SRE practices, processes, and the implementation of Cloud Native platforms.
  • My work included setting up OpenShift, Kubernetes (K8s), and Tanzu, as well as deploying microservices applications.
  • I provided pre-sales support for clients like US Bank and a telecommunications company in the Middle East, contributing to a $45 million deal to create a 30-member SRE team at HCL.
  • Additionally, I conducted training sessions on SRE and DevSecOps practices and helped establish an SRE Community of Practice to develop our capabilities.

SRE Delivery Manager

GlobalLogic
01.2019 - 09.2021
  • I managed Site Reliability Engineering (SRE) and Performance Engineering for a large US-based building management company, overseeing a team of 55 and ensuring SRE for 25 IoT applications in Azure Cloud.
  • My responsibilities included supporting global environments, managing cloud infrastructure, and handling patching, security vulnerabilities, and compliance with ISO/SOC standards.
  • I led pre-sales activities for cloud-based SaaS, responded to RFPs, and designed solutions for customers.
  • I established monitoring and alerting systems using Dynatrace, DataDog, and Runscope, and set up AWS infrastructure using Infrastructure as Code (IaC) with AWS CDK.
  • Additionally, I managed SLOs, SLAs, SLIs, and error budgets to enhance reliability and conducted security scans for vulnerabilities.
  • I was also responsible for deploying APIs, maintaining CI/CD pipelines, and overseeing SRE and Performance Engineering for CitiBanamex in Mexico, managing 32 team members.
  • I automated performance monitoring using AppDynamics and managed revenue, gross margins, and budgeting for the projects.

Technical Consultant

AON Consulting
01.2018 - 01.2019
  • Management of the Performance Engineering practice for multiple products, deliverables including - Planning, scope, budget, resources, tools, roadmap etc
  • Set up DevOps toolset for delivery teams
  • Automate the Deployment pipelines for Test & Prod environments
  • Set up HP Performance Centre for global usage
  • NFT Consultant to advice and enable PT/PE engagement across business units

Senior Principal Engineer

Kronos Solutions
01.2016 - 01.2018
  • Manage a team of 8 Performance Test, SRE Leads/Engineers
  • Management of the Performance Engineering/SRE practice for multiple products, deliverables including - planning, scope, budget, resources, tools, roadmap etc
  • Oversee and manage Performance test execution Peak Load tests, Performance tests, Response Time tests, Single User baselines, Baseline tests, Failover tests, Spike tests, Endurance/Longevity/Stability tests, Capacity Tests
  • Oversee Performance at all tiers, profiling, tracing, thread dump & heap dump analysis to find root cause of performance issues

Senior Technical Architect

HCL Technologies
11.2014 - 01.2016
  • Manage a team of 4 Performance Test Leads/Engineers onsite and 11 Engineers offsite
  • Working on RFPs, providing solutions for Functional & non-Functional testing
  • Was TCoE Practice lead for large scale Telecom Customer Responsible for driving factory like technical transformation of the applications focused on industrializing, standardizing and automating build, deployment, release and testing processes
  • Recruit Manage and Train Performance Engineers and Performance Testers in the teams

Technical Specialist

FIL Business Services
06.2012 - 10.2014
  • Built the central Performance Engineering/Center of Excellence Team from ground up
  • Responsible for defining and implementing Performance Engineering strategy for ETL/Enterprise Datawarehouse & BI Reporting apps
  • Suggested multiple enhancement areas for the application under test for improving performance
  • Contributed to meeting the performance goals of the application by reducing the response times from half a minute to less than 4 seconds

Team Lead

CSC
01.2005 - 06.2012
  • Involved in preparation of estimation, capacity matrix, testing plan and details, capacity plan and performance strategy docs and conducted assessments and data modeling using excel
  • Developed scenarios for CICS transactions using RTE protocol in load runner & scheduled batch jobs to simulate heavy payments files for processing by payment engine
  • Worked towards reducing the overall performance of batch jobs by 75%
  • Worked towards reducing the user response times drastically for high response functions, some by 19 minutes
  • Oversaw the database tuning and store procedure tuning which brought down the response times, CPU utilization and improved overall system performance

Education

Certificate in Big Data & Data Analytics -

Fore School of Management
New Delhi, India
11.2016

Master of Computer Application -

HNB Garhwal University
Dehradun, India
01.2005

Bachelor of Science - Physics

University of Allahabad
Prayagraj
04.2002

Skills

Languages : Java, Python, Go

Cloud : OCI, Azure, AWS, GCP

SRE/DevSecOps Tools : Jenkins,GitHub,Ansible, Terraform,Dynatrace,New Relic,Grafana, Prometheus

Technical Leadership : Managing, directing Cloud Native Development, SRE/Cloud Operations, Security ,Gen AI implementation, QA

Accomplishments

  • I implemented automations and data analytics to reduce monthly security vulnerabilities from an average of 22,000 to under 10 in a year using Ansible, Python, the Meta Frontier model, CDR, and GitHub Pipelines. This effort earned me a Diamond Award, in 2025
  • Awarded as Rockstar Hiring Bartender in participating in more than 20 IITs, NITs, IIITs as well as interviewing more than 50 candidates in 2024
  • Won Best Team award for migrating 25+ on-Prem applications onto Oracle Cloud with zero data loss & zero outages in Oracle in 2022


Locations

  • Bangalore, Gurugram, Noida, Hyderabad,Chandigarh, India
  • Chicago, Milwaukee,USA
  • Melbourne, Australia
  • London, Edinburgh, Swindon, UK
  • Singapore
  • CDMX, Mexico

Summary

SRE, DevSecOps, AI/Gen AI/ML team set up,Cloud Ops, Performance Engineering, Head, Delivery Manager, Director, VP

Certification

  • AWS Certified Professional
  • Azure Cloud Certification
  • OCI Professional
  • OCI Generative AI foundation
  • Certified Kubernetes Administrator
  • Basics of LLMs, AI frontier Models, Transformers
  • SRE Foundation from DevOps Institute
  • Certified Scrum Master from Scrum Alliance
  • Six Sigma Green Belt from Exemplar Global
  • Prince 2 from Axelos UK
  • PMP from Project Management Institute

Languages

English
Bilingual or Proficient (C2)
Hindi
Bilingual or Proficient (C2)
Spanish
Elementary (A2)

Timeline

Associate Director- SRE

ORACLE CORPORATION
03.2022 - Current

General Manager- SRE

HCL Technologies
09.2021 - 03.2022

SRE Delivery Manager

GlobalLogic
01.2019 - 09.2021

Technical Consultant

AON Consulting
01.2018 - 01.2019

Senior Principal Engineer

Kronos Solutions
01.2016 - 01.2018

Senior Technical Architect

HCL Technologies
11.2014 - 01.2016

Technical Specialist

FIL Business Services
06.2012 - 10.2014

Team Lead

CSC
01.2005 - 06.2012

Bachelor of Science - Physics

University of Allahabad

Certificate in Big Data & Data Analytics -

Fore School of Management

Master of Computer Application -

HNB Garhwal University
SUMIT SRIVASTAVA