SREEJESH MURALIDHARAN
Open To Work

Bangalore

Summary

Senior technology leader with 13+ years of experience driving Cloud Operations, SRE, and DevSecOps for large-scale SaaS platforms. Proven track record of building and scaling CloudOps functions, establishing SRE operating models, and delivering highly reliable, secure, and cost-optimized cloud services. Expertise in AWS-first cloud environments, SLI/SLO/SLA frameworks, incident and change management, observability (APM, logs, metrics, traces), and an automation-first DevOps culture. Strong leadership background managing high-performing teams, improving MTTR, optimizing cloud costs (FinOps), and enabling resilient, scalable systems aligned with business SLAs.

Overview

1 Certification
17 years of professional experience

Work History

Senior Manager - Site Reliability Engineering

Deltek | Replicon
Bangalore
12.2013 - Current
  • Lead and scale Cloud Operations and SRE functions supporting mission-critical SaaS platforms with 99.9%+ uptime in a 24x7 environment
  • Manage and mentor a 14-member platform engineering and SRE team, driving high performance, ownership, and operational excellence
  • Own end-to-end platform reliability, availability, and performance across cloud infrastructure and distributed systems
  • Designed, deployed, and managed Kubernetes (EKS) clusters for microservices-based applications, enabling scalable, containerized workloads
  • Led containerization initiatives, migrating legacy services to Docker and Kubernetes, improving deployment consistency and release velocity
  • Implemented Helm-based deployments and Kubernetes manifests for standardized application releases across environments
  • Established auto-scaling (HPA/VPA), resource optimization, and cluster capacity planning to ensure performance and cost efficiency
  • Integrated Kubernetes with CI/CD pipelines (Git-based workflows), enabling automated build, test, and deployment cycles
  • Strengthened cluster observability using Prometheus, Grafana, and logging pipelines, improving visibility into container workloads
  • Establish and implement SRE practices (SLI/SLO/SLA, error budgets, incident response, postmortems)
  • Implemented DevSecOps practices integrating SAST, DAST, and Software Composition Analysis (SCA) into CI/CD pipelines, improving application security posture and reducing vulnerabilities in production
    • Led adoption of security tools including CrowdStrike (endpoint protection), KICS (Infrastructure-as-Code security scanning), and Black Duck (open-source vulnerability management), ensuring compliance with security and audit standards
    • Integrated automated security scanning into build and deployment workflows, enabling early detection of vulnerabilities and strengthening shift-left security practices
  • Drive incident, problem, and change management processes, significantly improving MTTR and system stability
  • Collaborate with engineering and product teams to ensure production readiness and operational scalability
  • Manage and support cloud database platforms (Amazon RDS, Aurora PostgreSQL) for large-scale production workloads
  • Design and implement high availability and disaster recovery strategies, including replication, failover, and backup mechanisms
  • Support multi-terabyte workloads, ensuring scalability, resilience, and performance optimization
  • Design and deploy enterprise observability platforms (metrics, logs, traces, APM) using Prometheus, Grafana, Dynatrace, and Sumo Logic
  • Implement Infrastructure-as-Code (Terraform) and CI/CD pipelines to standardize deployments and reduce failure rates
  • Drive an automation-first culture, minimizing manual intervention and improving operational efficiency
  • Support compliance initiatives (FedRAMP, SOC2-aligned practices) and security standards
  • Drive FinOps initiatives, including cost allocation, rightsizing, and optimization

SRE / DevOps Engineer

Deltek | Replicon
Bangalore
12.2013 - Current
  • Resolved critical production issues and customer-reported bugs as part of sustainment engineering
  • Developed backend features using C# and Node.js, and frontend components with JavaScript/TypeScript
  • Collaborated in Agile teams to implement fixes and enhancements for backend and frontend systems
  • Utilized JIRA for tracking, prioritization, and resolution of engineering tasks and incidents

DevOps Engineer – Cloud Operations

Deltek | Replicon
Bangalore
12.2013 - Current
  • Provided 24/7 on-call support, maintaining 99.98% uptime for cloud services
  • Implemented containerization and CI/CD pipelines with Docker, streamlining deployment processes
  • Contributed to on-prem to AWS cloud migration initiatives, increasing scalability and reliability
  • Delivered cloud-based engineering solutions using Node.js, Prometheus, and Grafana to enhance monitoring systems
  • Built proactive monitoring frameworks using Prometheus and Grafana, enhancing system visibility
  • Reviewed and approved Terraform and Git-based infrastructure changes, ensuring governance and stability
  • Acted as technical lead for operations, guiding team decisions and ensuring service reliability

Cloud Operations Engineer – Tier 3

Deltek | Replicon
Bangalore
12.2013 - Current
  • Oversaw AWS services including EC2, RDS, Redis, RabbitMQ, S3, and Elastic Beanstalk to ensure reliable cloud infrastructure
  • Designed and implemented CI/CD pipelines, build automation, and release processes for cloud deployments
  • Developed automation scripts using Node.js, PowerShell, and Java to streamline operational workflows
  • Created custom automation solutions using Node.js and Ruby web services to meet business requirements
  • Utilized monitoring tools such as CloudWatch, Sumo Logic, Zabbix, and Grafana for system health tracking
  • Delivered escalation support for Sev-1 production issues, reducing downtime and mitigating customer impact
  • Partnered with DevOps and Engineering teams to troubleshoot production issues and enhance deployment processes
  • Participated in release planning and production readiness reviews

Systems Engineer

CGI
Bangalore
05.2012 - 11.2013
  • Managed application servers, troubleshooting and monitoring server performance using Wireshark to ensure optimal functionality
  • Troubleshot network issues, identifying root causes to minimize downtime and enhance network reliability
  • Worked with the Selenium automation tool, Core Java, and an ITSM tool
  • Collaborated with testing team, contributing to training on manual and automation testing to improve testing efficiency
  • Raised tickets with NOC team for issues of varying severity

Escalation Officer

Convergys India Private Limited
09.2009 - 05.2012
  • Resolved issues in Windows operating systems, ensuring system stability and user productivity
  • Delivered process training to agents on Windows operating systems and Outlook, improving operational knowledge and support quality
  • Supported networking setup and configuration across diverse environments to enhance connectivity

Technical Support Engineer-JTAC

Convergys India Private Limited
09.2009 - 05.2012
  • Configured TCP/IP and routing protocols including RIP, IGRP, EIGRP, and OSPF to enhance network communication
  • Created hierarchical network structures and assigned subnetworks across various physical media to optimize connectivity
  • Designed, installed, and configured LANs; simulated network configurations in a lab environment

Education

BE - Mechanical

AMC Engineering College
Bangalore
01.2008

Skills

  • AWS (EC2, RDS, S3, IAM, VPC)
  • CI/CD
  • Terraform
  • PostgreSQL
  • Aurora
  • Node.js
  • Dynatrace
  • Prometheus
  • Grafana
  • SLO/SLA
  • Observability
  • Incident Management
  • Git
  • Sumo Logic
  • Elasticsearch
  • PowerShell
  • DevSecOps (SAST, DAST)
  • Security Tools: CrowdStrike, KICS, Black Duck

Certification

  • JNCIA
  • Selenium

Projects

  • PostgreSQL / Aurora Migration & Modernization: Led a large-scale PostgreSQL upgrade and migration (v13 → v17) using the AWS Aurora Blue-Green deployment strategy. Resolved critical challenges involving logical replication limitations, replica identity configuration, WAL lag monitoring, and replication slot optimization. Designed a near-zero-downtime migration strategy for production workloads. Implemented validation and rollback mechanisms, ensuring data integrity and minimal business disruption. Collaborated with AWS support to troubleshoot migration blockers and optimize the deployment approach. Improved database performance and scalability post-migration.
  • APM Implementation & Observability Modernization (Dynatrace): Led end-to-end implementation of Application Performance Monitoring (APM) using Dynatrace across a distributed microservices architecture. Instrumented applications to achieve deep visibility into application performance, service dependencies, and user transactions. Enabled real-time monitoring of critical business transactions, reducing mean time to detect (MTTD) and mean time to resolve (MTTR) for production issues. Configured custom dashboards, alerting policies, and anomaly detection, improving proactive issue identification. Integrated Dynatrace with cloud infrastructure (AWS) for full-stack observability.
  • OpenTelemetry Implementation & Distributed Tracing: Designed and implemented an OpenTelemetry-based observability framework for distributed systems. Enabled end-to-end distributed tracing across microservices, improving root cause analysis of latency and failures. Instrumented services using OpenTelemetry SDKs and integrated them with observability platforms. Standardized telemetry data collection across applications for unified monitoring. Improved visibility into service-to-service communication and performance bottlenecks.

Languages

  • Hindi (Fluent)
  • Malayalam (Native)
  • Tamil (Conversational)
  • English (Fluent)

Personal Information

  • Bangalore
  • Remote
  • Hybrid
  • Date of Birth: 11/09/86
