Manjunatha S

Bengaluru

Summary

Lead DevOps Engineer with 10+ years of experience delivering cloud-native infrastructure, DevOps platforms, MLOps solutions, and distributed systems at enterprise scale. Specialized in AWS, Azure, Kubernetes, Infrastructure as Code, CI/CD automation, observability, and Edge AI platforms, with proven success in architecting reliable, scalable, and highly automated engineering ecosystems. Strong background in platform reliability, automation, incident intelligence, and cloud modernization, partnering with global teams to accelerate delivery and operational excellence.

Core Strengths: Cloud & Platform Engineering | DevOps & CI/CD Automation | MLOps & Edge AI Platforms | LLM & GenAI Integration | Kubernetes & Distributed Systems | Infrastructure as Code (IaC) | Observability & Reliability Engineering | Automation & Python Development | Incident Intelligence & AIOps | Cloud-Native Architecture | Platform Modernization.

Overview

1 Certification
12 years of professional experience

Work History

Lead DevOps | Platform Engineer | MLOps | Edge AI

Tesco Technology
10.2023 - Current
  • Developed and optimized MLOps workflows for model versioning, rollout orchestration, inference monitoring, failure recovery, and large-scale execution, improving deployment stability across thousands of edge environments.
  • Architected and operated large-scale Edge AI and Computer Vision platforms across distributed retail environments, enabling reliable video ingestion, AI inference, event processing, and platform resiliency for thousands of store devices.
  • Automated enterprise-scale platform onboarding and operational workflows across distributed environments, improving infrastructure provisioning, system integration, deployment standardization, and operational efficiency for thousands of edge devices.
  • Managed cloud-native container platforms and Kubernetes-based edge environments, while implementing CI/CD automation pipelines, accelerating engineering delivery and release reliability.
  • Designed and managed Infrastructure as Code (IaC) and configuration automation, provisioning and maintaining 900+ distributed servers, improving deployment consistency, scalability, and platform reliability.
  • Developed Python-based automation frameworks for platform validation, configuration management, workflow orchestration, and cloud integrations, enhancing troubleshooting speed and engineering productivity.
  • Led platform reliability engineering and root cause analysis (RCA) for distributed systems, resolving production failures, degraded inference behavior, infrastructure instability, and service disruptions, improving uptime and operational resilience.
  • Created enterprise observability and monitoring solutions with Splunk and Grafana, providing real-time visibility into platform health, infrastructure availability, service performance, security compliance, and operational reliability.
  • Enhanced security and vulnerability governance, increasing infrastructure risk visibility, prioritisation, and remediation across enterprise systems.
  • Built an AI-powered incident intelligence platform integrating ticketing, observability, and LLM technologies (GPT-4o) to automate duplicate incident detection, RCA generation, SLA/SLO insights, repeated issue analysis, and operational reporting through natural language prompts, reducing manual triage effort and accelerating incident resolution.

Lead DevOps Engineer

7-Eleven
Bengaluru, India
06.2022 - 10.2023
  • Deployed public- and private-facing production-grade websites on AWS, provisioning the infrastructure with Terraform, CloudFormation, and the Serverless Framework to automate AWS resource deployment.
  • Utilised Jenkins and AWS CodePipeline for end-to-end build automation and deployments, integrating Checkmarx for static code analysis to improve code quality.
  • Worked extensively with Jenkins as a continuous integration tool: creating jobs, managing required plug-ins, configuring source code management, build triggers, build systems, and post-build actions, scheduling automatic builds, and sending build report notifications.
  • Developed Dockerfiles to containerise applications for deployment on managed Kubernetes services (EKS and AKS), enhancing application portability and deployment speed.
  • Deployed microservices with Helm charts and wrote manifest files for Kubernetes objects.
  • Created AWS infrastructure for multiple countries for 7-Eleven International, configuring AWS resources to ensure high availability, scalability, and flexibility for handling load bursts.
  • Monitored infrastructure and applications using Prometheus, Splunk, New Relic, and AWS CloudWatch.
  • Installed and configured open-source artifact repositories such as JFrog Artifactory and Nexus.
  • Worked extensively with version control systems such as GitHub, GitLab, and Bitbucket.
  • Set up AWS-managed Kafka and deployed open-source tools such as Apache Pinot and Presto on an AWS EKS cluster.

Senior DevOps Engineer

DigbiHealth
Bengaluru, India
01.2021 - 06.2022
  • Designed and architected cloud and DevOps solutions focused on scalability and cost reduction.
  • Implemented and managed CI/CD pipelines for code deployment using AWS developer tools: CodePipeline, CodeCommit, CodeBuild, and CodeDeploy.
  • Designed, deployed, managed, and monitored cloud infrastructure using Amazon Web Services (AWS) to enhance customer-facing services.
  • Completed capacity planning and architecture design for AWS infrastructure.
  • Ensured system security using best-in-class cloud security solutions.
  • Implemented IAM requirements to improve manageability and privacy for cloud-based environments.
  • Implemented a blue/green deployment strategy, using CloudFormation templates to create new application environments identical to the existing production environment.
  • Managed Docker container lifecycle, including snapshots, attachments to running containers, and directory structure organisation.
  • Configured Splunk for application monitoring, analysing server log files and creating dashboards, reports, and alerts to improve visibility and response times.
  • Worked with databases including PostgreSQL, MySQL, and DynamoDB.

Senior Software Engineer

Torry Harris Integration Solutions
05.2019 - 01.2021
  • Utilised AWS platform features including IAM, EC2, EBS, VPC, RDS, CloudWatch, CloudTrail, CloudFormation, Auto Scaling, CloudFront, S3, SQS, SNS, Lambda, and Route 53 to support scalable infrastructure.
  • Designed, deployed, managed, and operated scalable, highly available, and fault-tolerant systems on AWS.
  • Created AWS IAM users, policies, and groups to enhance security and access management.
  • Provisioned AWS resources such as EC2, VPC, EBS, AMI, and S3 buckets, and created subnets.
  • Created the configuration for a VPN tunnel between the on-premises network and the AWS VPC.
  • Designed API facades and designed and implemented API proxies with WSO2 API Manager.
  • Hands-on experience with WSO2 ESB, including proxy services, connectors, error handling, custom mediators, and debugging.
  • Provided expert consulting services to clients at their location in Bath, UK.
  • Experienced in using ActiveMQ.
  • Customised Salesforce.com scopes including users, roles, and profiles, and created custom objects, fields, SOQL queries, and triggers to streamline business operations.

Software Engineer

Torry Harris Integration Solutions
07.2014 - 04.2019
  • Designed and implemented public- and private-facing websites on AWS.
  • Configured and managed AWS services including EC2, RDS, VPC, CloudWatch, CloudFront, and Route 53.
  • Configured performance metrics using AWS CloudWatch and CloudTrail to enhance system monitoring.
  • Created AWS IAM users, policies, and groups.
  • Tuned Hybris application performance, monitored critical jobs, and conducted daily health checks, collaborating with infrastructure teams to ensure 100% uptime.
  • Installed and maintained cluster-based Hybris B2C commerce systems.
  • Monitored Hybris application performance with APM tools such as SciVisum.
  • Provided end-to-end support and troubleshooting for HAC, Backoffice, cron jobs, ImpEx import/export, SMTP, catalogs, product synchronization, and data loads from the SAP ERP system.
  • Troubleshot end-user issues across the layers of the Hybris landscape (load balancer, web server, SOA, Hybris application, Data Hub, Solr, and database).
  • Collaborated with internal infrastructure teams to implement documentation, backup/system recovery, and disaster recovery solutions for SAP Hybris systems.
  • Provided technical support at the client site in Dublin, Ireland.
  • Hands-on experience with SSH, Linux shell commands, and PowerShell scripts.

Education

Master of Computer Applications

The Oxford College of Science
06.2013

Skills

    Cloud & Platform:
    AWS, Azure, GCP, VMware Tanzu, Cloud Architecture, Distributed Systems

    DevOps & CI/CD:
    GitHub Actions, Jenkins, AWS CodeBuild, CodeDeploy, CI/CD Automation

    Infrastructure as Code (IaC):
    Terraform, CloudFormation, AWS CDK, Chef, Ansible

    Containers & Orchestration:
    Docker, Kubernetes, AWS ECS, EKS, Container Platforms, Edge Infrastructure

    MLOps & Edge AI:
    MLOps, Model Deployment, Inference Monitoring, Workflow Orchestration, Edge Computing

    Programming & Automation:
    Python, Shell, Bash, API Integrations, Automation Frameworks

    Observability & Reliability:
    Splunk, Grafana, Prometheus, CloudWatch, New Relic, Incident Management

    Serverless & Event Streaming:
    AWS Lambda, API Gateway, Step Functions, Kinesis, Google Cloud Functions

    Databases & Messaging:
    PostgreSQL, MySQL, DynamoDB, Kafka, RabbitMQ

    Security & Quality Engineering:
    SonarQube, Checkmarx, Snyk, Kenna, Vulnerability Management

Certification

  • ITIL Foundation Certificate in IT Service Management, GR750220213DN
  • HIPAA (Health Insurance Portability and Accountability Act)

Languages

English
Kannada
Telugu

Websites, Portfolios and Profiles

https://www.linkedin.com/in/manjunatha-shankarareddy-180145136
