RAMESH REDDY BOILLA

Site Reliability Engineer
Bengaluru

Summary

Dynamic Technical Lead with extensive experience at Manhattan Associates, specializing in cloud-native infrastructure and Kubernetes operations. Proven expertise in incident management and system monitoring, driving high availability and performance optimization. Adept at leveraging automation and configuration management to enhance operational efficiency while fostering collaboration across teams.

Overview

10 years of professional experience

Work History

Technical Lead, Operations

Manhattan Associates
11.2022 - Current

Leading cloud-native infrastructure and Kubernetes operations for Manhattan's enterprise supply chain management platform on Google Cloud Platform (GCP), managing mission-critical applications serving global clients.
Key Responsibilities:
• Architect and manage Google Kubernetes Engine (GKE) clusters across multiple GCP projects hosting supply chain application stacks (MAWM, MATM, MAO, MASCP)
• Design and implement containerized microservices deployments using Kubernetes pods and orchestration
• Manage RabbitMQ messaging infrastructure for internal service communication
• Configure and maintain GCP Pub/Sub services for external event management between customers and Manhattan platform
• Deploy and manage Apache Kafka clusters on GKE for event-driven architectures and message sequencing
• Administer MySQL database deployments, image management, and persistent storage solutions on Kubernetes
• Manage GCP Cloud Storage buckets for application data, backups, and artifacts
• Implement CI/CD pipelines using Jenkins and Bitbucket for automated deployments
• Develop shell scripts for automation, deployment orchestration, and operational tasks
• Configure ELK Stack (Elasticsearch, Logstash, Kibana) for centralized logging and log analysis
• Implement Prometheus for real-time metrics collection and performance monitoring
• Design Grafana dashboards for infrastructure observability and SLA tracking
• Manage Google Container Registry (GCR) and Google Source Repository (GSR) for container images and source code
• Apply SRE principles and best practices across all managed services
• Leverage GCP Associate Cloud Engineer and CKA (Certified Kubernetes Administrator) expertise for infrastructure optimization
• Analyze thread dumps, heap dumps, GC logs, NMON reports, and network packet captures to diagnose performance issues reported by customers

GS Consultant

Software AG
06.2022 - 10.2022
  • Specialized in WebMethods performance optimization and troubleshooting for enterprise clients across the APAC region.

    Key Responsibilities:
    • Analyzed thread dumps and heap dumps to diagnose and resolve performance bottlenecks in WebMethods applications serving multiple APAC customers
    • Performed root cause analysis for critical WebMethods performance issues including memory leaks, thread deadlocks, and resource contention
    • Provided expert support for WebMethods Microservices Runtime (MSR) deployments on Kubernetes orchestration platforms
    • Collaborated with R&D teams to troubleshoot and resolve Kubernetes-related issues on AWS and GCP cloud environments
    • Optimized WebMethods MSR containerized deployments for high availability and scalability in cloud-native architectures
    • Implemented monitoring and alerting solutions for WebMethods applications to proactively identify performance degradation
    • Conducted performance tuning sessions focusing on JVM optimization, garbage collection analysis, and memory management for WebMethods environments

Sr. Cloud Application Engineer

OpenText
01.2022 - 05.2022
  • Managed cloud infrastructure and application deployments for OpenText's enterprise content management solutions.

    Key Responsibilities:
    • Conducted cluster analysis and performance optimization for distributed systems
    • Implemented uptime monitoring and alerting solutions to ensure 99.9%+ availability
    • Performed root cause analysis for complex production issues using TCP/IP and network protocol analysis
    • Designed and implemented transmission control protocols for secure data transfer
    • Automated deployment and scaling processes for cloud applications
    • Collaborated with development teams to implement SRE principles and reliability improvements
    • Created comprehensive documentation for infrastructure architecture and troubleshooting procedures
    • Participated in on-call rotation and incident response for critical services

Senior Software Engineer

Tech Mahindra
05.2021 - 01.2022
  • Provided application infrastructure support and middleware management services for Principal Financial Group (USA), managing mission-critical production environments serving financial services applications.

    Key Responsibilities:
    • Managed and maintained WebSphere Application Server clusters for high-availability production environments
    • Administered JBoss Application Server instances supporting critical financial applications
    • Configured and optimized Apache web servers for load distribution and performance
    • Implemented automation solutions using Ansible Tower for configuration management and deployments
    • Managed F5 Load Balancers to ensure optimal traffic distribution and high availability
    • Utilized IPControl for IP address management and network infrastructure coordination
    • Performed cluster analysis and performance tuning for production application environments
    • Conducted root cause analysis and resolved complex production issues with minimal downtime
    • Implemented monitoring and alerting solutions to proactively identify and resolve issues
    • Collaborated with application teams to ensure smooth deployments and system reliability.

Lead System Administrator

Wipro Technologies
08.2019 - 03.2021
  • Led application infrastructure and middleware operations for HSBC Retail Banking and Wealth Management (RBWM) unit, managing enterprise-scale banking applications across multiple geographical regions.

    Key Responsibilities:
    • Built and managed deployment platforms for Linux-based application infrastructure (WebSphere, WebSphere Liberty Server, JBoss)
    • Administered WebSphere Application Server clusters hosting critical HSBC banking applications
    • Managed Pega BPM applications deployed on JBoss Application Server for workflow automation
    • Configured and maintained WebSphere Liberty Server (WLS) instances for microservices deployments
    • Implemented Apache Reverse Proxy and Nginx solutions for web server load balancing and security
    • Led integration projects connecting HSBC applications with Qatar Bank and TSYS Allied Wallet for HSBC Cards operations
    • Managed end-to-end SSL certificate lifecycle including procurement, installation, renewal, and troubleshooting
    • Performed application performance tuning and optimization to meet SLA requirements
    • Conducted cluster analysis and capacity planning for production environments
    • Provided Level 3 support for critical production incidents with root cause analysis
    • Automated deployment and configuration management tasks using shell scripting

Consultant

Capgemini
01.2017 - 08.2019
  • Served as Middleware Production Support Engineer for AXA Insurance APAC operations, managing mission-critical insurance applications across multiple countries including Malaysia, China, Singapore, Hong Kong, Thailand, India, Japan, and Indonesia.

    Key Responsibilities:
    • Provided L2 production support for 15+ enterprise applications across the APAC region
    • Managed GTOM (Pega BPM platform) for business process automation and workflow management
    • Supported Smart Claims application on Guidewire platform for insurance claims processing
    • Administered middleware infrastructure for AXA Workbench, CaseNet, DMTM, Aura, SPCP and Japan-specific applications
    • Deployed and maintained applications across multiple middleware platforms: WebSphere, JBoss, Tomcat, and WebLogic
    • Executed production deployments and changes across UAT, Production, and Disaster Recovery environments
    • Handled incident management following ITIL best practices with strict SLA adherence
    • Managed change requests and performed impact analysis for production changes
    • Provided 24/7 on-call support for critical production incidents across multiple time zones
    • Performed root cause analysis and implemented permanent fixes for recurring issues
    • Collaborated with application teams, vendors (Pega, Guidewire), and regional stakeholders
    • Created and maintained documentation for support procedures and runbooks

Operations Specialist

IBM
03.2016 - 12.2016
  • Supported middleware platforms for OGE (Oklahoma Gas & Electric) with a focus on WebSphere and WebSphere Liberty Server, providing L1/L2 production operations across critical utility applications.

    Key Responsibilities:

    • Administered IBM WebSphere Application Server (v8.5) for high-availability production systems
    • Supported API services hosted on WebSphere Liberty Server (WLS) and assisted with environment upkeep
    • Performed L1/L2 operations including incident triage, resolution, and escalation per SLA
    • Executed production changes and deployments across DEV/UAT/PROD following change management
    • Monitored application health, performed log analysis, and handled recurring issue remediation
    • Coordinated with application and infrastructure teams to plan and execute releases
    • Maintained runbooks and standard operating procedures for consistent support

Education

Bachelor of Engineering - Electrical, Electronics And Communications Engineering

Anna University
Chennai
04.2001 -
