Highly accomplished DevOps Architect and SRE with 18+ years of experience in high-performance computing, specializing in Kubernetes orchestration and disaster recovery. Reduced mean time to recovery (MTTR) for fintech and telecommunication platforms. Skilled in building self-healing systems and leading cross-functional teams to drive digital transformation initiatives.
Overview
18
years of professional experience
1
Certification
Work History
DevOps Architect
Sinch Cloud Communication Pvt.
11.2020 - 11.2025
Design and implement secure DevOps architectures integrating security across the entire SDLC.
Member of the team migrating applications, suppliers, and clients from on-premises to the cloud.
Architect and implement DevSecOps pipelines using tools like Jenkins, GitLab, and GitHub Actions.
Integrate security scanning tools into CI/CD pipelines, including SonarQube and Snyk.
Design and implement secure Infrastructure as Code (IaC) using Terraform and Ansible, with automated security checks.
Architect cloud security frameworks for platforms such as Amazon Web Services, Microsoft Azure, and Google Cloud Platform.
Implement container and Kubernetes security strategies using Docker and Kubernetes.
Integrate container vulnerability scanning and runtime protection tools into pipelines.
Establish DevSecOps governance, policies, and compliance standards across engineering teams.
Implement automated secrets management and secure credential handling using tools like HashiCorp Vault.
Design identity and access management (IAM) strategies and least-privilege access models in cloud environments.
Implement security monitoring, logging, and incident response frameworks using tools like Splunk and ELK Stack.
Ensure compliance with security standards such as ISO 27001, SOC 2, and GDPR.
Conduct threat modeling and security architecture reviews for applications and infrastructure.
Lead DevSecOps adoption initiatives and mentor teams on secure development and deployment practices.
Collaborate with security, development, and operations teams to implement shift-left security strategies.
Implemented automated security scanning in CI/CD pipelines, reducing critical vulnerabilities by 70%.
Designed a secure Kubernetes platform improving application deployment security and compliance.
Reduced manual security review effort by 50% through automated DevSecOps workflows.
Implemented AutoGPT and AutoGen frameworks to automate routine infrastructure maintenance tasks, reducing manual operational toil by 60%.
Developed a LangChain-based AI agent to analyze Kubernetes logs, resulting in a 40% faster root cause analysis (RCA) and automatic remediation of common service failures.
Integrated GitHub Copilot into the CI/CD pipeline, enhancing developer productivity by 25% through AI-assisted coding and automated test generation.
Customer Solution Adoption Sr. Specialist
SAP Labs
11.2016 - 11.2020
Deployed, monitored, and troubleshot SAP applications in a Linux environment.
Installation, configuration, and troubleshooting of Linux systems.
Led a team of 10 SREs, mentoring them on incident response best practices and reducing mean time to resolution (MTTR).
Automated CI/CD pipelines and infrastructure deployment using Terraform and Jenkins, cutting deployment time by 40%.
Implemented proactive monitoring using Grafana and Prometheus, reducing downtime by 20% through early detection of latency bottlenecks.
Eliminated single points of failure, increasing system availability from 99.5% to 99.99%, and reducing customer impact by 80%.
Responsible for the installation and configuration of MySQL databases, including backup, recovery, SQL query optimisation, and overall performance tuning.
Responsible for the deployment, configuration, and administration of web servers such as Apache Tomcat and Apache HTTP Server.
Monitored customer tickets raised through Nagios by priority, and provided resolutions and updates within SLA. Joined or initiated bridge calls during outages and patching.
Kept all platform instances running smoothly, maintaining platform uptime at 99.9%.
Investigated recurring incidents and fixed them at the root cause.
Performed log analysis and basic network troubleshooting, coordinating with the network team.
Patched application releases and system software.
Trained and managed the L1 and L2 support teams. Handled critical issues affecting the production environment to keep hosts and applications running smoothly.
Supported Sybase databases that interact with messaging protocols.
Wrote Python and Perl scripts to fetch data from the database and generate billing and traffic reports.
Managed containers with Docker: wrote Dockerfiles, set up automated builds on Docker Hub, and installed and configured Kubernetes.
Worked with SAP products and platforms, mainly Hybris Marketing Cloud, CAAS email, and Enterprise Messaging.
Supported customers during email campaigns, including IP warm-up, soft bounces, hard bounces, and complaints.
Sybase 365 is a subsidiary of Sybase, Inc., an SAP company, and the global leader in mobile messaging and mobile commerce services. It operates an SMSC hub supporting international P2P and A2P SMS (MO/MT) traffic over SS7/SMPP via direct routes, the peering hub, or transit nodes.
Customer Solution Specialist
Wipro
07.2016 - 11.2016
Assisted in addressing cases and providing solutions to NOC or Service Desk teams.
Facilitated routing of SMS traffic from peering hub and transit nodes via Sybase365 hub.
Updated case statuses in the SAP CRM tool regularly for visibility to sales and operations teams.
Contributed to L3 production support team managing A2P infrastructure.
Supported application-level changes with a focus on minimising downtime.
Resolved system issues raised by NOC team efficiently.
Addressed application issues reported by Service Desk for impacted clients.
Implemented new monitoring tools in Nagios and EM7.
Prepared RCA documentation for clients and internal issues.
Sr. Software Engineer
Minacs
02.2011 - 06.2016
Assisted in managing L3 production support for A2P infrastructure.
Addressed major application-level updates ensuring zero downtime.
Resolved system issues reported by NOC team.
Handled application issues raised by service desk affecting clients.
Implemented new monitoring solutions in Nagios and EM7.
Prepared RCAs and internal documentation for issues.
Software Engineer
Roamware Inc.
12.2009 - 11.2010
Software Engineer
Cellebrum
12.2007 - 12.2009
Education
BTech - Information Technology
UPTU
Jhansi
01-2007
Skills
Certified Kubernetes Administrator (CKA)
Cloud platforms: Expert-level knowledge with 12 years of AWS and 3 years of Azure and GCP, focusing on architecture, networking, and security
Infrastructure as Code (IaC): Proficiency in provisioning infrastructure using Terraform, Ansible, and Chef
CI/CD and Automation: Designing robust pipelines using Jenkins, GitLab CI, Argo CD
Containerisation and orchestration: Expertise in Docker, Harbor, Kubernetes, and Helm charts for application containerisation
Scripting and Programming: Advanced scripting in Python, Shell scripting, and Groovy to automate complex tasks
Monitoring & Logging: Implementing observability using Prometheus, Grafana, ELK Stack, Nagios, Dynatrace, and SolarWinds
Version Control & SCM: Advanced Git usage, GitFlow methodology, GitHub, and GitLab