Experienced Principal Engineer with a strong foundation in DevOps, Infrastructure as Code, and automation. Demonstrated success in managing end-to-end deployments, optimizing performance, and automating processes to boost operational efficiency. Skilled in applying Site Reliability Engineering (SRE) principles, including automated health checks, performance tuning, and capacity planning, to ensure system resilience, high availability, and scalability. Proficient in monitoring and troubleshooting to uphold uptime and reliability standards across projects.
Led the client project from the pilot phase, establishing CI/CD pipelines for seamless integration and deployment, utilizing Docker and Kubernetes for containerization, orchestration, and workload management.
Optimized Kubernetes cluster performance by tuning resource allocations, managing workloads, and implementing best practices for scaling, high availability, and fault tolerance.
Troubleshot complex Kubernetes issues, including cluster failures, networking challenges, persistent storage integration, and multi-cluster setups.
Designed and optimized Kubernetes architectures for scalability and security while ensuring compliance with industry standards and internal guidelines.
Conducted quarterly Kubernetes health check audits, identifying and implementing optimizations for better performance, security, and reliability.
Automated infrastructure provisioning and deployment using Terraform and Ansible, streamlining cluster management across multi-cloud and hybrid environments.
Implemented advanced CI/CD pipelines tailored for Kubernetes, ensuring seamless application deployments and rollbacks.
Led root cause analysis (RCA) for major Kubernetes-related incidents, providing post-incident reports and proposing long-term solutions.
Proactively monitored cluster health using Grafana and Graylog, enabling preemptive troubleshooting and performance tuning.
Managed production infrastructure following DevOps best practices, ensuring 99.99% uptime and operational stability.
Developed and integrated Python scripts for automation, log parsing, and database interactions, improving system efficiency.
Administered job schedules using Cronicle and maintained version control using Git.
Utilized JIRA and Confluence to track issues, manage workflows, and document Kubernetes and DevOps processes.
Spearheaded disaster recovery (DR) and system recovery (SR) initiatives, ensuring minimal downtime and business continuity.
Designed Kubernetes upgrade and migration plans, ensuring compatibility across components and seamless transitions.
Generated detailed reports on system performance and optimizations using automated scripts, delivering insights for data-driven decision-making.
Developer Support Engineer
JFrog
01.2020 - 11.2020
Customer Support Engineering: Delivering R&D-level support for DevOps tools such as JFrog Artifactory, Docker, and Kubernetes, enhancing CI/CD pipelines and optimizing cloud-based container deployments
Application Configuration: Setting up JFrog applications with integrated DevOps tools, creating scalable CI/CD workflows, and leveraging Kubernetes for efficient container orchestration
Database Optimization: Ensuring seamless integration of MySQL, Oracle, and PostgreSQL with JFrog applications, optimizing configurations for performance, and aligning database operations with CI/CD processes
Performance Monitoring: Proactively analyzing logs, JVM parameters, and OS configurations using monitoring tools like Grafana to optimize system health and prevent bottlenecks
Issue Replication and Resolution: Simulating customer environments to troubleshoot issues, conducting root cause analysis, and deploying automated solutions to enhance system resilience
Knowledge Base Creation: Developing a detailed knowledge base of common issues, automation scripts, and DevOps best practices to streamline incident resolution and standardize support workflows
Senior Associate
Cognizant Technology Solutions India Ltd
11.2018 - 01.2020
User Problem Analysis: Disintegrated and assessed user issues using test scripts and advanced troubleshooting techniques, leveraging DevOps methodologies to identify and resolve problems efficiently
Root Cause Analysis: Conducted thorough root cause analysis and troubleshooting, partnering with the DevOps team to swiftly address system and infrastructure issues, implementing preventative measures to minimize recurrence
Support Request Response: Addressed support requests from end users with hands-on guidance, employing a systematic approach to troubleshooting that aligns with DevOps practices for rapid resolution
Documentation and Knowledge Management: Documented all support interactions and transactions in the system, enhancing the knowledge base with insights and solutions to foster continuous improvement and support efficiency
Cross-Functional Coordination: Collaborated with cross-functional teams to optimize system performance, streamline processes, and maintain a robust support framework, integrating DevOps practices to enhance collaboration between development and operations
DevOps Integration: Integrated DevOps practices by working closely with development and operations teams, driving improvements in continuous integration, continuous deployment (CI/CD), and infrastructure management
CI/CD Pipeline Management: Managed Jenkins CI/CD pipelines to automate build, test, and release processes, ensuring smooth deployments, reducing downtime, and enhancing overall system reliability
Infrastructure Monitoring: Utilized monitoring tools such as Graylog and Grafana to gain real-time insights into system performance, proactively identifying areas for improvement and optimizing resource utilization
Performance Enhancement Collaboration: Worked with cross-functional teams to improve application and system performance by addressing bottlenecks, automating workflows, and implementing resource optimization strategies in line with DevOps principles
Technical Support Engineer
NewNet Techno Engineering India Pvt Ltd
06.2016 - 10.2018
Provided operational and systems engineering support for cloud applications, leveraging AWS services such as EC2, S3, and RDS to manage and maintain scalable cloud platforms
Patched software and installed new versions using AWS Systems Manager to automate patch management and ensure security compliance across cloud environments
Followed up with clients to verify customer satisfaction after resolving issues, utilizing AWS CloudWatch to monitor application performance and ensure smooth operations post-resolution
Worked with Zendesk, JIRA, and Confluence for tracking bugs, responding to customer queries, and documenting troubleshooting steps for cloud-based applications on AWS
Monitored and maintained production systems using AWS CloudWatch for real-time insights, optimizing resource usage and ensuring high availability across distributed environments
IT Analyst
Klaus IT Solutions
07.2015 - 05.2016
Company Overview: Client : Intel Security Group(McAfee Inc.)
Supporting our entire McAfee customers through a variety of mediums
Assisting customers with technical queries for all McAfee products like VSE, EPO, DLP
Ensure that all the required files are gathered and available prior to escalating the issues to next level
Client : Intel Security Group(McAfee Inc.)
IT Analyst
Future Focus Infotech
06.2014 - 07.2015
Company Overview: Client : Tata Consultancy Services Sub Client : GMR Group
Evaluated and adopted new technologies to address changing industry needs
Boosted information sharing by enhancing interfaces between computer systems
Dedicated IT Support to Chairman of the Company and Supporting all kinds of IT Related issues like Windows, Apple MAC book, iPhone, iPAD, Android, Blackberry and Windows and Video Conferencing services and also provide round the clock support to the Chairman
Was the first line of contact for all the IT related issues for the Chairman office
Was the key person in designing and implementing the new office IT Infrastructure for the GMR Group
Client : Tata Consultancy Services Sub Client : GMR Group
Senior System Administrator
Network Solutions(An IBM Company)
09.2010 - 06.2014
Company Overview: Client : GMR Group
Implemented, developed and tested installation and update of file servers, print servers and application servers in all departments
Complete configuration, installation and support of equipment in a Microsoft Windows, MAC and Mobile devices environment to the specifications of client proposals
Provided input on hardware and software purchasing, prioritizing return on investment to optimize IT spending
Interacted directly with users to diagnose and correct major system issues and address concerns
Patch Management & Antivirus updating
Installing, Configuring and Troubleshooting of SCCM Client
Installing, Configuring and Troubleshooting of Auto CAD 2008 and 2010 Application
Installing, Configuring and Troubleshooting MS Office (2003, 2007, 2010 & 2013) and Office Communicator 2005, 2007 R2 and Lync 2010/2013
Client : GMR Group
Junior System Administrator
Bhilwara Scribe Pvt Ltd.
09.2009 - 09.2010
System Admin and Hardware Engineer
Assigning routine jobs and handing over specific tasks to them
Technical Co-ordination with vendors, Client for the technical maintenance of the company, Installation of Software's, LAN, ISDN modem, Hubs, D-Link and CISCO Switches, and maintenance of Medical Transcriptionist's database
Education
B.Tech - Computer Science And Systems Engineering
Sri Vidyanikethan College of Engineering
06.2007
Skills
Livesite Handling
Azure
AWS
DevOps
Jenkins
Docker
Kubernetes
Terraform
Ansible
JFrog Artifactory
Linux
Windows
JIRA
Confluence
MongoDB
Python
GIT
Cronicle
Generative AI
Prompt Engineering
Lang Chain
Large Language Models
Vector database
LLMs Powered Applications
Timeline
Principal Engineer
Wissen Technology
08.2021 - 10.2024
Developer Support Engineer
JFrog
01.2020 - 11.2020
Senior Associate
Cognizant Technology Solutions India Ltd
11.2018 - 01.2020
Technical Support Engineer
NewNet Techno Engineering India Pvt Ltd
06.2016 - 10.2018
IT Analyst
Klaus IT Solutions
07.2015 - 05.2016
IT Analyst
Future Focus Infotech
06.2014 - 07.2015
Senior System Administrator
Network Solutions(An IBM Company)
09.2010 - 06.2014
Junior System Administrator
Bhilwara Scribe Pvt Ltd.
09.2009 - 09.2010
B.Tech - Computer Science And Systems Engineering
Sri Vidyanikethan College of Engineering
Similar Profiles
Aparna SabooAparna Saboo
Senior Principal Engineer at Wissen TechnologySenior Principal Engineer at Wissen Technology