Accomplished Senior DevOps/SRE Engineer with a proven track record at Sequoia Consulting Group, enhancing platform reliability to 99.99% year-over-year. Expert in AWS, Terraform, and Python, with a knack for leading security vulnerability resolutions and cost management initiatives. Demonstrates exceptional problem-solving and automation skills, driving significant infrastructure optimizations.
Ensuring the platform and products running on them maintain the reliability of 99.99% Year-over-year.
Designed experience of AWS infra, Ubuntu Linux, Docker, VPC,Endpoints, Application Load balancers, WAF, NACL, CDN,Security groups and S3.
Designed Terragrunt modules, tools/scripts for automation and code reviews in Terraform, Terragrunt, Ansible and Python.
Creating and managing CI/CD pipeline using Jenkins, GitHub, AWS Secrets manager, ECR with build and release automation.
Ensuring the security vulnerability detection and fixing in platform and products using tools such as WIZ, Security Hub and Cyera.
Setting up in Metrics, logging and tracing using observability products on Datadog/ELK/CloudWatch/CloudTrail/GuardDuty, SIEM tools.
Setting up DBA/DB Optimisation of MySQL/Postgres/Mongo and Data sunrise.
Participate in architecture design review and implementation a in a simplified, secure, modular, loosely coupled and optimal capacity architecture.
Take the lead in security and privacy vulnerabilities detection and resolution in all areas of the platform.
Focusing on the cloud infra cost management by following best practices.
Exposure to product and platform optimisation initiatives such as POCs like implementing cyberark, rearchitecting legacy systems, etc
Part of the team working on SaaS product that works in the domain of AI (predictive analytics & ML)
Designing the Data pipeline and the micro-service architecture to facilitate real time streaming data(time series)Coming in via IoT devices.
Experience in setting up strimzi and confluent /Apache kafka on Kubernetes.
Experience on Containerised K8 deployments for microservices.
Working on building Data pipelines.
Leading Azure cloud for the product to support multi tenancy architecture.
Perform capacity management functions for various lab environments.
Setting up the NGINX configuration for web along with Django API channels using gunicorn wsgi.
Perform systems monitoring to provide proactive alerting and historical performance records using ELK stack.
Automated the Azure infrastructure using Terraform,configuration management life cycle using Ansible,Monitoring the environment, application logging using ELK.
Implemented OSI Soft PI system and Apache airflow DAG's
Involved on Soc2 Audit ,VAPT vulnerable testing assessment & bug fixing as per security standards.
Implemented the websocket protocol to stream the real time data over the HTTP Request/Response.
Single point of contact for Azure infrastructure.
Written Ansible playbooks to automate the installations and configurations of different tools like Tomcat, , Apache Http, IIS, MS-SQL.
Plan and execute the application deployments from scratch to go-live by coordinating cross teams.
Deploy SAP SAAS based applications on cloud (Azure, SAP Monsoon and SAP Converge) and On-prem data centers.
Automate infrastructure and application deployment using CI/CD pipeline, Jenkins, Ansible, and Ruby.
Automate testing and auditing of applications and infrastructure using Chef Inspec.
Developed Python module to integrate and test the applications after deployments .
Written various shell scripting to qualify the on-prem virtual machines before application deployments and configure the observation tools of applications.
Utilize JIRA for issue reporting, status, activity planning, tracking, and updating project defects and tasks.
Expertise in Python and Shell Scripting for Platform and Application automation jobs.
Migrated Productions Environments from Non-cloud data centers to Cloud data centers
Implemented IT infrastructure upgrades, ensuring minimal downtime during the transition process.
Improved system performance by identifying and resolving technical issues in a timely manner.
Delivered helpdesk service and support to customers.
Completed software updates and assessed security patches for optimized computer use.
Installation, configuration and Maintenance of Apache Tomcat server and Apache web server.
Trouble shooting, monitoring and administration of web and app servers.
Automated some regular activities by setting up the Cron Jobs.
Performing backup and recovery when new technologies related to Web or Application servers are being installed or upgraded.
Handling Incidents and Problem tickets across all the life cycles of the server.
Handling Change Control Requests for production changes.
Proficient in deployment and troubleshooting of JAR, WAR and EAR files in domain.
Experienced in Analyzing the Logs,dumps And HTTP Watch Trace To Find the cause and fix the Issue
AWS,Microsoft Azure and GCP
Terraform and terragrunt
Ansible
Docker and Kubernetes
Jenkins
Nginx and Tomcat
Python and shell scripting
Github and SVN
Datadog,ELK and prometheus
Mysql,Mongo DB and MS-SQL
WIZ and Cyera scan
Ubuntu Linux,Redhat and SUSE and Windows
Security Complaince
Certified Kubernetes application developer
Splunk Certified