Summary
Overview
Work History
Education
Skills
Accomplishments
Work Availability
Quote
Timeline
Generic
Vishnu Vardhan Puttepu

Vishnu Vardhan Puttepu

Site Reliability Engineer

Summary

Certified and self-driven Site Reliability Engineer who is eager to apply extensive knowledge and experience in a practical setting. Skilled at providing beneficial IT support to clients, creating monitoring and testing solutions, developing automation tools and services, and analyzing the security of websites and mobile applications. Effective communicator with a deep passion for technology, strong analytical skills, and ability to perform well in a team. An IT professional with 7 years of Site Reliability Engineer, Cloud Computing, Infrastructure Automation, and Application Build & Deployment expertise. Strong domain knowledge in AWS Cloud, Infrastructure configuration management, Container services, and Linux and Windows infrastructure scripting. Infrastructure Administration and troubleshooting skills: 3+ years of experience (Linux,python, Windows, AWS, Vmware, Dockers administration).

Overview

8
8
years of professional experience

Work History

Site Reliability Engineer

HSBC Technology & Services
01.2022 - Current
  • Creating yaml files for infrastructure as code Onboarding new client from scratch in gcp containers Decommission of client
  • Implemented deployment strategies using container orchestration tools such as kubernetes.
  • Created team strategy for SDLC automation, configuration management and release management.
  • Configuring monitoring on grafana
  • Working on incidents based on pager alerts Investigating on newer issues and providing permanent solutions.
  • Collaborated with development teams to support current environment while transforming into cloud architecture.
  • Improved and tuned operational efficiency within infrastructure and production environment.
  • Participated in deploying and supporting applications on private and public cloud environments.
  • Maintained metrics visibility using Datadog and Prometheus/Grafana to create useful dashboards and monitors.
  • Managed AWS assets and integrated multiple AWS resources into solutions appropriate for company projects.
  • Contributed ideas and suggestions in team meetings and delivered updates on deadlines, designs, and enhancements.
  • Developed software and provided hands-on technical knowledge to design, deploy and optimize large-scale fault-tolerant systems.

Site Reliability Engineer

Qualcomm
04.2020 - 01.2022
  • As a SRE, always strived to maintain 100% application Uptime
    and handled outages effectively and worked on root causes
    to avoid issue in future
  • Created shell script for sudoers version for latest and legacy
    images for different OS families
  • Created shell script for rhel 8 imaging automation
  • Migrated many legacy deployment processes into CICD by
    using Docker and Jenkins groovy pipeline scripting
  • Converted many shell scripts into ansible playbooks as part of
    configuration management process
  • Performed patching activity to non-compliance servers.
  • Applied engineering principles to troubleshooting and followed up on defined corrective actions to prevent reoccurrences.

Site Reliability Engineer

ADP Private
06.2017 - 04.2021
  • As a SRE, always strived to maintain 100% application Uptime and handled outages effectively and worked on root causes to avoid issue in future
  • Created shell script for sudoers version for latest and legacy images for different OS families
  • Created shell script for rhel 8 imaging automation Worked on UEFI support for RHEL 7 and RHEL 8
  • Migrated many legacy deployment processes into CICD by using Docker and Jenkins groovy pipeline scripting Converted many shell scripts into ansible playbooks as part of configuration management process
  • Performed patching activity to non-compliance servers
  • Created python scripts for user requests for access granting Scaled containers from ICP-Kubernetes console
  • Deployed micro services into ICP-Kubernetes using Jenkins Writing Playbooks in Ansible to Automate Deployment Full builds deployment and Environment creation
  • Created python scripts for pre check and health status of infrastructure
  • Automated Disk clean up activity with Python scripts Leading automation of implementation and configuration work through Python
  • Planned and smoothly executed DR Drills
  • Deployed web based application into tomcat web servers using jenkins
  • Monitoring and restarting tomcat web servers using Python Analyzing Logs, Thread dump, Heap Dump and GC, for troubleshooting issues
  • Provide technical expertise throughout Agile software lifecycle including design, implementation and delivery Acting as part of Emergency Response Team to handle unplanned outages, with in SLO
  • Troubleshooting issues for Global Clients in live applications Carrying out blameless “Post-mortems/RCA” and driving permanent fix
  • Proactive identification of potential outages and their remedy
  • Building dashboards and Aggregating metrics Participating in CAB meetings for Sprint planning and Downtime budget Management
  • Working on Reactive to Pro active conversion of Monitors Writing and Configuring ,monitoring scripts using python for web and app server to ensure uptime of application
  • Work Along with Managers to improve process and improve efficiency
  • Created a CICD pipe line for application deployments continuously using Jenkins
  • Created Event Driven templates to work on regular application issues using python
  • Created Infrastructure as Code using Ansible to build application from scratch
  • Working on Agile frame work (scrum) and participating on All ceremonies
  • Developed Playbooks for application Auto-healing
  • Created a Self-service form in Ansible(AWX) for super users to full fill their requirements
  • Troubleshooting Ansible connectivity issues to target nodes Created Blue prism Bots for orchestration and Event driven Using HPOO created Health checks for servers
  • Working on Continuous integration and Continuous
  • Recorded daily events and activities in site diary to evaluate process and improve productivity.

System Administrator

IBM
12.2015 - 12.2017
  • Deployment using Jenkins and Ansible
  • Configured Jenkins Slaves and troubleshooting issues related to Slave's connectivity from Jenkins
  • Created Pipe line for Automatic Application Build using Ansible
  • Working on multiple AWS services like EC2, S3, VPC, AutoScaling, ELB.
  • Working on Compute, Storage, Networking and Database services in AWS
  • Conducting systems design, feasibility and cost studies and recommending cost-effective cloud solutions
  • Automated complete end-to-end Application setup using Ansible and vSphere console.

Education

B.TECH - Electronics Engineering

Skills

Client Liaison/Requirement Analysis Automating infrastructure and applications Configuration Automation Frameworks Maintenanceundefined

Accomplishments

  • Achieved end to end infrastructure procurement & setup using Ansible.
  • Automated end to end deployment work flow using Jenkins with Groovy scripts.
  • Migrated services from Rancher to kubernetes
  • Automated deployment process end to end using Blueprism and Jenkins
  • Migrated all the URL's to Google Cloud DNS
  • Implemented Event driven Automation for incident resolution using Ansible and Apache Airflow.
  • Integrated all the splunk Alerts to Xmatters for On-call alerting
  • Implemented AWX tower for Ansible GUI scripts execution

Work Availability

monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Quote

Every problem is a gift—without problems we would not grow.
Tony Robbins

Timeline

Site Reliability Engineer

HSBC Technology & Services
01.2022 - Current

Site Reliability Engineer

Qualcomm
04.2020 - 01.2022

Site Reliability Engineer

ADP Private
06.2017 - 04.2021

System Administrator

IBM
12.2015 - 12.2017

B.TECH - Electronics Engineering

Vishnu Vardhan PuttepuSite Reliability Engineer