Summary
Overview
Work History
Education
Skills
Certification
Languages
Websites
Timeline
Generic

NEERAJ JOSHI

London

Summary

Experienced problem analyst with a dedicated focus, offering nearly a decade of versatile IT expertise as a proficient Site Reliability Engineer and accomplished Production Support Lead across dynamic sectors such as Banking, Financial Services, Securities and Insurance. Demonstrated track record in elevating and sustaining applications spanning diverse technologies, offering immediate and effective assistance to business users, while also spearheading process automation initiatives.

Key Highlights:

  • Possessing an extensive proficiency in ITIL incident management, change management and problem management methodologies.
  • Demonstrated the successful implementation and utilization of observability tools such as Dynatrace, Splunk, AppDynamics and Thousand Eyes.
  • Adeptly experienced in Agile and Scrum management, collaborating effectively with cross-functional global delivery teams.
  • Proficiently skilled in a range of programming languages including Java-J2EE, SpringBoot, Web Service API, Oracle SQL, Oracle PL/SQL,DB2, and Unix Shell scripting.
  • Accomplished in resource management, with a proven track record of leading teams of 30+ members, coupled with 4.3 years of valuable on-site experience at client locations in the UK.
  • Proficient in mentoring and training junior team members, and adept at effectively distributing and managing workload.
  • Eager and dedicated learner, with a strong enthusiasm for studying and implementing new technologies, encompassing Cloud technologies, Java SpringBoot and DevOps practices.
  • Actively engaged with emerging technologies like AI, reflecting a strong curiosity for staying at the forefront of technological advancements.
  • A committed team player, embracing an open mindset for receiving feedback and continually expanding knowledge horizons.

Objective:

Pursuing an engaging role within the information technology sector, aiming to harness and amplify my skills and capabilities. Dedicated to propelling personal development while consistently contributing resourcefulness, innovation, and adaptability. Fueled by a steadfast commitment to acquiring and applying novel technologies, with the ultimate goal of elevating organizational achievements.

Overview

11
11
years of professional experience
1
1
Certification

Work History

Site Reliability Engineer

Wipro LTD
Bangalore
12.2023 - Current
  • Monitored systems performance using various metrics such as latency, throughput, availability.
  • Ensured high availability and scalability of applications across multiple environments.
  • Implemented automation tools to increase efficiency in deployment processes.
  • Performed root cause analysis of production incidents and provided recommendations for improvement.
  • Documented best practices and procedures for incident response activities.
  • Troubleshooted complex issues related to application architecture and system configurations.
  • Developed and implemented monitoring solutions to improve system reliability.
  • Provided training sessions on SRE principles and best practices to team members.
  • Collaborated with development teams to ensure proper release engineering practices are followed.
  • Developed strategies for disaster recovery plans that ensured minimal downtime during an outage.
  • Coordinated with other engineers and construction professionals to manage various stages of construction projects.
  • Resolved unexpected technical difficulties and communicated solutions with clients and representatives.

Site Reliability Engineer

Wipro LTD
London
08.2019 - 11.2023

Playing a pivotal role in ensuring the utmost reliability, availability, and performance of critical banking systems and applications. My focus remains steadfast on cultivating a stable and efficient production environment while consistently elevating system reliability and scalability.

  • Proactively monitoring banking systems and applications to identify potential issues or incidents.
  • Conducted root cause analysis for incidents and implemented measures to prevent their recurrence.
  • Employed observability tools to analyze system performance and capacity trends, strategically predicting and planning for future requirements.
  • Streamlined system performance to seamlessly manage peak loads during high-traffic intervals, such as end-of-month or year-end financial activities, while maintaining an ongoing pursuit of refining system processes and procedures to elevate overall operational efficiency.
  • Developed, refined and maintained automation scripts and tools, a proactive endeavor that streamlined repetitive tasks, bolstered system reliability, diminished manual errors and magnified operational efficiency.
  • Engaged collaboratively with development, operations and cross-functional teams, encompassing developers, product managers, and business stakeholders, to adeptly oversee changes within the production environment and ensure impeccable operational continuity.
  • Ensured that all changes were well-documented and followed the organization's change management processes.
  • Worked with security teams to enforce security measures and best practices to protect sensitive financial data.
  • Actively participated in post-incident reviews, exerting meaningful contributions toward the perpetual refinement of system reliability and stability.
  • Proficient in comprehending and applying KPIs,metrics,and SLOs to drive performance and measure success.
  • Participated in the implementation of Dynatrace, educating application team members on its usage, alert setup & fine-tuning, dashboard building tasks, and enhancements as per application requirements.
  • Utilizing Splunk for effective searching, monitoring, and analyzing machine-generated data, including logs, metrics, and events, helped in troubleshooting, and providing valuable insights into system behavior.
  • Gained experience in migrating observability tools based on organizational requirements.
  • Independently orchestrated and led resolution efforts for critical incident calls from an application support standpoint.
  • Working on Proof of Concepts (POCs) for Ansible and Thousand Eyes implementations.
  • Getting involved in cloud migration activities from on-premises to the Google Cloud Platform

Techanical Lead/Senior Production Support Analyst

Wipro LTD
Bangalore
05.2017 - 08.2019

Managed and led multiple teams responsible for the production application support (L1/L2/L3), maintenance and enhancements of securities-based accounts, effectively prioritizing the workload management of the application support team.

  • Demonstrated adeptness in adhering to ITIL best practices encompassing Incident management, Problem management, Change management, and IT service management.
  • Owned and managed the problem management process, meticulously developing, coordinating, and advocating for the seamless execution of problem management activities across support teams.
  • Provided support up to L3 for both internally developed and third-party applications within distributed Linux and Windows environments. This support encompassed integration engineering, BCP solutions, stability monitoring and the timely notification of application events and deployments to business production environments.
  • Offered prompt and expert guidance on emerging trends and challenges that impacted service delivery and support.
  • Skillfully coordinated operational activities between the client and the offshore/onshore team.
  • Oversaw application infrastructure support, methodically structured DOP for infrastructural undertakings like patching and upgrades, and skillfully navigated life cycle management tasks.
  • Contributed to the meticulous planning of application releases and configuration changes.
  • Provided comprehensive user support for applications, efficiently managing outages and ensuring minimal to no business disruption.
  • Diligently analyzed live issues, effectively diagnosing root causes to mitigate problems.
  • Ensured problems were mitigated permanently through continuous stability follow-up (Problem Management), chairing weekly stability meetings with various development leads to deliberate production issues and their lasting solutions.
  • Conducted bi-weekly operational release activities through seamless coordination with developers and database teams.
  • Ensured the meticulous upkeep of process documentation to enhance batch and application support efficiency. This encompassed maintaining the both technical and non-technical documentation, along with delivering impactful contributions to management reports.
  • Facilitated daily status calls with the offshore team to enhance coordination.
  • Designed, mentored and crafted scripts/utilities for automating recurring daily tasks.
  • Drove automation endeavors by mentoring a team of over 30 offshore members.
  • Implemented essential production changes required for applications/tools(Change Management).
  • Enforced Agile methodology best practices in operational/monitoring tool implementations, manual task automations, and minor enhancements.
  • Assumed the role of a scrum master, diagnosing application defects, defining automation scopes, manual tasks, and problem record analysis, developing backlogs to be accomplished within the sprint period.
  • Efficiently onboarded new application support, ensuring smooth transitions, maintaining access, configuring monitoring and alerts, and establishing comprehension of SLAs and resource management.
  • Successfully implemented application monitoring intelligence tools like Dynatrace, Geneos, and Splunk, overseeing installations, data ingestion, visualization, report generation, and alert creation.
  • Extended support to both windows-based .NET applications and Java-based applications.
  • Designed and developed automated UI testing utilizing Selenium WebDriver.
  • Automated database check-outs and related requirements through Python scripting.
  • Spearheaded the integration of organization-specific bots for enhanced support activities (Wipro Holmes).
  • Engaged in minor enhancements and deployed applications over cloud infrastructure.
  • Developed Autosys JIL scripts and batch scripts for monitoring, along with the automation of application-specific requirements.
  • Collaborated closely with internal teams and external third-party vendors to troubleshoot and resolve intricate problems.
  • Possessed extensive experience in managing major incident calls, participating in problem review calls, and contributing effectively to client-facing meetings.
  • Expertly designed appropriate metrics to report on key performance and quality indicators, with a particular focus on in-depth trend analysis.
  • Prioritized both personal and team development, actively sharing anddisseminating knowledge among team members and internal stakeholders.
  • Proactively managed and enhanced the team's resource management, recognizingtraining needs for individuals and the team to meet application supportrequirements, ultimately fostering the growth of team members.
  • Effectively handled frequent changes in priority due to shifts in customerpreferences.

Software Engineer/ Technical Lead/Java Developer

Wipro LTD
Greater Noida
01.2013 - 05.2017
  • Analyzed customer feedback and created strategies for improvements.
  • Designed custom RESTful APIs for integrating third-party services.
  • Developed high-level architecture designs for complex software systems.
  • Implemented CI and CD pipelines for efficient application deployments.
  • Provided technical guidance and mentorship to software engineering teams.
  • Facilitated knowledge transfer workshops among team members on various topics.
  • Monitored performance metrics of deployed applications using APM tools.
  • Maintained documentation related to system architecture and operations.
  • Resolved production issues by troubleshooting root cause problems quickly.
  • Conducted interviews with prospective candidates during hiring processes.
  • Collaborated with product owners to define project requirements.
  • Optimized database queries for faster retrieval of data from databases.
  • Identified areas of improvement within existing software solutions.
  • Participated in sprint planning meetings to provide accurate estimates on tasks.
  • Evaluated new technologies and frameworks that could be used in projects.
  • Analyzed solutions and coding fixes for software problems.
  • Documented technical specifications and project testing methods for future reference.
  • Recommended enhancements and updates to system software based on performance data and user feedback.
  • Introduced automation tools to enhance workflow.

Education

M.TECH - Information Technology

BITS Pilani
Rajasthan
12-2016

Bachelor of Science - Non Medical

Vaish College
Rohtak
06-2012

Skills

  • ITIL
  • Dynatrace
  • Splunk Agile & Scrum
  • ServiceNow
  • JIRA
  • Confluence
  • GIT
  • GCP
  • Unix Shell Scripting
  • Oracle SQL
  • Java & Spring Boot
  • REST Api
  • AppDynamics
  • Thousand Eyes
  • Selenium
  • Wipro Holmes
  • AWS
  • DevOps
  • SharePoint
  • MicroSoft PowerBI
  • Python

Certification

  • AWS Cloud Practitioner.
  • Oracle Cloud Foundation.
  • Oracle Multi cloud Associate
  • Oracle Cloud Associated
  • Oracle cloud data migration Foundational
  • Oracle Cloud Associate AI Foundational
  • Generative AI.

Languages

Hindi
First Language
English
Intermediate (B1)
B1

Timeline

Site Reliability Engineer

Wipro LTD
12.2023 - Current

Site Reliability Engineer

Wipro LTD
08.2019 - 11.2023

Techanical Lead/Senior Production Support Analyst

Wipro LTD
05.2017 - 08.2019

Software Engineer/ Technical Lead/Java Developer

Wipro LTD
01.2013 - 05.2017

M.TECH - Information Technology

BITS Pilani

Bachelor of Science - Non Medical

Vaish College
NEERAJ JOSHI