Sujeet Yadav

Pune

Summary

Site reliability engineer with over 8 years of experience in production support and AIOps. Strong focus on incident management and root cause analysis, leading to significant improvements in system reliability and performance. Skilled in managing SLAs and SLOs for large-scale distributed systems in the financial services industry.

Overview

9 years of professional experience

Work History

Associate - Projects

Cognizant
Pune
07.2024 - Current
  • Ensured high availability and reliability of critical banking applications, supporting uninterrupted financial services.
  • Led incident management, root cause analysis, and postmortem reviews, reducing the frequency of recurring issues.
  • Standardized monitoring and alerting with Dynatrace, improving response times across multiple applications.
  • Utilized AIOps tools for alert correlation, noise reduction, and intelligent incident management.
  • Automated deployment workflows, enhancing efficiency and reducing manual effort.
  • Monitored system performance metrics, including availability, latency, and throughput.

Consultant

Infosys
Bengaluru
10.2023 - 07.2024
  • Enhanced system stability through proactive monitoring and alerting to prevent downtime.
  • Streamlined service delivery by executing ITSM processes for improved efficiency.
  • Facilitated capacity planning and ensured production readiness for optimal resource allocation.
  • Supported high-volume transaction systems with strict SLA requirements.

IT Analyst

Tata Consultancy Services
Bengaluru
10.2017 - 10.2023
  • Achieved a 20% reduction in system downtime through proactive measures and automation.
  • Conducted regular system monitoring, tuning, and troubleshooting to maximize performance efficiency.
  • Fine-tuned configurations and implemented scaling strategies to optimize system performance.
  • Oversaw management of various operating systems, maintaining over 40 applications and 100+ Linux servers.
  • Deployed and configured Dynatrace OneAgent for diverse applications, enhancing overall monitoring.
  • Developed a disaster recovery plan that enhanced system resilience and minimized recovery duration.
  • Identified process improvements that increased operational efficiency and boosted team output.
  • Collaborated with cross-functional teams to implement continuous improvements across services.

Education

Bachelor of Engineering - Computer Engineering

Amrutvahini College of Engineering
Sangamner
06.2017

Skills

  • Incident management and monitoring tools
  • Application monitoring
  • Availability monitoring
  • Disaster recovery
  • Dynatrace, Splunk, OpenSearch, Prometheus, Grafana, Wily, Sitescope, Argos
  • AIOps tools and BigPanda integration
  • Automation strategies
  • Capacity planning
  • Change management
  • Process improvement
  • Root-cause analysis
  • Cross-functional collaboration
  • Team collaboration
  • Problem solving
  • ChatGPT and Biggy integration

Accomplishments

  • Reduced system downtime by 20% through proactive monitoring and automation.
  • Implemented a disaster recovery plan that improved system resilience and reduced recovery time.
  • Optimized system performance by fine-tuning configurations and implementing scaling strategies.
  • Found innovative solutions to enhance system reliability and performance.
  • Collaborated with cross-functional teams to contribute to a culture of continuous improvement.
  • BigPanda Operator certified.

Timeline

Associate - Projects

Cognizant
07.2024 - Current

Consultant

Infosys
10.2023 - 07.2024

IT Analyst

Tata Consultancy Services
10.2017 - 10.2023

Bachelor of Engineering - Computer Engineering

Amrutvahini College of Engineering