Dynatrace
Total Experience: 10 Years 8 Months
Work Aspirations
I would like to work in a dynamic environment which can offer me diverse roles and opportunities to grow, a place where I can utilize my potential and capabilities to the fullest towards the organization and can learn new things, adding-on to my current knowledge base and experience.
Experience Summary
Playing role of SRE (Site Reliability Engineering) Technical Manager responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services.
· Passionate about system reliability to influence and drive the strategic SRE mission Engage, influence, and evangelize SRE practices with development, operational and product groups to align technology service/solution delivery for for application monitoring for CompuCom-CSI India Pvt Ltd acquired by American E-Commerce & retail company ‘Office Depot Inc.’
· Drive quality accountability within the organization with well-defined processes, metrics, and goals for process quality. This includes leading effective postmortems and ensuring actions are followed-up.
· Manage availability, latency, scalability, and efficiency of Bloomberg applications development by instilling engineering reliability into our development life cycle with a focus on fault tolerant approaches.
· Drive capacity planning, performance analysis, instrumentation, and other non-functional systems requirements.
Define and report "progress" on strategic initiates and project level tasks to all stakeholders including senior executives, clients and use effective communication approaches with each constituency.
· Implement metrics driven processes to ensure service quality targets are met.
· Designed & implemented different Monitoring and Observability end-to-end solutions with tools Dynatrace, New Relic, Zabbix, Nagios, Prometheus, Grafana and provides functional and technical support, handle complex incident management, and execute controls for application monitoring.
· Excellent APM using Dynatrace, New Relic of production flow in very fast-paced environment. Excellent analytical, troubleshooting, and problem-solving abilities; including Root Cause Analysis and permanent fix. Strong in integration application maintenance and support, strong integration SDLC experience.
· Expert in implementing custom instrumentation for various technology stack & business divisions through APM tools.
· Expert in creating monitoring dashboards, transactions, alerts, policies, SOP, SLO and SLI for tech services.
· Strong communication skills to act as a liaison between developers, business, and other stake holders.
· Experience leveraging and supporting log/error/audit monitoring tools: Dynatrace/Splunk & New Relic/Zabbix/Nagios Monitoring, incident management, including RCA, improvement/efficiency opportunities, and support management.
· Technical, Application, System, Security, and Administration management of including patches, minor upgrades, etc.; both on-prem and cloud.
· Execute full Disaster Recovery and support Business Continuity Plan. Batch and Interface monitoring and management. Regression testing support and production deployments. Expertise in working with creating dashboard and data extraction using dashboard reports.
· Expert in Real User monitoring & Synthetic Monitoring Well acquainted Database Monitoring using DB monitoring Dynatrace plugin along with infrastructure.
· Familiar with Splunk queries & dashboard.
· Familiar with latest technologies like AWS Services, Azure, Kubernetes, Micro-services, Messaging Platform AMQ / Kafka & Ansible. Strong oral and written communication skills.
Playing role of SRE (Site Reliability Engineering)
Technical Manager responsible for system reliability,
developer productivity and reducing time to market by
striving to reduce technical debt of the services.
Leading a Monitoring Team for different business solutions & modules which implements end-to-end monitoring
Individual contributor for developing PTC system Monitor using Dynatrace appmon solution for PLM, ALM, SLM web applications
Worked as software testing engineer for PTC monitoring tool based on dynatrace
Project Trainee Engineer
Reliability / Monitoring / Observability SME
undefinedDynatrace
New Relic
Zabbix
Nagios
Splunk
Ansible Tower / AWX
Postman / Insomnia
Prometheus
OpsGenie