Summary
Overview
Work History
Education
Skills
Timeline
Generic

Kavana M

Senior Engineer
Bengaluru,KA

Summary

Dynamic Senior Engineer with a proven track record at Dell EMC, specializing in observability solutions using Prometheus and Grafana. Expert in automating monitoring deployments and enhancing incident response times. Skilled in PostgreSQL and cross-functional collaboration, driving performance optimization and delivering impactful results in high-availability environments.

Overview

13
13
years of professional experience

Work History

Senior Engineer

Dell EMC
06.2019 - Current


  • Designed and maintained observability stack using Prometheus and Grafana, enabling real-time monitoring and alerting for critical services.
  • Developed and optimized dashboards for system health, performance metrics, and SLA compliance.
  • Managed CPDB, an in-house observability tool running on k3s and PostgreSQL, ensuring high availability and scalability.
  • Automated deployment and configuration of monitoring components across multiple environments.
  • Collaborated with cross-functional teams to define SLOs and improve incident response times.
  • Designed and implemented an Agent AI-powered chatbot within CPDB to translate natural language queries into actionable metrics and dashboards.
  • Migrated ScienceLogic monitoring to Prometheus/Grafana stack.

Consultant

Capgemini
10.2016 - 06.2019

Client: Cisco Systems, Inc.



Responsibilities:

• Installation, maintenance and configuration of ScienceLogic tool and DB.

• Installing ScienceLogic Agents across the environment and onboard the components supposed to be monitored.

• Writing dynamic apps, RBAs.

• Creating and configuring event policies, automation policy and action policy.

• Investigating why alert/ticket wasn’t generated.

• Investigating swap issues, sigterm issues.

• Working on issues with services like mysqld, snmptrapd, trapserver.

• Whitelisting and blacklisting syslogs mnemonics and traps.

• Integrating EM7 with third party ticketing tool like casesentry and eBonding.

• Investigating any issues with the integration of the same.

• Working with DRBD sync issue and split brain issue.

• Working with High frequency and medium frequency issues.

• Maintaining message collectors and data collectors.

• Importing and compiling MIBs.

• Installing powerpacks.

System Engineer

NetApp
05.2015 - 10.2016

• Work closely with different teams including vendor end consultants in the Setup, configuration and management.

• Adding Windows, Linux, Network devices and URLs under monitoring and managing them using SNMP, WMI and WINRM in a distributed architecture.

• Writing notification and triggers.

• Adding users and assigning roles and responsibilities by setting the permissions.

• Installing ZenPacks. Adding MIBs.

• Writing transforms to include/exclude alerts and set the severity for the traps from network devices. Setting up parameters, thresholds, action profiles.

• Adding Collectors and Hubs.

• Integration of Zenoss with service now ticketing tool.

• Troubleshooting issues with monitoring.

• Managing splunk indexers and search heads.

Assistant Engineer

TCS Siruseri
12.2012 - 05.2015

Client: NetApp

IBM Tivoli Monitoring:

• Installing OS and universal agents in newly added Windows, UNIX, Linux servers.

• Configuring the agentless monitoring for Windows, Linux, and UNIX.

• Configuring different monitoring parameters and new requirement in log file and server monitoring.

• Troubleshooting the issues related with agent communication and history data collection.

• Creating Users and customized the workspace based on the client requirement.

IBM Tivoli Netcool Omnibus:

• Customizing the console based on client requirements like FILTERS, VIEWS, etc.

• Monitoring all the alerts from server and network monitoring tools and troubleshooting issues related with automatic mail notification mechanism.

• Working on triggers.

• Creating users and roles as per the requirement in TIP Portal.

IBM Tivoli Network Manager:

• Creating the customized Network topology views.

• Discovering new device links and monitoring the network devices of multiple vendors.

• Configuring alert for the traps generated from devices.

• Troubleshooting issues related network devices monitoring. HP Server automation:

• Creating Device groups and user groups and Assign roles based on the users requirements.

• Installing Opsware agent and Opsware clients.

• Troubleshooting the issue on agent installation and accessing the servers through HPSA client console.

HP Network Automation:

• Creating Device groups and user groups and Assign roles based on the users.

• Adding new devices and map to the groups to access by the network administrators.

• Troubleshooting the issue on accessing the Network Devices through HPNA client console.

Education

Bachelor of Engineering - Computer Science And Engineering

UBDT College of Engineering
Davangere, India
06-2012

Skills

  • Monitoring & Alerting: Prometheus, Grafana, Alertmanager
  • Custom Observability Tools: CPDB (in-house platform)
  • Container Orchestration: Kubernetes (k3s)
  • Database Management: PostgreSQL
  • Scripting & Automation: Ansible, Bash, Python
  • Performance Optimization & Troubleshooting
  • Infrastructure as Code: Helm

Timeline

Senior Engineer

Dell EMC
06.2019 - Current

Consultant

Capgemini
10.2016 - 06.2019

System Engineer

NetApp
05.2015 - 10.2016

Assistant Engineer

TCS Siruseri
12.2012 - 05.2015

Bachelor of Engineering - Computer Science And Engineering

UBDT College of Engineering
Kavana M Senior Engineer