Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic
AJITH KUMAR A

AJITH KUMAR A

Bengaluru

Summary

Results-driven IT Solutions Manager with over a decade of experience in IT operations, automation, and technical support for mission-critical systems. Proven expertise in designing and implementing anomaly detection tools and automation solutions using Python, Prometheus, ELK stack, and Grafana, significantly enhancing operational efficiency and proactive issue resolution. Adept at managing Remote Infrastructure Management (RIM) and maintaining 99.9% uptime across large-scale client environments.

Skilled in IT management, with a strong background in leading cross-functional teams, including L4 and senior engineers, to ensure seamless operations and achieve business objectives. Proficient in cloud architecture (AWS Certified Solutions Architect), database management (MySQL, MariaDB), and monitoring tools, delivering exceptional results in system observability and performance optimization.

Proven ability to handle critical escalations, coordinate with senior leadership and clients, and drive collaboration between product and business teams. Experienced in infrastructure technologies such as VMware, Apache, Nginx, and Linux system administration, with hands-on expertise in Ansible playbooks and DevOps practices. Strong communicator and strategist, consistently driving innovation, reducing downtime, and delivering measurable business value.

Overview

19
19
years of professional experience
1
1
Certification

Work History

Manager - Solutions

OnMobile Global
06.2021 - Current
  • Developed an anomaly detection tool leveraging ELK stack (ElasticSearch, Logstash, Kibana), Python, and Prometheus to analyze historical data and generate alarms and notifications.
  • Successfully implemented the anomaly detection tool across multiple in-house products, enhancing operational efficiency and proactive issue resolution.
  • Enabled Digital Marketing teams to optimize promotion scheduling by leveraging insights provided by the anomaly detection tool.
  • Supported Operations teams in identifying and addressing issues before application downtime, ensuring business continuity.
  • Designed and implemented various automation tools using Python, Prometheus, and Grafana to streamline monitoring and reporting processes.
  • Hands-on experience in ElasticSearch, Logstash, Kibana (ELK stack) for log aggregation, analysis, and visualization to improve system observability.
  • Proficient in creating Ansible playbooks for automated configuration management and infrastructure orchestration.
  • Achieved AWS Certified Solutions Architect certification, showcasing expertise in cloud architecture and deployment strategies.
  • Collaborated with cross-functional teams to drive automation initiatives, reducing manual effort and improving system reliability.

Manager - Product Support

OnMobile Global
07.2014 - 07.2021
  • Managed and supported all in-house products, technical applications, and systems, ensuring seamless operations and issue resolution.
  • Led and supervised the L4 support team, acting as the final escalation point for critical issues reported by L1, L2, and L3 operations teams.
  • Proficient in Linux systems administration with expertise in MySQL and MariaDB administration for database management and troubleshooting.
  • Demonstrated in-depth knowledge of load balancer technologies such as Apache and Nginx, optimizing performance and reliability for business-critical applications.
  • Acted as the primary point of contact for senior management and client teams during critical escalations, ensuring timely resolution and communication.
  • Conducted weekly issue analysis meetings with the Product and Business teams to review recurring problems and implement plans to prevent future occurrences.
  • Designed and delivered knowledge-sharing sessions for team members and cross-functional teams, fostering skill development and operational efficiency.
  • Established and maintained strong collaboration with product teams to align technical support strategies with business objectives.
  • Enhanced incident management processes by implementing proactive monitoring solutions and refining escalation workflows.

Lead - Operations

OnMobile Global
07.2008 - 07.2014
  • Managed and maintained over 500 client servers, ensuring 99.9% uptime for critical applications and systems.
  • Provided technical application support to ensure high availability and reliability of business-critical tools and services.
  • Coordinated with cross-functional teams to implement, test, and deploy applications and tools, ensuring seamless integration and functionality.
  • Supervised and mentored a team of engineers and senior engineers, fostering a culture of collaboration and continuous improvement.
  • Designed and enforced operational workflows to enhance system performance, monitoring, and incident management.
  • Conducted performance evaluations and provided hands-on training to team members, ensuring alignment with operational goals.
  • Established clear communication channels with stakeholders to facilitate the timely resolution of technical and operational challenges.
  • Played a pivotal role in identifying and addressing potential risks, ensuring minimal downtime during application rollouts or server maintenance.
  • Utilized data-driven approaches to analyze system performance and implemented measures to improve efficiency and reliability.

Senior Engineer

Kryptos Networks
02.2006 - 07.2008
  • Managed Remote Infrastructure Management (RIM) for critical client environments, ensuring seamless operations and reliability.
  • Maintained and monitored client servers, consistently achieving 99.9% uptime for business-critical systems.
  • Specialized in VMware technologies, performing virtual server provisioning, configuration, and maintenance.
  • Worked extensively with Concurrent Versions System (CVS) for version control and configuration management of client applications.
  • Conducted performance tuning, troubleshooting, and issue resolution for virtualized and physical server infrastructures.
  • Collaborated with cross-functional teams to support infrastructure upgrades and resolve complex technical issues.
  • Played key role in maintaining server health and optimizing resource utilization to support business objectives.

Education

MCA - Computer Applications

Bharathiyar University
Coimbatore, India

Bachelor of Science - Physics

Bharathiyar University
Coimbatore

Skills

  • IT Management: Expertise in managing IT operations, teams, and technical escalations for mission-critical systems
  • Anomaly Detection Tools: Proficient in developing and implementing anomaly detection solutions using ELK stack, Python, and Prometheus
  • Monitoring Tools: Skilled in Prometheus, Grafana, Nagios, and other monitoring tools to ensure system uptime and performance optimization
  • Cloud and Virtualization Technologies: Hands-on experience with AWS (Certified Solutions Architect), VMware, and virtualized environments In-depth knowledge of Linux and Windows Server Administration
  • DevOps and Automation: Skilled in creating Ansible playbooks, implementing automated workflows, and using CI/CD pipelines for seamless deployments

Certification

AWS Solutions Architect

Timeline

Manager - Solutions

OnMobile Global
06.2021 - Current

Manager - Product Support

OnMobile Global
07.2014 - 07.2021

Lead - Operations

OnMobile Global
07.2008 - 07.2014

Senior Engineer

Kryptos Networks
02.2006 - 07.2008

AWS Solutions Architect

MCA - Computer Applications

Bharathiyar University

Bachelor of Science - Physics

Bharathiyar University
AJITH KUMAR A