Arun K

Bengaluru

Summary

Experienced Site Reliability Engineering (SRE) Lead with over 9 years of expertise in building and maintaining highly available, scalable, and secure systems across cloud and hybrid environments. Skilled in applying SRE principles, performing root cause analysis, driving automation, and enhancing observability with tools such as Splunk, Grafana, Dynatrace, and Kibana. Strong background in infrastructure provisioning, incident management, and team leadership. Holds the AWS Certified Developer Associate, Microsoft Azure Fundamentals, and Microsoft Azure AI Fundamentals certifications. Known for fostering operational excellence and aligning technical efforts with business goals.

Overview

10 years of professional experience
1 Certification

Work History

Packaged App Development Specialist

Accenture
Bangalore
06.2023 - Current
  • Lead SRE initiatives to ensure the high availability, scalability, and performance of distributed production systems.
  • Continuously monitor system health and implement proactive strategies to detect and resolve issues before they affect end users.
  • Collaborate closely with development and infrastructure teams to enhance system reliability and reduce technical debt.
  • Design and maintain observability frameworks for effective monitoring, alerting, and root cause analysis.
  • Drive automation efforts to eliminate operational toil and improve deployment and recovery processes.
  • Provision and manage AWS infrastructure, including EC2 instances, to keep systems scalable under variable workloads.
  • Mentor and develop team members to strengthen technical capabilities and promote a culture of reliability and continuous improvement.
  • Oversee ITIL-aligned processes for incident, problem, and change management to ensure operational stability and compliance.

Technical Lead

Wipro Technologies
Chennai
06.2015 - 06.2023
  • Led reliability engineering initiatives to ensure the scalability, availability, and performance of distributed systems in production environments.
  • Enabled early issue detection and minimized downtime through proactive monitoring, alerting, and incident response strategies.
  • Partnered with development and infrastructure teams to enhance system resilience and reduce technical debt.
  • Designed and implemented observability frameworks using tools like Splunk, Grafana, Dynatrace, and Kibana to drive data-informed decisions.
  • Drove automation of repetitive operational tasks using modern SRE tools and practices, significantly reducing toil and improving efficiency.
  • Mentored and guided team members to foster a culture of reliability engineering, continuous learning, and operational excellence.

Education

MASTER OF TECHNOLOGY - SOFTWARE ENGINEERING

Birla Institute of Technology & Science
11.2019

BACHELOR OF COMPUTER APPLICATION

Bharathiar University
03.2015

Skills

  • Java Microservices
  • Python
  • Python Flask
  • React.js
  • Node.js
  • Amazon Web Services
  • Splunk
  • Dynatrace
  • Grafana
  • Kibana
  • SQL
  • Jenkins
  • Git
  • GitHub Actions
  • OpenShift
  • Shell Scripting
  • IBM Tealeaf
  • Catchpoint

Certification

  • Microsoft Azure AI Fundamentals
  • Microsoft Azure Fundamentals
  • AWS Certified Developer - Associate

Timeline

Packaged App Development Specialist

Accenture
06.2023 - Current

Technical Lead

Wipro Technologies
06.2015 - 06.2023

MASTER OF TECHNOLOGY - SOFTWARE ENGINEERING

Birla Institute of Technology & Science

BACHELOR OF COMPUTER APPLICATION

Bharathiar University