Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Generic
Shubham Latkar

Shubham Latkar

Nagpur

Summary

Results-driven DataOps and DevOps Engineer with a proven track record at Electronic Arts (EA), specializing in AWS, Terraform, Kubernetes, Ansible, Docker, Jenkins, and Python. Experienced in automating monitoring solutions, optimizing resource management, and enhancing system performance to drive operational efficiency. Skilled in data visualization and infrastructure automation, ensuring scalable and high-performance solutions in dynamic, fast-paced environments. Recognized for strong problem-solving abilities, leadership, and a commitment to continuous improvement in cloud and DevOps operations.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Data OPS Engineer

Electronic Arts (EA)
Hyderabad
07.2023 - Current
  • Deployed Prometheus, Alertmanager, and Grafana in Kubernetes using Helm charts for real-time monitoring.
  • Integrated Grafana with Prometheus, Loki, and AWS EKS to collect and visualize system metrics and logs.
  • Packaged exporters in AWS ECR and deployed them within AWS EKS for efficient monitoring.
  • Migrated and centralized multiple legacy dashboards into Grafana, improving data accessibility.
  • Designed Grafana dashboards to monitor system performance, logs, and traces from Prometheus and Loki.
  • Configured Alertmanager with Slack and email notifications for real-time incident response.
  • Implemented Loki and Promtail agents for Kubernetes log collection and visualization.
  • Set up BlackBox Exporter to monitor endpoint availability and performance across multiple protocols.
  • Enhanced monitoring system performance using auto-scaling policies and post-incident analysis.
  • Used Python and Apache Airflow to schedule and automate compliance audits with stakeholder reporting.
  • Performed SLA Miss audits for Jira tickets and Airflow ETL jobs, implementing failure alerts.
  • Developed critical SLA dashboards to track ETL job performance and adherence to SLAs.
  • Created Airflow DAGs to extract query stats and store them in AWS RDS-MySQL (short-term) and S3 (long-term).
  • Designed dashboards to monitor query execution status, error logs, and performance trends.
  • Built a responsive toolbox in CMS for easy access to monitoring tools like Prometheus, Alertmanager, Grafana, and Runbooks.
  • Deployed, configured, and monitored Apache Airflow DAGs on AWS EMR to ensure efficient execution of ETL workflows while meeting SLAs.
  • Integrated Airflow alerts with email notifications for proactive failure detection, enabling timely resolution and minimizing disruptions.
  • Enforced annual IAM access key rotation policies and implemented automated key rotation mechanisms.
  • Optimized AWS infrastructure costs for EC2, EBS, ELBs, and S3 using AWS Trusted Advisor recommendations.

System Engineer

Infosys
Pune
09.2021 - 07.2023
  • Created and managed IAM users, roles, policies, and groups through AWS Console and Terraform, ensuring least-privilege access control.
  • Automated IAM policy updates and S3 bucket policies using Terraform, enforcing security best practices.
  • Implemented secure credential sharing by utilising OneShot, AWS Secrets Manager and role-based access.
  • Provisioned and managed EC2 instances, Elastic Load Balancers (ELB), and Auto Scaling Groups (ASG) using Terraform for high availability and scalability.
  • Designed and deployed launch templates to standardize EC2 configurations across environments.
  • Executed AMI upgrades from CentOS 7 to Oracle Linux 8 for multiple services running on EC2, ensuring minimal downtime and smooth transition.
  • Created and updated S3 bucket policies, configured lifecycle policies, and automated bucket policy updates through Terraform.
  • Managed DNS CNAME routes and security group rules, ensuring optimal traffic routing and network security.
  • Provisioned and optimized EMR clusters, automated cloning and termination, reducing cloud costs and ensuring efficient resource allocation.
  • Set up and managed AWS CloudWatch alerts to monitor infrastructure health and trigger automated responses.
  • Enforced annual IAM access key rotation policies and implemented automated key rotation mechanisms.
  • Optimized AWS infrastructure costs for EC2, EBS, ELBs, and S3 using AWS Trusted Advisor recommendations.
  • Managed support initiatives on Data & AI projects at EADP for seamless operations.
  • Performed on-call responsibilities with excellence, rapidly addressing issues to ensure peak performance and dependability in data engineering operations.
  • Created Ansible playbooks to automate exporter installations on specified AWS IPs.
  • Automated routine administrative tasks using Shell Scripting to enhance efficiency.
  • Integrated and managed Oozie workflows for automated job scheduling.
  • Monitored cluster health using Nagios, reducing downtime through proactive issue resolution.

Analyst

Hudl India PVT.LTD
Pune
07.2020 - 12.2020
  • Managed AWS EC2 instances, optimizing performance, storage, and networking.
  • Configured Auto Scaling for high availability and cost efficiency.
  • Set up security groups and network ACLs for access control.
  • Implemented Elastic Load Balancers (ELB) for traffic distribution.
  • Managed EBS volumes, snapshots, and backups.
  • Created AMIs and launch templates for Auto Scaling.
  • Designed and deployed VPCs with custom subnets and route tables.
  • Configured Internet Gateways and NAT Gateways for secure access.
  • Deployed Application and Network Load Balancers for reliability.
  • Segmented VPC networks for web, app, and database layers.

Education

Bachelor of Engineering - BE - Information Technology

Jayawantrao Sawant College of Engineering Pune
Pune, Maharashtra, India
06-2019

Skills

  • AWS
  • Terraform
  • Argo CD
  • Git
  • Linux
  • CI/CD
  • Docker
  • Helm Charts
  • Kubernetes
  • Ansible
  • Ariflow
  • Prometheus
  • Grafana
  • Alertmanager
  • SlackSDK
  • PromQL
  • LogQL
  • Loki
  • Promtail
  • KairosDB
  • Hadoop
  • Python

Accomplishments

Certification Of Appreciation - Insta Awards from Infosys, 02/01/23, Recognized with an Appreciation Certificate from Infosys for outstanding project deliverables.

Certification

c,c++ nanodegree certificate PrepInsta

Timeline

Data OPS Engineer

Electronic Arts (EA)
07.2023 - Current

System Engineer

Infosys
09.2021 - 07.2023

Analyst

Hudl India PVT.LTD
07.2020 - 12.2020

Bachelor of Engineering - BE - Information Technology

Jayawantrao Sawant College of Engineering Pune
Shubham Latkar