AMANPREET BHATIA

Bengaluru

Summary

IT professional with over 10 years of experience in Big Data, Cloud Engineering, and Data Pipeline Development. Skilled in processing large-scale datasets using Hadoop, Spark, Hive, and HBase. Proven record of building scalable data solutions on AWS and Azure, with strong proficiency in Python, SQL, CI/CD automation, microservices, and performance optimization.

Overview

11 years of professional experience
1 Certification

Work History

Model Monitoring

Experian
03.2022 - Current
  • Designed and implemented a system for model drift detection
  • Built data exhaust pipelines in Python and automated calculations using Airflow
  • Created APIs for drift retrieval and deployed microservices on AWS ECS via Jenkins CI/CD
  • Managed data validation using PySpark and stored results in Elasticsearch
  • Technologies: PySpark, AWS (S3, EMR, ECS, ECR, CloudWatch), Airflow, Flask API, Kubernetes

Depletions

Diageo
06.2019 - 06.2021
  • Built global data lake for distributor analytics using Spark and Azure Data Factory
  • Standardized multi-format data using Spark
  • Designed pipelines with ADF and Dremio; implemented CI/CD for Databricks
  • Technologies: Spark, Python, Azure (ADF, ADLS, Blob), Dremio, Databricks

Experian BIS SALT

Tavant Technologies
01.2017 - 04.2019
  • Integrated SBFE data into Experian’s Big Data ecosystem
  • Developed Spark/Scala pipelines for validation and ingestion
  • Technologies: Spark, Scala, HBase, Hive

Scanview & AMD-CodeXL

AMD India Pvt. Ltd (via Magna Infotech)
07.2014 - 01.2017
  • Developed Spark batch processing systems for GPU performance analysis
  • Built backend data pipelines and conducted GPU testing
  • Technologies: Spark, Python, OpenCL, OpenGL

Education

B.E. - Computer Science

RGPV
Bhopal

Sr. Secondary

MP Board
01.2009

High School

MP Board
01.2007

Skills

  • Programming: Python, Scala
  • Big Data: Spark, Hadoop, Hive, HBase, MapReduce, Databricks, Dremio, NiFi
  • Cloud Platforms: AWS (S3, EMR, ECS, ECR, CloudWatch, SageMaker, Kubernetes, Lambda), Azure (ADF, ADLS, Blob)
  • DevOps & CI/CD: Jenkins, Azure DevOps, Terraform, ARM Templates
  • Messaging & APIs: Kafka, FastAPI
  • Databases: MySQL, HBase
  • Monitoring & Observability: Grafana, Prometheus
  • Tools: Maven, Jira
  • Operating Systems: Linux, Unix, Windows
  • IDEs: Eclipse, IntelliJ, PyCharm

Certification

Databricks Developer Certification for Apache Spark
