Summary
Overview
Work History
Education
Skills
Projects
Accomplishments
Timeline
Hi, I’m

Shashank Anupam

Bangalore
Shashank Anupam

Summary

Driven Data Engineer with 8 years of expertise in Hadoop, Spark, Scala, and reporting tools within the Data analytics domain. Proficient in architecting, developing, and optimizing data pipelines for actionable insights. Seeking a challenging role where I can apply my proven skills to propel data-driven decision-making and make significant contributions to innovative projects.

Overview

8
years of professional experience

Work History

Walmart

Senior Data Engineer
06.2021 - Current

Societe Generale

Senior Software Engineer
09.2020 - 06.2021

Tech Mahindra Ltd.

Software Engineer
01.2016 - 08.2020

Education

B.P.U.T

B.Tech

University Overview

GPA: 8.0

Skills

  • ETL development
  • Big Data Processing
  • Data Pipeline Design
  • Data Modeling
  • GCP
  • Big Query
  • Scala
  • Spark
  • SQL
  • Hive
  • Hudi
  • Airflow
  • Kafka
  • Tableau
  • PySpark
  • Talend
  • CI/CD
  • Unix
  • Git Version Control
  • Performance Tuning
  • Advanced SQL

Projects

IMS(Inventory Management System) :

  • Designed, created, and maintained data pipelines for Inventory data Management.
  • Consolidated Inventory reporting table to create a consolidated data view and reduce data redundancy. It saved around 60% of cloud costs on a daily basis to run the pipeline.
  • Created Inventory pipeline for promotional items and shelf management which generates yearly GMV of $300M.
  • Created Data validation and Data quality framework to ensure data completeness and correctness.

LIMO - Item Path Eligiblity :

  • Designed, created, and maintained NRT data pipelines to load BQ and Hudi tables.
  • Handled Upsert data volumes in TBs with the final table having 2 Trillion records and approx 120 TB storage volume. Business value generated by this project is $150M.

Cloud cost optimisation :

  • Spark tuning of jobs to reduce cluster size and job run time.
  • Created compression utility to store raw data in datalake compressed format. Enabled around 70% raw data compression and 60% storage cost saving.
  • Worked on data asset consolidation to reduce data redundancy in the system. Enabled autoscaling in cluster creation to optimize resource memory allocation in the data pipelines.

Accomplishments

  • Received Badgify Badges - Customer champion, Collaborator, Impact driver, Fire fighter.
  • Receives Bravo Award for managing Inventory Mart.

Timeline

Senior Data Engineer

Walmart
06.2021 - Current

Senior Software Engineer

Societe Generale
09.2020 - 06.2021

Software Engineer

Tech Mahindra Ltd.
01.2016 - 08.2020

B.P.U.T

B.Tech
Shashank Anupam