Summary
Overview
Work History
Education
Skills
Certification
Languages
Timeline
Generic
SHANTANU  WADHANKAR

SHANTANU WADHANKAR

MUMBAI

Summary

Experienced software engineer adept at data migration, specializing in SAS to Databricks conversion using PySpark. Skilled in optimizing SQL queries for performance and leveraging cloud platforms like AWS and AZURE. a proven track record in combating fraud within healthcare insurance, implementing robust data solutions to enhance operational efficiency and ensure financial integrity.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Software Engineer

Axion Connect
Mumbai
12.2023 - Current

Project :

SAS to Databricks Migration (Finance)

  • Designed and implemented an efficient ETL process to migrate financial data from Oracle to Azure Data Lake Storage Gen2 (ADLS gen-2) using Databricks and Azure Data Factory.
  • Translated SAS jobs into PySpark to replicate existing functionality and enhance performance using optimization techniques.
  • Developed ETL pipelines using Databricks and Azure Data Factory to extract data from Oracle databases to ADLS gen-2.
  • Translated SAS job logic into PySpark to maintain and improve data processing workflows.
  • Implemented optimization techniques to enhance job performance and efficiency.
  • Utilized spark.sql and SQL to convert SAS proc sql statements to PySpark equivalents for seamless migration and improved performance.

Associate Software Engineer

GSSI
Hyderabad
07.2021 - 12.2023

Combatting Fraud in Healthcare Insurance:

  • Addressed the pervasive issue of fraudulent claims in healthcare insurance, which accounts for a significant portion of payments worldwide, posing a multi-billion-dollar challenge. Implemented solutions to enhance fraud detection and prevention.
  • Roles & Responsibilities:
  • Migrated data from RDBMS sources to AWS and Snowflake using PySpark.
  • Developed PySpark jobs for data extraction into S3 and transformation/ingestion into Snowflake.
  • Optimized SQL scripts using Spark SQL for improved performance.
  • Created SQL scripts for CDC data on transactional tables.
  • Wrote Snowflake DDL scripts and queries for data analysis.
  • Conducted unit testing for the developed tables.
  • Implemented SCD type 1 & 2 logic and joins in PySpark scripts.

Education

PGDGI -

Swami Ramanand Teerth Marathwada University
Nanded
05-2021

Master of Science -

Babasaheb Ambedkar Marathwada University
Aurangabad
04-2019

Bachelor of Science -

Shivaji University, Kolhapur
Kolhapur
03-2017

Skills

  • Relational Database: MySQL, Oracle
  • SQL, sparksql, proc sql
  • Big Data Ecosystem: HDFS, Hive, Map-Reduce, Apache Spark,and Pyspark
  • Python,
  • AZURE Services: ADF, Azure Databricks, Data Lake Storage, andAzure SQL Database
  • AWS Services: S3, EMR, RDS, GLUE, LAMDA, EC2, REDSHIFT
  • Operating System: Linux (ubuntu), Windows

Certification

  • Fundamentals of the Databricks Lakehouse Platform Accreditation (V2)

Languages

English
First Language
Hindi
Proficient (C2)
C2
Marathi
Proficient (C2)
C2

Timeline

Software Engineer

Axion Connect
12.2023 - Current

Associate Software Engineer

GSSI
07.2021 - 12.2023

PGDGI -

Swami Ramanand Teerth Marathwada University

Master of Science -

Babasaheb Ambedkar Marathwada University

Bachelor of Science -

Shivaji University, Kolhapur
SHANTANU WADHANKAR