Summary
Overview
Work History
Education
Skills
Current Project: OPTUM
Project: American National Insurance Company
Project: Intelligent Shipping Route
Certification
Hobbies and Interests
Disclaimer
Languages
Timeline
Generic
Abhishek Balasaheb Mali

Abhishek Balasaheb Mali

Pune

Summary

Accomplished Data Engineer with a proven track record at OPTUM, specializing in transforming raw data into actionable insights. Proficient in SQL and Azure Data Factory, I excel in designing efficient ETL processes and optimizing data pipelines. My strong analytical skills drive impactful business decisions through data visualization and governance.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

Optum
Pune
02.2025 - Current

Validated each FHIR entity using Scala validation functions and internal tools like Smith.

Migrated ETL processes from manual execution in Databricks to a production-grade approach.

  • Developed ETL code in IntelliJ, packaging it as a JAR for orchestration with Azure Data Factory.
  • Stored output as Delta tables in Silver layer for downstream consumption by Gold team.
  • Collaborated with DAP team to ingest raw healthcare data from source systems into Bronze layer.
  • Supported Gold team in mapping data to FHIR R4 entities for standardized healthcare reports.
  • Ensured accurate data transformation through rigorous validation and testing procedures.

Data Engineer

IBM
Pune
07.2022 - 01.2025
  • Data Engineer with over 3.3 years of experience transforming raw data into actionable insights and scalable data solutions.
  • Adept at designing and optimizing data pipelines, performing complex data analysis, and building dashboards that drive business decisions.
  • Proven track record in orchestrating ETL processes, implementing machine learning pipelines, and managing large-scale data architecture in cloud environments.
  • Strong expertise in SQL, Python, and big data technologies, with hands-on experience across various industries including finance, healthcare, and technology.
  • Core Competencies: Data Modeling and Pipeline Architecture, Snowflake Data Engineer, ETL/ELT Development & Automation, Data Wrangling & Preprocessing, Advanced SQL & Python (Pandas, PySpark), Data Warehousing (Redshift, Snowflake, BigQuery), Business Intelligence (Power BI, Tableau), Cloud Platforms (AWS, Azure), Machine Learning Pipeline Integration, Data Governance & Compliance (GDPR, SOC2), Data Visualization & Reporting.
  • Designed data pipelines for efficient data processing and integration.
  • Analyzed user requirements, designed and developed ETL processes to load enterprise data into the Data Warehouse.

Education

B.Tech - Civil Engineering

Kolhapur Institute Of Technology Engineering College
Kolhapur, India
01.2022

Skills

  • Azure Databricks
  • Azure Data Factory
  • Pyspark
  • SCALA
  • Python
  • Snowflake
  • Teradata
  • SQL
  • ETL
  • AWS
  • Redshift
  • Glue
  • BigData
  • Data Modeling
  • Data Migration
  • GIT
  • UNIX
  • PowerBI
  • Pandas
  • Data wrangling
  • Preprocessing
  • Data Warehousing
  • Business Intelligence
  • Cloud Platforms
  • Machine Learning
  • Data Governance
  • Compliance
  • Data Visualization
  • Reporting
  • Matplotlib
  • Numpy
  • Scikit Learn
  • Seaborn
  • NoSQL
  • MongoDB
  • PostgreSQL
  • DevOps
  • Agile
  • V-Model
  • Logistics Regression
  • Linear Regression
  • K-means
  • Hadoop
  • Cloud Computing
  • Apache Spark
  • Java
  • SQL Developer
  • MySQL
  • Data Mining
  • Data Analysis
  • Continuous Integration
  • Jenkins
  • REST
  • JSON
  • Django Rest Framework
  • PyCharm
  • Notepad
  • S3
  • Lambda
  • RDS
  • Step Function
  • Docker
  • Scrum

Current Project: OPTUM

Project Summary: Key Responsibilities & Workflow: Impact:

Feb 2025 – Present

Project: Fetch & Catch UHG (UnitedHealth Group)
Role: Data Engineer

Domain: Healthcare (FHIR / Medallion Architecture)
Technology Stack: Scala, PySpark, Azure Databricks, ADF, Storage Account, Git, IntelliJ, Delta Lake, Smith Tool

The project focuses on transforming raw healthcare JSON data into FHIR-compliant structured datasets using the Medallion architecture (Bronze → Silver → Gold). The DT team is responsible for validating, processing, and packaging reusable ETL pipelines for automation and downstream consumption.

  • Validations performed using Scala functions and Smith tool (e.g., id, meta, resourceType).
  • PySpark/Scala-based ETL converts JSON to Delta tables.
  • Code developed in IntelliJ and converted into JAR files.
  • JARs executed via Azure Data Factory pipelines for scheduled runs.

• Automated and standardized data transformation
• Ensured FHIR-aligned structured outputs
• Minimized manual runs via JAR packaging and ADF pipelines
• Enabled reliable downstream analytics and entity mapping

Project: American National Insurance Company

  • Role : Data Engineer
  • Domain: Insurance
  • January 2023 – January 2025
    Worked on automating ingestion, validation, cleaning, and matching of monthly client insurance data using Snowflake, AWS (S3, Glue, Athena), and Python (PySpark, Pandas), delivering high-quality, analytics-ready datasets with reduced manual effort.

Project: Intelligent Shipping Route

  • Client: Ocean Blue Logistics Solutions
  • Role: Data Engineer
  • Domain: Shipping & Logistics
  • July 2022 – December 2022
    Worked on data migration, ML-based automation pipelines, and ETL workflows using Python, AWS (Lambda, Glue, S3), Azure Data Factory, and Logic Apps to deliver optimized, data-driven solutions for logistics operations.

Certification

  • Fundamentals of AI & GenAI
  • Introduction to GenAI on Azure

Hobbies and Interests

  • Reading
  • Traveling
  • Bike riding

Disclaimer

I hereby declare that the above-mentioned information is correct to the best of my knowledge.

Languages

Marathi
First Language
English
Intermediate (B1)
B1
Hindi
Advanced (C1)
C1

Timeline

Data Engineer

Optum
02.2025 - Current

Data Engineer

IBM
07.2022 - 01.2025

B.Tech - Civil Engineering

Kolhapur Institute Of Technology Engineering College
Abhishek Balasaheb Mali