Arghadip Paul

Data Analyst
Kolkata

Summary

Results-driven data analyst with practical data engineering skills, specializing in data modeling, ETL development, SQL, and dependable data pipelines that support advanced analytics and business intelligence.

Overview

2 years of professional experience

Work History

Data Analyst

MOL-IT
09.2023 - 01.2026
  • Designed and developed scalable ETL/ELT pipelines using PySpark and Apache Spark, processing large-scale structured and semi-structured datasets.
  • Built optimized Spark DataFrame and Spark SQL transformations, improving query performance through partitioning, caching, broadcast joins, and execution plan analysis.
  • Worked hands-on with Azure Data Lake Storage (ADLS Gen2) for scalable storage, Azure Synapse Analytics for data warehousing and analytics workloads, Azure Data Factory for pipeline orchestration, and Microsoft Purview (formerly Azure Purview) for data cataloguing and governance.
  • Architected and implemented Lakehouse solutions using Delta Lake, enabling ACID transactions, schema evolution, and time-travel capabilities.
  • Orchestrated data pipelines and batch workflows ensuring reliability, monitoring, and fault tolerance.
  • Implemented logging, monitoring, and observability practices using metrics-based tracking to proactively detect and resolve service disruptions.
  • Diagnosed and resolved performance bottlenecks across distributed Spark environments (RDDs) through query plan analysis and cluster resource monitoring.
  • Designed data governance frameworks including lineage tracking, schema validation, and cost optimization strategies.
  • Developed unit and functional test cases for data pipelines ensuring data quality and consistency across environments.
  • Created operational documentation, runbooks, and UAT support documentation for smooth production deployments.
  • Collaborated with cross-functional teams to translate business requirements into scalable data models (Fact & Dimension tables) following star schema design principles.

Internship

VARUN BEVERAGES Ltd.
07.2022 - 08.2022
  • Monitored beverage production processes handling 50,000+ bottles per shift, ensuring compliance with FSSAI, GMP, and HACCP standards.
  • Assisted in optimizing blending and filling line operations, contributing to a 5% reduction in minor process deviations.
  • Supported CIP (Clean-in-Place) procedures, helping reduce contamination risk incidents by 10% during the internship period.
  • Company profile: PepsiCo manufacturer and distributor.

Education

West Bengal Board of Secondary Education (WBBSE)
West Bengal, India
08-2017

West Bengal Council of Higher Secondary Education (WBCHSE)
West Bengal, India
08-2019

B.E. - Food Technology & Biochemical Engineering

Jadavpur University
West Bengal, India
08-2023

Skills

ETL and ELT pipeline development

Lakehouse architecture

Delta Lake

Data modeling (fact and dimension tables)

PySpark integration

Data governance

Cloud data architecture

Python programming

SQL Server Management Studio (SSMS)

Spark SQL

Azure Databricks

Microsoft Purview

Azure Data Factory

Azure Data Lake Storage (ADLS Gen2)

Databricks

Azure Log Analytics

Projects

Home-Asphere - Airbnb-like Rental Marketplace

A production-ready full-stack rental marketplace web application built with modern technologies.

  • https://github.com/Divine89/Hope-Asphere.git

Work Availability

Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday
Morning, afternoon, evening

Software

Databricks

Azure Data Factory (ADF)

SSMS

Power BI

DevOps

Git, GitHub
