Sayak Mukhuty

Senior Consultant at Deloitte

Kolkata

Summary

Dynamic Senior Consultant with extensive experience at Deloitte, specializing in large-scale data migrations to Azure Databricks. Proven expertise in Spark Scala and Delta Lake, enhancing security and governance. Adept at leading teams and driving project success, while fostering collaboration and innovation in fast-paced environments.

Overview

years of professional experience

Work History

Senior Consultant

Deloitte

07.2023 - Current

Led two large-scale migrations for a financial services client from Hadoop to Azure Databricks, migrating twelve Spark Scala applications with ADLS Gen2 integration and Delta Lake implementation.
Drove Unity Catalog migration for non‑UC applications, enhancing security, governance, and access controls; successfully migrated all twelve applications and supported new business use cases.
Acted as Lead Developer, managing offshore delivery with teams of up to four developers and two testers, resolving technical, access, and domain challenges while ensuring delivery quality.
Designed migration architecture, collaborated closely with clients for process approval, provided regular status updates, and performed hands-on troubleshooting to stabilize and optimize applications.
Supported data validation and reconciliation, partnering with QA teams on data comparison between Hadoop and Databricks environments.
Served as a technical interviewer for Databricks and Azure Data Engineer roles, conducting approximately 50 project-specific interviews, including weekend panels.
Contributed at the firm level by supporting resource planning and deployment of approximately 70 professionals across multiple Deloitte engagements.
Worked with leadership on rate cards and IFA inputs, and helped track timesheets and revenue leakage in coordination with finance teams.

Senior Data Engineer

Hexaware Technologies

03.2021 - 07.2023

Developed Spark Scala batch processing jobs to load and transform large-scale transaction data using Spark Core APIs and Spark SQL, implementing complex business transformations.
Scheduled and monitored batch workloads on an on-premise CDH cluster, ensuring reliable and timely data processing.
Improved batch load performance by applying Spark best practices, including data partitioning, broadcast joins, and key salting to mitigate data skew and reduce execution time.
Performed advanced analytics using Spark and Scala, and loaded curated datasets into Hive tables, enabling the BI team to generate monthly reports, dashboards, and analytical charts.
Worked on an Azure Databricks Proof of Concept (POC) involving: creation and configuration of Azure Data Lake Storage (ADLS) Gen2; setup of a Service Principal and Secret Scopes to securely mount ADLS Gen2 on DBFS; integration with Azure Key Vault for credential management; creation and management of Delta Lake tables; development and execution of Spark Scala jobs and transformations in Azure Databricks; and orchestration of Spark jobs using Azure Data Factory.

Senior Data Engineer

LTM (Mindtree)

06.2018 - 03.2021

Developed multiple MapReduce jobs to ingest flat files into Cassandra (NoSQL), enabling a comprehensive Customer 360 view and supporting advanced analytics use cases.
Led the migration of MapReduce workloads from on-premise clusters to the AWS cloud, leveraging Amazon S3, EC2, AWS Console, Athena, and Airflow to orchestrate and execute cloud-based data processing pipelines.
Contributed to the customer activation platform migration from on-premise infrastructure to AWS, transitioning Hive table storage from HDFS to Amazon S3. Automated execution of HQL scripts using Airflow on EC2 and performed data validation using Amazon Athena.
Developed a Spark Structured Streaming application that consumes data from Kafka topics and encrypts customer PII in near real time, ensuring compliance with data security and privacy requirements.

Education

B.Tech - Computer Science

Kalinga Institute of Industrial Technology

Bhubaneswar

04.2001 -

Skills

Scheduling Tools: Oozie, AutoSys

Database: SQL, PostgreSQL, NoSQL (Cassandra)

Domains Worked: Healthcare, Hospitality (hotel chains), Financial Services

Big Data: Hadoop, MapReduce, HDFS, Spark, YARN, Hive, Spark Streaming, Impala

Tools: Databricks, Jenkins, Terraform and Ansible basics, Delta Lake, Delta tables, CI/CD

Cloud (Basic): AWS S3, AWS Athena, Azure proficiency

Others: Scala, Java, Kafka basics, Agile methodology, Python basics
