SAMIT RANJAN JENA

Bhubaneswar

Summary

Experienced Senior Data Engineer with over 10 years in IT, specializing in BI and data warehouse products. Brings broad technical skills and expertise in Azure Cloud and Microsoft Business Intelligence (MSBI). Proven ability to design end-to-end solutions using Azure Data Factory (ADF), Databricks, Azure SQL DW, Spark, Python, Scala, SQL, and Power BI. Proficient in data modelling, implementing ETL pipelines, optimizing Spark jobs, and developing insightful reports and dashboards using Power BI and DAX. Skilled in developing cost-effective solutions and communicating effectively with stakeholders.

Overview

12 years of professional experience
1 Certification

Work History

Senior Data Engineer

FARFETCH
Bengaluru, KA
02.2022 - Current
  • Led end-to-end data integration solutions using Azure Data Factory (ADF) and Azure Databricks to orchestrate the movement and transformation of data from various sources, including Azure Blob Storage, Azure SQL Database, Azure Data Lake Storage, and on-premises systems
  • Implemented error handling and logging mechanisms within Azure Data Factory, resulting in a substantial reduction in data processing errors and an overall improvement in data reliability
  • Developed Azure Databricks notebooks to efficiently handle large volumes of data and execute complex data transformations, enhancing the overall data processing capabilities
  • Implemented optimizations in Azure Databricks, including partitioning, broadcast joins, caching strategies, dynamic allocation, and Spark cluster configuration adjustments, leading to a remarkable 30% improvement in Spark job performance
  • Conducted in-depth analysis of Directed Acyclic Graph (DAG) execution plans, identified and addressed bottlenecks in Spark SQL queries, significantly improving the efficiency of Spark job execution and reducing unnecessary data shuffling
  • Collaborated with stakeholders and product owners to analyze requirements and design appropriate solutions, following the Agile/Scrum methodology to ensure efficient project delivery
  • Key Accomplishments
  • Successfully implemented multiple pipelines to process influencer data from a third-party API
  • Contributed to business insights by providing valuable information on marketing investments and ROI
  • Optimized and established separate Azure Data Factory pipelines for different stakeholders, reducing dependencies and improving ETL efficiency by an impressive 35%.

Senior Data Engineer

MAERSK
Bengaluru, KA
06.2018 - 02.2022
  • Designed and implemented efficient Extract, Transform, Load (ETL) pipelines to read and process data using Azure Data Factory (ADF) and Azure Databricks, ensuring timely execution and delivery of critical data
  • Integrated Azure Databricks with Azure Data Factory and implemented dynamic parameterization in ADF pipelines, optimizing data processing workflows and improving overall pipeline efficiency by 30%
  • Successfully migrated legacy data workflows to Azure Data Factory, converting SQL code to Databricks notebooks
  • Used Azure Key Vault to protect sensitive data
  • Optimized Spark jobs by analyzing Directed Acyclic Graph (DAG) execution plans and applying targeted optimization techniques to the DAG structure, resulting in a significant improvement in Spark job execution efficiency
  • Developed SSAS Tabular Model, wrote DAX queries
  • Automated SSAS Model Role mechanism to add/remove users
  • Key Accomplishments
  • Migrated Excel-based forecasting reports into Power BI and integrated PowerApps to automate the forecasting data preparation process and enable the write-back feature for customers, saving $40K annually
  • Successfully migrated several on-premises ETL processes to ADF, improving overall data quality and ETL efficiency by 40%.

Senior Software Developer

ACCIONLABS INDIA
Bengaluru, KA
11.2016 - 06.2018
  • Created complex SSAS cubes with multiple fact measure groups and multiple dimension hierarchies, and implemented Time Intelligence functions in SSAS cubes
  • Wrote MDX and DAX queries, and implemented a dynamic cube role security mechanism and cell-level security in cubes using MDX expressions to enforce user restrictions
  • Performed ETL from database and flat-file sources using SSIS packages and implemented custom logging in SSIS
  • Contributed to building data marts and multi-dimensional models such as star schema and snowflake schema
  • Wrote T-SQL scripts, dynamic SQL, complex stored procedures, functions, and triggers; scheduled and maintained SSIS packages on a daily, weekly and monthly basis using SQL Server Agent in SSMS
  • Created multiple Power BI reports, developed custom calculations using DAX in Power BI, utilized Power BI query editor and data modelling features
  • Implemented row-level security (RLS) in Power BI
  • Key Accomplishments
  • Developed a master SSAS cube from which all other similar cubes could be created and deployed, which reduced maintenance work
  • Automated the code deployment process in the production environment, streamlining releases

Senior Software Engineer

IMS HEALTH
Bengaluru, KA
01.2012 - 10.2016
  • Formulated and documented detailed business rules and guidelines
  • Created and maintained 40+ MDX cubes, including complex SSAS cubes with multiple fact measure groups and dimensions
  • Wrote complex MDX queries and created SSIS packages for reading and processing daily files
  • Provided application support to improve quality and troubleshoot business issues in a timely manner
  • Key Accomplishments
  • Automated jobs using SSIS and MDX to create a multidimensional cube comparison report between the current and previous versions of cubes, which optimized the cube deployment process in the production system.

Education

Bachelor of Technology - Electronics and Telecommunication Engineering

BIJU PATNAIK UNIVERSITY OF TECHNOLOGY (BPUT)
Bhubaneswar
05.2010

Skills

  • Technical skills:
  • Azure Databricks, Azure Data Factory (ADF), Azure Data Lake Storage (ADLS), Azure Blob Storage, Azure SQL DW, Azure SQL DB, Logic Apps, Spark, Python and PySpark, Scala, T-SQL, Data Modelling, Star Schema and Snowflake Schema, Microsoft Azure, MSBI, SSIS, SSAS, DAX, MDX, Power BI

Certification

  • 08/2020: DP-200 Implementing an Azure Data Solution (Certification Score: 750)
  • 11/2019: 70-778 Analyzing and Visualizing Data with Microsoft Power BI (Certification Score: 845)
  • 10/2019: 70-761 Querying Data with Transact-SQL (Certification Score: 850)

Interests

  • Sports: cricket, chess, and badminton
  • Watching movies and travelling

Languages

  • English – Expert
  • German – A2
  • Hindi – Native
  • Odia – Native

Timeline

    Senior Data Engineer

    FARFETCH
    02.2022 - Current

    Senior Data Engineer

    MAERSK
    06.2018 - 02.2022

    Senior Software Developer

    ACCIONLABS INDIA
    11.2016 - 06.2018

    Senior Software Engineer

    IMS HEALTH
    01.2012 - 10.2016

    Bachelor of Technology - Electronics and Telecommunication Engineering

    BIJU PATNAIK UNIVERSITY OF TECHNOLOGY (BPUT)