Summary
Overview
Work History
Education
Skills
Certification
Accomplishments
Timeline
Generic
Binit Chowdhary

Binit Chowdhary

Bengaluru

Summary

Result-oriented and innovative Data Engineer with 11 years of experience. Passionate about learning and implementing data solutions & pipelines to help organizations derive value from data.

Ownership & responsibility fan, like to demonstrate that in daily work. Having good leadership experience of leading a team of 10 developers. Considers new challenges as opportunities to explore, learn, overcome the problem and grow.

Overview

11
11
years of professional experience
1
1
Certification

Work History

Lead Data Engineer

Epam Systems Pvt. Ltd.
Bangalore/Kolkata
07.2023 - Current

Client : LSEG

Project :

Working on developing platform and solution for fintech domain to create data products for end clients. This is in collaboration with Microsoft.

Key Deliverables :

  • Designed and developed automated tool for LDM to PDM conversion for the data to be served to the client.
  • Worked with downstream team to serve the LDM PDM data to integrate it with Purview for establishing governance around this.
  • Identified performance issues, came up with ideas and implemented solution to mitigate the issues.
  • Worked on implementing XML parsing framework.

Skills : Microsoft Fabric, SQL, PySpark, Python, Azure Data Factory(ADF), Azure Data lake(ADLS), Data Warehousing, Azure Synapse, ETL, Azure DevOps, GitHub, Storage Accounts etc.

Senior Data Engineer

JLL
Bangalore/Kolkata
07.2022 - 07.2023

Client : Internal

Project :

Working on developing and maintaining pipelines to support Capital markets/Valuations business of JLL globally. End goal is to make property data available for Smart Property Application. This highly scalable application is used by business to drive insightful/data backed investments by investors. This helps investors realize their investment needs/returns more effectively.
To achieve this, I am responsible to make data available in the required format which could be loaded to Application.
This starts with ingesting data from various sources like SFTP,RDBMS,API etc. followed by transforming data to JLL common model for silver layer consumption. Next step is to transform data to a scalable data model and store in Data Hubs followed by generating the final data file which would be consumed by end application.

Key Deliverables :

  • Implemented Data Hub for Global Data Sources as existing was region specific Data Hubs for storing Facts and Dimensions.
  • Took initiative and migrated Data Hub from Azure SQL server to Data Lake Delta.
  • Tuned performance of existing Stored procedure and brought down run time by approx. 4-5 times.
  • Tuned performance of Data bricks scripts and brought down load/run time from ~11 hrs to less than 2 hrs.
  • Developed incremental process to identify incremental records and merge to data hub. Helped in load time improvement from ~5 hrs to 25-30 mins.
  • Worked with automated test cases and ensured code coverage is per industry standards.

Skills : SQL, PySpark, Python, Databricks, Azure Data Factory(ADF), Azure Data lake(ADLS), Data Warehousing, Azure Synapse, ETL, Azure SQL Server, Azure Functions, Azure DevOps, GitHub, Storage Accounts etc.

Associate Manager

HTC Global Solutions Pvt. Ltd
Bangalore/Kolkata
11.2021 - 07.2022

Client : JLL( Deployed as Senior Data Engineer @JLL on Contract)

Project : Deployed to JLL as contractor hence, added details under JLL.

Key Deliverables : Deployed to JLL as contractor hence, added details under JLL.

Skills : SQL, PySpark, Python, Databricks, Azure Data Factory( ADF), Azure Datalake(ADLS), Azure Synapse, Data Warehousing, ETL, Azure Sql Server, Azure Functions, Azure DevOps, GitHub, Storage Accounts etc.

Software Engineer III

Cerner Healthcare Solutions Pvt. Ltd
Bangalore/Kolkata
01.2020 - 11.2021

Client : Cerner

Project :

Worked on developing and maintaining data pipelines to support Finance and Supply Chain dataset. This was modernizing project to migrate reporting data from on-premise legacy to cloud.

To achieve this, was responsible to ingest data from ERP system OLTP database to Data lake. Followed by combining initial with incremental data to produce daily snapshots using Data bricks. Data transformation was done on snapshots to produce required dimensions and facts using Databricks. This was loaded to Snowflake to support the BI/reporting needs. Was done by understanding transformations in legacy ETL, followed by creating logic in new platform leveraging Databricks.

Key Deliverables:

  • Implemented logic to load incremental data instead of full load based on business requirement.
  • Developed utility notebook with functions to perform merge for SCD 1 and 2 in Snowflake.
  • Developed ADF pipeline to read and load data from Oracle OLTP to data lake.
  • Developed orchestration pipeline using ADF to enable Databricks notebooks to run per schedule.
  • Developed merge script in Databricks to merge initial and incremental data to produce daily snapshots.

Skills : SQL, PySpark, Python, Databricks, Azure Data Factory(ADF), Azure Datalake( ADLS), Azure Synapse, Data Warehousing, ETL, Snowflake, Kafka, Azure DevOps, GitHub, Key Vaults, Storage Accounts etc.

Application Development Team Lead (Last Position)

Accenture Solutions Pvt Ltd
Bangalore
05.2013 - 12.2019

Client : North America Truck Manufacturer

Project :

Led team of 10 developers to support the Finance and Supply chain development effort. Was responsible for Requirement analysis, solutioning, technical design and development in FSCM.

Key Deliverables:

  • Worked on developing integration of data from on-premise to cloud for analytics.
  • Moved Finance data to ADLS followed by developing logic to create and load the dimensions and facts.
  • Performed data crunching to figure out Past Due invoices for which Org did not charge penalty and informed Business. This was appreciated by client end Director.
  • Designed and developed interface to fetch real time financial reports from 3rd party repository and show in user interface.
  • Developed Interface to receive Time data from Kronos to load into PeopleSoft Time.

Skills : Azure Data Factory, Azure Databricks, Pyspark, Azure Sql Server, SQL, Peoplecode, RDBMS, AE, BI Publisher, IB etc.

Education

Bachelor of Engineering -

Hindustan University
Chennai
05.2013

Higher Secondary -

St. Joan's School
Kolkata
05.2009

Matriculation -

St. Joan's School
Kolkata
03.2007

Skills

Azure Databricks

SQL

PySpark

Microsoft Fabric

Data Modelling

Python

Azure Data Factory( ADF)

Azure Datalake (ADLS)

Snowflake

Data Warehousing

ETL

Azure DevOps

Azure Functions

OLTP and OLAP DBs

Azure SQL Server

Azure Synapse Analytics

Certification

Microsoft Certified - Azure Data Fundamentals - January, 2022

Microsoft Certified - Azure Data Engineer Associate - March, 2022

Databricks Certified - Data Engineer Associate - October, 2022

Accomplishments

    Awarded Accenture Celebrating Excellence (ACE) monetary award twice along the service period with Accenture.

Timeline

Lead Data Engineer

Epam Systems Pvt. Ltd.
07.2023 - Current

Senior Data Engineer

JLL
07.2022 - 07.2023

Associate Manager

HTC Global Solutions Pvt. Ltd
11.2021 - 07.2022

Software Engineer III

Cerner Healthcare Solutions Pvt. Ltd
01.2020 - 11.2021

Application Development Team Lead (Last Position)

Accenture Solutions Pvt Ltd
05.2013 - 12.2019

Bachelor of Engineering -

Hindustan University

Higher Secondary -

St. Joan's School

Matriculation -

St. Joan's School
Binit Chowdhary