Dedicated and adaptable professional with a proactive attitude and the ability to learn quickly. Strong work ethic and effective communication skills. Eager to contribute to a dynamic team and support organizational goals.
Overview
3 years of professional experience
1 Certification
Work History
System Engineer
Tata Consultancy Services
Kolkata
08.2021 - Current
Developed and implemented ETL processes using IBM DataStage.
Analyzed source system data models to develop efficient extract-transform-load processes for loading into the target database tables.
Participated in agile development processes, contributing to sprint planning, stand-ups, and reviews to ensure timely delivery of data projects.
Researched and integrated new data technologies and tools to keep the data architecture modern and efficient.
Education
B.Tech - Computer Science
B. P. Poddar Institute of Management and Technology
Kolkata
06.2021
Skills
Languages: Python, Java, SQL
Tools and Technologies: IBM InfoSphere DataStage Designer and Director, New Relic (monitoring), Control-M (scheduling), IBM MQ FTE, Azure Data Factory, Azure Databricks, PySpark, Git
Certification
Microsoft Certified: Azure Fundamentals (AZ-900)
Microsoft Certified: Azure Data Engineer Associate (DP-203)
Languages
English
First Language
English
Proficient (C2)
Projects
Project on Azure Data Factory:
Developed and implemented an end-to-end ETL pipeline using Azure Data Factory to extract, transform, and load COVID-19 datasets from various sources, ensuring data integrity and accuracy.
Utilized Azure Data Factory's data flow features to perform complex transformations, including data cleansing and normalization, improving the overall quality of the dataset.
Automated data ingestion processes by scheduling regular data updates and creating triggers, resulting in efficient data management and reduced manual effort.
Project on Azure Databricks:
Designed and developed a data analysis project on Formula 1 racing data using Azure Databricks, focusing on enhancing data processing and analytics skills.
Constructed a three-layer architecture consisting of the raw, processed, and presentation layers, optimizing data flow and organization for analytical tasks.
Read the source CSV and JSON files.
Performed data transformations and aggregations using PySpark and SQL, deriving insights from the Formula 1 data.