Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic
Arshia Saha

Arshia Saha

Bengaluru

Summary

An Azure Data Engineer with hands-on experience of development in Microsoft Azure.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Senior Technical Consultant

Ernst & Young LLP
08.2023 - Current
  • Ingestion of data from multiple sources (Service Now, Mulesoft, Saviynt, etc.) from Graph API or File and performing ETL operations and ingesting into Azure Data Lake Storage (ADLS) and making it accessible through SQL server tables based on the business requirements. This is to provide the market with an easy view of all key metrics using dashboards and reports to reduce the decision making turnaround time.
  • Building end-to-end data ingestion pipelines from client storage to SQL databases and Databricks Delta Tables, to be further processed by visualization tools, which involved working across multiple technologies like Azure Data Factory, Azure Databricks, etc.
  • Mainly 3 steps were involved, Ingestion, Transformation and Integration before the tables were populated in SQL server. The ingestion is orchestrated using Azure Data Factory (ADF) and Databricks jobs are run for the Ingestion process. The pattern uses a Metadata Driven approach for the table ingestions.
  • Daily monitoring of pipelines in production environment and fixing any pipeline that has failed including weekends.
  • Create proof-of-concept that demonstrate viability of solutions under considerations.
  • Orchestration: Azure Data Factory, Azure Databricks, Python, SQL, PySpark.

Tech Lead

Cognizant Technology Solutions
06.2019 - 08.2023
  • Building end-to-end data ingestion pipelines from client storage to SQL databases and Databricks Delta Tables, to be further processed by visualization tools, which involved working across multiple technologies like Azure Data Factory, Azure Databricks, Airflow, etc
  • And using languages like Python, PySpark and SQL
  • Developing a dynamic Unified Anaytics Framework which will ease the process and automate the handling of data from Data Ingestion to Data Processing across multiple data sources, using Azure Databricks, which includes several universal and custom methods and triggers
  • Cleansing and preparing the data in Bronze, Silver and Gold layers of database
  • Applying business logics to data in Databricks
  • Strictly follow AGILE SCRUM and SaFe methodology and actively involved in all Agile ceremonies, including Program Increment (PI) planning, Sprint planning, daily stand-ups, retros and product demos
  • Production analyst experienced in handling production activities following ITIL process on IAM, Azure AD, SLA Management
  • Creating and managing user's priviledged account, handling access related issues with AD group, shared drive and security groups.

Education

Bachelor of Technology - Electronics and Communication Engineering

Calcutta Institute of Engineering And Management
01.2019

12th - Science

G D Birla Center For Education
01.2015

10th -

G D Birla Center for Education
01.2013

Skills

  • Azure Data Factory
  • Azure Databricks
  • Pyspark
  • Python
  • SQL
  • Azure Data Lake
  • Spark SQL
  • Azure Blob
  • Bitbucket

Certification

  • Academy Accreditation - Databricks Lakehouse Fundamentals • Arshia Saha • Databricks Badges
  • DP 203
  • Scrum Master Certification - Udemy
  • ITIL v4

Timeline

Senior Technical Consultant

Ernst & Young LLP
08.2023 - Current

Tech Lead

Cognizant Technology Solutions
06.2019 - 08.2023

Bachelor of Technology - Electronics and Communication Engineering

Calcutta Institute of Engineering And Management

12th - Science

G D Birla Center For Education

10th -

G D Birla Center for Education
Arshia Saha