Summary

Overview

Work History

Education

Skills

Timeline

AAMIR RIYAZ

Mumbai

Summary

Experienced Senior Data Engineer adept in Spark, Hadoop, Azure Synapse, ETL, Data Modeling, and Data Warehousing. Proven track record of designing and implementing efficient data solutions. Possesses strong analytical skills, excels in problem-solving, and has a deep understanding of database technologies and systems. Confident in working independently and collaboratively, with excellent communication skills to facilitate seamless interactions.

Overview

years of professional experience

years of post-secondary education

Work History

Sr Data Engineer

Petroleum Chemical Organization

04.2024 - 12.2024

• Architected and implemented ETL pipelines using Azure Databricks, ADLS, and Delta Lake for scalable big data processing and analytics.

• Enhanced performance and cost efficiency by optimizing Spark jobs, tuning cluster configurations, and reducing storage redundancy through Delta Lake optimizations.

• Integrated Azure Databricks with Synapse Analytics and other Azure services to deliver end-to-end data solutions, ensuring reliability and scalability for business-critical workflows.

Sr Data Engineer

Reliance Jio

05.2022 - 04.2024

Led design and development of high-performance, scalable data processing system from inception to implementation.
Crafted project's High-Level Design (HLD) and Low-Level Design (LLD) while ensuring optimization of costs and adherence to acceptance criteria.
Utilized expertise in Big Data and Cloud technologies, specifically Azure and GCP, to architect solutions.
Employed key technologies such as PySpark, Python, Scala, Azure Synapse, HDInsight, Kafka, EventHub, Azure SQL, and DeltaLake.
Designed and implemented effective database solutions and models to store and retrieve data
Established DataMart for reporting team, utilizing Data Warehouse as source and creating Transaction layer with aggregated values.

Lead Developer

TEK Systems, Bigdata/ Databricks

10.2021 - 05.2022

Developed custom Python libraries using Spark for to deliver business requirement's in big data processing and analysis in Databricks notebooks.
Created comprehensive documentation for Python libraries, utilizing Sphinx to generate dynamic HTML documents.
Optimized and innovated Spark jobs, transforming data processing performance from minutes to seconds. Implemented strategic enhancements.

Senior Developer

Principal Global Services

03.2018 - 10.2021

Led dynamic Agile Scrum Team as Senior Developer/Team Lead, specializing in Python-based code development using Spark framework.
Led high-performing team 5 through the end-to-end development process of a Big Data Application, from the conceptualization phase to successful production deployment. Coordinated the team's efforts in designing and implementing scalable and efficient data processing solutions, ensuring the application met all functional requirements and performance benchmarks.
Resolved and implemented complex technical needs for project, including new developments, product design, and legacy system upgrades.
Successfully migrated legacy Informatica PowerCenter ETL processes to a modern, scalable solution using Scala, Spark, and Hadoop. Leveraged expertise in both legacy and emerging systems to ensure a seamless transition.
Designed and implemented robust framework for ingesting data from Kafka topics to database, providing high-end CI/CD solution.

ETL Developer

Cognizant Technology Solutions

06.2015 - 03.2018

Understanding Requirements and Creating Requirement Analysis and STTM Documents
Developed ETL workflows, sessions, mappings using Informatica Power Center as per the requirement.
Implemented SAP BW connectivity with Informatica. And triggering SAP Process Chain from Informatica and fetching data from Open HUB Destination.
Worked on Performance Tuning of Informatica Mappings/sessions.
Worked with different source/Target systems : Database(Oracle),Flat File, SalesForce and SAP BW
Performed root cause analysis on all processes and resolve all production issues and validate all data
Unit testing ETL code to ensure it can be delivered and run in a system testing environment.
Develop Informatica Cloud ETL code based on requirement and Data Model.
Developed Linux Shell Script for Process Orchestration

Education

B.Tech - Electronics & Communication Engineering

AMITY University

01.2010 - 04.2014

SSC 12th -

Dr. Virendra Swarup Education Center(C.B.S.E)

04.2008 - 03.2010

High School Diploma -

Dr. Virendra Swaroop Education Center

04.2007 - 03.2008

Skills

Big Data Stack : Hadoop, HDFS, Apache Spark, Python, Scala, Kafka, Hive, Apache Beam

Cloud Stack : Databricks, Azure Synapse, Azure HDInsights, Event Hub, Logic Apps, GCP Dataflow, GCP Cloud Storage, GCP Pub/Sub, Docker

ETL, Informatica Power Center, Data Warehousing, Dimensional Modeling

Unix/Linux Scripting, WSL

DBMS: DB2-Blu, SQL Server, Oracle, Azure SQL, Cosmos DB

CICD, GIT, Ansible, Azure DevOps, Airflow, Mage

Data analysis

Cloud Stack : Databricks, Azure Synapse, Azure HDInsights, Event Hub, Logic Apps, GCP Dataflow, GCP Cloud Storage, GCP Pub/Sub, Docker

ETL, Informatica Power Center, Data Warehousing, Dimensional Modeling

Unix/Linux Scripting, WSL

DBMS: DB2-Blu, SQL Server, Oracle, Azure SQL, Cosmos DB

CICD, GIT, Ansible, Azure DevOps, Airflow, Mage

Data analysis,

Timeline

Sr Data Engineer

Petroleum Chemical Organization

04.2024 - 12.2024

Sr Data Engineer

Reliance Jio

05.2022 - 04.2024

Lead Developer

TEK Systems, Bigdata/ Databricks

10.2021 - 05.2022

Senior Developer

Principal Global Services

03.2018 - 10.2021

ETL Developer

Cognizant Technology Solutions

06.2015 - 03.2018

B.Tech - Electronics & Communication Engineering

AMITY University

01.2010 - 04.2014

SSC 12th -

Dr. Virendra Swarup Education Center(C.B.S.E)

04.2008 - 03.2010

High School Diploma -

Dr. Virendra Swaroop Education Center

04.2007 - 03.2008