AAMIR RIYAZ

Mumbai

Summary

Experienced Senior Data Engineer adept in Spark, Hadoop, Azure Synapse, ETL, Data Modeling, and Data Warehousing. Proven track record of designing and implementing efficient data solutions. Possesses strong analytical skills, excels in problem-solving, and has a deep understanding of database technologies and systems. Confident in working independently and collaboratively, with excellent communication skills to facilitate seamless interactions.

Overview

10 years of professional experience
7 years of post-secondary education

Work History

Sr Data Engineer

Petroleum Chemical Organization
04.2024 - 12.2024

  • Architected and implemented ETL pipelines using Azure Databricks, ADLS, and Delta Lake for scalable big data processing and analytics.
  • Enhanced performance and cost efficiency by optimizing Spark jobs, tuning cluster configurations, and reducing storage redundancy through Delta Lake optimizations.
  • Integrated Azure Databricks with Synapse Analytics and other Azure services to deliver end-to-end data solutions, ensuring reliability and scalability for business-critical workflows.

Sr Data Engineer

Reliance Jio
05.2022 - 04.2024
  • Led design and development of high-performance, scalable data processing system from inception to implementation.
  • Crafted project's High-Level Design (HLD) and Low-Level Design (LLD) while ensuring optimization of costs and adherence to acceptance criteria.
  • Utilized expertise in Big Data and Cloud technologies, specifically Azure and GCP, to architect solutions.
  • Employed key technologies such as PySpark, Python, Scala, Azure Synapse, HDInsight, Kafka, Event Hub, Azure SQL, and Delta Lake.
  • Designed and implemented effective database solutions and models to store and retrieve data.
  • Established DataMart for reporting team, utilizing Data Warehouse as source and creating Transaction layer with aggregated values.

Lead Developer

TEK Systems, Big Data / Databricks
10.2021 - 05.2022
  • Developed custom Python libraries using Spark to deliver business requirements for big data processing and analysis in Databricks notebooks.
  • Created comprehensive documentation for Python libraries, utilizing Sphinx to generate dynamic HTML documents.
  • Optimized Spark jobs through targeted enhancements, cutting data processing times from minutes to seconds.

Senior Developer

Principal Global Services
03.2018 - 10.2021
  • Led dynamic Agile Scrum Team as Senior Developer/Team Lead, specializing in Python-based code development using Spark framework.
  • Led a high-performing team of 5 through the end-to-end development of a Big Data application, from conceptualization to successful production deployment. Coordinated the team's efforts in designing and implementing scalable and efficient data processing solutions, ensuring the application met all functional requirements and performance benchmarks.
  • Resolved and implemented complex technical needs for project, including new developments, product design, and legacy system upgrades.
  • Successfully migrated legacy Informatica PowerCenter ETL processes to a modern, scalable solution using Scala, Spark, and Hadoop. Leveraged expertise in both legacy and emerging systems to ensure a seamless transition.
  • Designed and implemented robust framework for ingesting data from Kafka topics to database, providing high-end CI/CD solution.

ETL Developer

Cognizant Technology Solutions
06.2015 - 03.2018
  • Gathered requirements and created requirement analysis and STTM documents.
  • Developed ETL workflows, sessions, and mappings using Informatica PowerCenter as per the requirements.
  • Implemented SAP BW connectivity with Informatica, including triggering SAP Process Chains from Informatica and fetching data from Open Hub Destinations.
  • Tuned the performance of Informatica mappings and sessions.
  • Worked with various source and target systems: Oracle databases, flat files, Salesforce, and SAP BW.
  • Performed root cause analysis on processes, resolved production issues, and validated data.
  • Unit tested ETL code to ensure it could be delivered and run in a system testing environment.
  • Developed Informatica Cloud ETL code based on requirements and the data model.
  • Developed Linux shell scripts for process orchestration.

Education

B.Tech - Electronics & Communication Engineering

AMITY University
01.2010 - 04.2014

SSC 12th

Dr. Virendra Swarup Education Center (C.B.S.E)
04.2008 - 03.2010

High School Diploma

Dr. Virendra Swarup Education Center
04.2007 - 03.2008

Skills

    Big Data Stack : Hadoop, HDFS, Apache Spark, Python, Scala, Kafka, Hive, Apache Beam

    Cloud Stack : Databricks, Azure Synapse, Azure HDInsight, Event Hub, Logic Apps, GCP Dataflow, GCP Cloud Storage, GCP Pub/Sub, Docker

    ETL, Informatica PowerCenter, Data Warehousing, Dimensional Modeling

    Unix/Linux Scripting, WSL

    DBMS: DB2 BLU, SQL Server, Oracle, Azure SQL, Cosmos DB

    CI/CD, Git, Ansible, Azure DevOps, Airflow, Mage

    Data analysis
