Summary

Overview

Work History

Education

Skills

Timeline

Murtuza Alam

Data Engineer

MUMBAI,MH

Summary

Data Engineer with over 8 years of experience in designing and optimizing large-scale Big Data pipelines within telecom and enterprise sectors. Expertise in PySpark, Hadoop, Kafka, Hive, Airflow, SQL, and distributed data processing. Achievements include building batch and real-time pipelines, enhancing performance by 30-60%, and minimizing data failures to ensure reliable data products for analytics and reporting.

Overview

years of professional experience

Work History

Senior Software Engineer – Data Engineering

Reliance Jio

MUMBAI

12.2023 - Current

Developed PySpark pipelines on Hadoop for business KPI reporting.
Reduced runtime of PySpark jobs by 35 to 70 percent.
Optimized data models and created Hive tables for effective reporting.
Automated ingestion workflows to improve data processing efficiency.
Conducted root cause analysis to address data inconsistencies across distributed systems.

Big Data Engineer

ACL Digital |

Mumbai

05.2022 - 12.2023

Designed scalable ETL workflows utilizing PySpark, HDFS, and Hive for telecom analytics.
Automated ingestion pipelines, achieving a 60% reduction in manual effort.
Tuned Spark jobs using partitioning, bucketing, join optimization, caching, broadcasting, and coalesce.
Designed optimized PySpark transformations with partition pruning, caching, and broadcast joins, which improved job performance by 40%.

Deployment Engineer

Mobileum Technologies

Bangalore

03.2021 - 04.2022

Configured and installed Big Data revenue assurance and fraud management product.
Managed deployment cycles, user acceptance testing, system integration testing, and production rollout.
Integrated data feeds from 15 or more telecom network elements into Hadoop clusters.

Lead – Technical Operations (Big Data L2/L3)

Subex Ltd.