Summary
Overview
Work History
Education
Skills
Timeline
Generic

Murtuza Alam

Data Engineer
MUMBAI,MH

Summary

Data Engineer with over 8 years of experience in designing and optimizing large-scale Big Data pipelines within telecom and enterprise sectors. Expertise in PySpark, Hadoop, Kafka, Hive, Airflow, SQL, and distributed data processing. Achievements include building batch and real-time pipelines, enhancing performance by 30-60%, and minimizing data failures to ensure reliable data products for analytics and reporting.

Overview

9
9
years of professional experience

Work History

Senior Software Engineer – Data Engineering

Reliance Jio
MUMBAI
12.2023 - Current
  • Developed PySpark pipelines on Hadoop for business KPI reporting.
  • Reduced runtime of PySpark jobs by 35 to 70 percent.
  • Optimized data models and created Hive tables for effective reporting.
  • Automated ingestion workflows to improve data processing efficiency.
  • Conducted root cause analysis to address data inconsistencies across distributed systems.

Big Data Engineer

ACL Digital |
Mumbai
05.2022 - 12.2023
  • Designed scalable ETL workflows utilizing PySpark, HDFS, and Hive for telecom analytics.
  • Automated ingestion pipelines, achieving a 60% reduction in manual effort.
  • Tuned Spark jobs using partitioning, bucketing, join optimization, caching, broadcasting, and coalesce.
  • Designed optimized PySpark transformations with partition pruning, caching, and broadcast joins, which improved job performance by 40%.

Deployment Engineer

Mobileum Technologies
Bangalore
03.2021 - 04.2022
  • Configured and installed Big Data revenue assurance and fraud management product.
  • Managed deployment cycles, user acceptance testing, system integration testing, and production rollout.
  • Integrated data feeds from 15 or more telecom network elements into Hadoop clusters.

Lead – Technical Operations (Big Data L2/L3)

Subex Ltd.
Bangalore
11.2016 - 03.2021
  • Supported Hadoop-based revenue assurance applications across multiple operators.
  • Developed ETL decoders and automated data quality and integrity checks.
  • Handled incident management, RCA, and performance troubleshooting.
  • Created job monitoring shell scripts, automation scripts.
  • Worked successfully with diverse group of coworkers to accomplish goals and address issues related to our products and services.
  • Promoted high customer satisfaction by resolving problems with knowledgeable and friendly service.
  • Utilized advanced technical skills and expertise to troubleshoot complex problems and implement solutions.

Education

Bachelor of Technology - Electronics And Communications Engineering

Calcutta Institute of Technology
Kolkata
08-2015

12th Science Stream - Science Stream

Kendriya Vidalaya , Dum Dum
Kolkata
07-2011

Skills

  • Programming: Python, SQL, and PySpark
  • Big data technologies: Spark, Hadoop, HDFS, YARN, Hive
  • Real-time processing: Kafka, Spark Structured Streaming
  • Workflow orchestration: Apache Airflow, Cron, shell scripting
  • Data engineering: ETL/ELT pipelines, data modeling, data quality, schema design
  • Tools and platforms: Cloudera CDH, RHEL, Linux
  • Concepts: distributed computing, optimization, partitioning, joins, window functions

Timeline

Senior Software Engineer – Data Engineering

Reliance Jio
12.2023 - Current

Big Data Engineer

ACL Digital |
05.2022 - 12.2023

Deployment Engineer

Mobileum Technologies
03.2021 - 04.2022

Lead – Technical Operations (Big Data L2/L3)

Subex Ltd.
11.2016 - 03.2021

Bachelor of Technology - Electronics And Communications Engineering

Calcutta Institute of Technology

12th Science Stream - Science Stream

Kendriya Vidalaya , Dum Dum
Murtuza AlamData Engineer