MUHAMMED FAZIL P H

Wayanad

Summary

Accomplished Systems Engineer with expertise in PySpark and SQL, driving data transformation at Infosys, Ltd. Enhanced ETL pipeline performance by 30% using Databricks optimization techniques. Collaborated effectively with analytics teams, delivering high-quality datasets for strategic insights. Proficient in Azure and Google Cloud, ensuring robust data solutions and seamless integration.

Overview

4 years of professional experience
1 Certification

Work History

Systems Engineer

Infosys, Ltd.
01.2025 - Current
  • Built and maintained PySpark-based ETL pipelines in Databricks to process high-volume telecom data stored in Azure databases.
  • Utilized Spark SQL and Delta Lake tables for data transformation, cleansing, and aggregation, enhancing analytics team capabilities.
  • Applied Databricks optimization techniques like partitioning, Z-ordering, and caching to enhance performance.
  • Developed and managed Delta Lake and external tables in Databricks, enabling structured access to processed datasets.
  • Debugged and optimized Spark jobs by addressing skew, applying broadcast joins, and adjusting partitions to minimize shuffle.

Senior Systems Associate

Infosys, Ltd.
07.2023 - Current
  • Developed batch ETL pipelines in Azure Databricks using PySpark, Spark SQL, and Delta Lake to process large-scale datasets efficiently.
  • Optimized Spark workloads by applying caching, repartitioning strategies, Adaptive Query Execution (AQE), and by refactoring expensive wide transformations to improve pipeline performance.
  • Designed and implemented Delta Lake tables following the Bronze → Silver architecture, applying efficient partitioning strategies based on columns such as date, region, and network type.
  • Collaborated with data analysts to deliver curated datasets that supported product usage insights, customer behavior analysis, and recharge trend reporting.

Systems Associate (Trainee)

Infosys, Ltd.
07.2022 - Current
  • Assisted in developing scalable batch data pipelines using PySpark and Spark SQL for a global banking client.
  • Helped build PySpark jobs that load structured data into the Hadoop Distributed File System (HDFS).
  • Created Hive tables and implemented transformations for reporting use cases.
  • Developed logging, monitoring, and validation scripts for ETL workflows.
  • Collaborated with the team during Agile ceremonies to achieve sprint goals and meet deadlines.

Education

Bachelor of Science - Bachelor of Computer Applications

Manipal University
Jaipur
10-2025

Diploma in Electrical and Electronics Engineering

Govt Polytechnic Meenangadi, Department of Technical Education
10-2021

Skills

  • SQL and PL/SQL
  • PySpark and Python
  • Azure Data Factory
  • Databricks
  • Azure Synapse Analytics
  • Azure Data Lake
  • Google Cloud Storage (GCS)
  • Data warehousing
  • Data mapping
  • Data quality assurance
  • Data profiling

Certification

  • Microsoft Certified: Azure Fundamentals
  • Infosys Certified Junior Network Admin Professional
  • Infosys Certified Python Developer

Accomplishments

  • Improved data pipeline performance by 40% through optimized Spark and SQL transformations.
  • Built a data validation framework in Python, reducing manual QA effort by 50%.

Awards

  • Delivery Ninja award
  • Business Ninja award
  • Rise Award for Best Team - Project Excellence / Solutions Architect
