Summary
Overview
Work History
Education
Skills
Projects
Timeline
Generic
Mubeena Kabeer

Mubeena Kabeer

Bengaluru

Summary

Data Engineer | 3+ Years | Azure | Snowflake | PySpark | SQL | Power BI

Results-driven Data Engineer with over 3 years of experience in designing, building, and optimizing end-to-end data pipelines on Azure and Snowflake ecosystems. Skilled in ETL/ELT, PySpark, SQL, Databricks, and Power BI, with a proven ability to deliver high-performance data solutions, data quality frameworks, and cloud migrations. Adept at collaborating with cross-functional teams to drive data-driven decision-making, while ensuring scalability, governance, and cost optimization.

Overview

3
3
years of professional experience

Work History

Data Engineer

IBM
Bangalore
05.2023 - Current
  • Designed and implemented end-to-end ETL/ELT pipelines using PySpark, Azure Data Factory (ADF), and Databricks, automating ingestion from APIs, flat files, and relational databases into Azure Data Lake and Snowflake.
  • Developed Bronze–Silver–Gold data architecture in Snowflake, improving data accessibility and usability for analytics and machine learning teams.
  • Optimized PySpark transformations with partitioning, caching, and broadcast joins, reducing pipeline runtime by 30–40% for datasets >1 TB.
  • Built data quality frameworks (null handling, schema validation, duplicate removal, business rule validation), improving reporting accuracy to 99.9%.
  • Automated workflows and dependencies using ADF pipelines, Airflow, and Snowflake Tasks, Power BI, and ensuring SLA compliance for daily and nearrealtime jobs.
  • Implemented incremental loads and CDC pipelines, reducing processing cost and runtime by 50%.
  • Developed pipeline monitoring & alerting system (Slack/Teams + logs), reducing downtime by 25%
  • Integrated Snowflake with Power BI dashboards, enabling business users to track licensing revenue, churn risk, and utilization KPIs.

System Engineer

Tech Mahindra Cerium
Kochi
08.2022 - 04.2023
  • Worked in semiconductor physical design engineering, supporting timing closure, data routing, and block-level design workflows.
  • Trained in Linux, SQL, and database management to support data-driven engineering tasks.
  • Developed internal documentation and supported training plans for engineering interns.
  • Strengthened fundamentals in system architecture, data flows, and structured problem-solving.

Education

Bachelor of Technology - Electronics And Communications

Toc H Institute of Science And Technology
Kochi
06-2022

Skills

  • Data Engineering: ETL/ELT Pipelines, Data Modeling, CDC, Data Warehousing, Data Quality, Orchestration
  • Technologies: PySpark, SQL(Stored procedures), Python, Pandas, NumPy
  • Cloud: Azure Data Lake, Azure Synapse Analytics, Azure Databricks, Azure Data Factory, Snowflake
  • Visualization: Power BI, dashboarding & reporting
  • Other: Jira, Confluence, ServiceNow, Agile/Scrum

Projects

Enterprise licensing data optimization and dashboarding  

Role: Data Engineer | Organization: IBM | Duration: May 2023 – present  

  • Built a licensing data pipeline using Azure Data Factory, Databricks, and Snowflake.
  • Designed an incremental pipeline with CDC and partitioning, reducing processing time by 45%.
  • Ingested raw CSVs (licensing data) into Azure Data Lake (Bronze) → transformed to clean datasets (Silver) → aggregated into business-ready fact tables (Gold).
  • Published results to Power BI dashboards for revenue tracking, churn analysis, and license utilization insights.

Timeline

Data Engineer

IBM
05.2023 - Current

System Engineer

Tech Mahindra Cerium
08.2022 - 04.2023

Bachelor of Technology - Electronics And Communications

Toc H Institute of Science And Technology
Mubeena Kabeer