2 years of experience building big data ecosystems.
Hands-on expertise in Spark-based ETL workflows, including data ingestion, transformation, and aggregation for large-scale datasets.
Maintained and monitored Spark clusters on AWS EMR, ensuring high availability and fault tolerance.
Experienced in optimizing Spark SQL performance by tuning configuration settings such as memory allocation, caching, and serialization.
Integrated AWS S3 with PySpark jobs to handle large datasets in a distributed environment.
Managed ETL processes with PySpark on AWS EMR, using AWS S3 for storage.
Expertise in developing and deploying serverless applications using Google Cloud Functions, enabling cost-effective and scalable solutions.
Familiarity with Google Cloud Storage buckets, object lifecycle policies, and access control mechanisms to ensure data availability and compliance.
Created DAG templates to standardize job orchestration across multiple Spark use cases.
Expertise in testing ETL workflows and job scheduling mechanisms.
Experienced in testing data integration and synchronization between systems via ETL processes.
Skilled in writing efficient SQL queries for data extraction, cleansing, and reporting across relational and distributed databases.
Proficient in Python scripting for data manipulation, automation, and integration with big data frameworks and APIs.
Experience deploying data solutions on cloud infrastructure including AWS S3, EC2, Lambda, and Azure Data Lake, ensuring high availability and performance.
Knowledge of workflow orchestration tools such as Apache Airflow and version control systems (Git) for collaborative development.