Summary
Overview
Work History
Education
Skills
Timeline
Generic
Vaishali Gadekar

Vaishali Gadekar

Bangalore

Summary

Accomplished Data Engineer with extensive experience in designing and implementing scalable data solutions. Proven expertise in AWS Glue, PySpark, and ETL pipelines, with a strong focus on data lake architecture and optimisation. Successfully engineered multi-layer Telecom Data Lakes, enhancing query performance by 40% and ensuring >95% data accuracy. Adept at automating data ingestion pipelines, building analytical models for business insights, and integrating datasets for real-time reporting. Skilled in Python, SQL, Airflow orchestration, and Hadoop ecosystem management. Career goal: to leverage advanced data engineering skills to drive innovative solutions in dynamic environments.

Overview

3
3
years of professional experience

Work History

Data Engineer

AetosOrtec Group
08.2024 - 12.2025
  • Designed and implemented a multi-layer Telecom Data Lake (Bronze ' Silver ' Gold ' Platinum) using AWS Glue, PySpark, and S3 to process large-scale CDR, network usage, and performance datasets.
  • Automated RDS ' S3 ingestion pipelines using Glue Jobs with incremental loads, schema validation, and secure extraction, ensuring high-quality raw data availability.
  • Built Silver-layer transformation pipelines using PySpark, converting data to optimized Parquet/ORC formats, improving query performance by 40%.
  • Engineered Gold-layer analytical models enabling churn prediction, ARPU analysis, tower-level KPIs, and customer segmentation for telecom business teams.
  • Delivered business-ready Platinum datasets for dashboards, ML features, and cross-team consumption, aligned with daily SLA refreshes.
  • Implemented a robust data quality framework (null checks, threshold rules, schema drift detection) ensuring >95% data accuracy across layers.
  • Integrated curated datasets with Athena and Redshift, enabling near real-time reporting and high-speed analytical queries.
  • Optimized S3 storage and compute performance through partitioning, bucketing, and compression strategies.
  • Skills: AWS Glue, PySpark, Data Lake Architecture, ETL Pipelines, AWS RDS, Amazon S3, Athena, Redshift, Data Quality Frameworks, Data Lake Design, Python, SQL, Data Modeling.

Data Engineer (Engineer Service Associate)

Accenture
05.2023 - 08.2024
  • Designed and built efficient, reusable, and scalable data pipelines to ingest, process, and manage large-scale structured and unstructured datasets using cloud-based infrastructure.
  • Architected and implemented fault-tolerant, high-performance data solutions leveraging AWS services, Dataiku, and Hadoop ecosystem.
  • Automated and optimized ETL workflows to streamline data ingestion, transformation, and integration for enterprise-level analytics and reporting.
  • Monitored, troubleshot, and supported operational data pipelines, resolving production issues, implementing bug fixes, and deploying enhancements to meet evolving business requirements.
  • Collaborated cross-functionally with reporting teams, data scientists, product owners, and domain experts to design and maintain scalable data pipelines using Agile/Scrum methodologies.
  • Skills: Python, Airflow, PySpark, SQL, ETL Development, Data Modeling, AWS S3, Hadoop, Hive, Power BI, Tableau, GitHub

Data Science & Business Analyst Intern

Coders Ready
06.2022 - 12.2022
  • Skills: Python, Pandas, NumPy, Matplotlib, Scikit-learn, Jupyter Notebook

Education

Bachelor of Science - Aeronautics

Hal Pravara Aviation Institute
06.2022

Skills

  • Data engineering and ETL
  • Data pipelines and lakes
  • Batch and streaming processing
  • Python and SQL programming
  • PySpark and Shell scripting
  • AWS services (S3, Glue, EMR, Lambda, RDS, Redshift)
  • Azure Data Factory integration
  • Hadoop and Hive management
  • Spark SQL optimization
  • Airflow orchestration
  • Snowflake data warehousing
  • PostgreSQL and MySQL databases
  • Version control with Git
  • CI/CD with GitHub Actions
  • Containerization with Docker
  • Data visualization with Power BI and Tableau
  • REST API development
  • Project management with Jira
  • Documentation in Confluence

Timeline

Data Engineer

AetosOrtec Group
08.2024 - 12.2025

Data Engineer (Engineer Service Associate)

Accenture
05.2023 - 08.2024

Data Science & Business Analyst Intern

Coders Ready
06.2022 - 12.2022

Bachelor of Science - Aeronautics

Hal Pravara Aviation Institute
Vaishali Gadekar