Summary
Overview
Work History
Education
Skills
Certification
Accomplishments
Websites
Timeline
Generic
GANESH MALLAH

GANESH MALLAH

Bengaluru

Summary

Results-driven Data Engineer and Analyst with 4+ years of experience designing, building, and optimizing scalable data pipelines and cloudbased solutions. Strong expertise in Python, SQL, Apache Airflow, PySpark, AWS Glue, Snowflake, FastAPI, and distributed data systems. Ability to deliver high-performance ETL pipelines, real-time APIs, and analytics solutions that improve data reliability, efficiency, and business outcomes.

Overview

6
6
years of professional experience
1
1
Certification

Work History

Data Engineer

Nielsen
01.2025 - 02.2026
  • Architect scalable ETL pipelines in Apache Airflow for automated daily batch processing, improving speed and reliability.
  • Manage containerized services with Docker and Kubernetes to ensure high availability and streamlined deployments.
  • Build FastAPI-based APIs for real-time ingestion, transformation, and integration with analytical platforms.
  • Leverage AWS S3, DynamoDB, and Athena for data storage, configuration management, and file validation.
  • Optimize distributed SQL queries on Trino and manage relational data in RDS/MySQL for ingestion, updates, and analytics.
  • Implement data quality frameworks for schema validation, anomaly detection, and record-level checks.
  • Utilize GitLab CI/CD for version control, testing, and CI/CD-driven deployments.

Data Engineer

Factspan Analytics Pvt. Ltd.
09.2023 - 01.2025
  • Orchestrated and maintained ETL pipelines using AWS Glue to optimize data workflows, efficiently handling structured and unstructured datasets, while also creating dynamic pricing models with Python, SQL, and AWS Redshift. This approach drove a 15% revenue increase by analyzing booking patterns, customer behavior, and seasonal demand fluctuations.
  • Administered and analyzed large-scale datasets using PySpark and AWS S3, delivering real-time insights into marketing campaign effectiveness, resulting in a 10% boost in customer engagement through targeted marketing initiatives.
  • Built and optimized predictive models and SQL queries to forecast booking trends, enhancing platform performance by 20% and reducing operational costs by 5% through better resource management.
  • Reduced data processing time by 40% with more efficient extraction and cleaning techniques, and mechanized notification systems using AWS Lambda and CloudWatch, cutting manual efforts by 30% and enhancing system reliability.

Data Analyst

Hills Safari Travels Pvt. Ltd.
01.2020 - 05.2021
  • Utilized advanced SQL queries for data extraction and transformation, reducing monthly report generation time by 15 hours while ensuring data accuracy and consistency across reporting processes.
  • Worked as a Data Analyst using Python, GCP, SQL, Power BI, and Tableau. Built end-to-end pipelines on Google Cloud Platform for data processing and storage, using BigQuery for analytics and access provisioning.
  • Streamlined workflows to fetch travel data and send notifications, reducing manual effort by 30%. Used PySpark for data processing. Leveraged Power BI and Tableau to generate real-time dashboards from BigQuery data, providing actionable insights using Generative BI, Narrative, and Copilot, while automating employee time tracking for improved efficiency.

Education

MTech. - Construction Technology and Management

NIT Warangal
01-2023

B.Tech. - Civil Engineering

NIT Arunachal Pradesh
01-2019

Skills

Cloud Platforms:
AWS (S3, Lambda, Glue, Redshift, Athena, DynamoDB, EMR, SNS, CloudWatch),
Google Cloud Platform (BigQuery, Cloud Storage, Compute Engine)

Data Engineering & Big Data:
PySpark, Apache Spark, Apache Airflow, Databricks, Hive, Apache Kafka,
ETL/ELT Pipelines, Data Transformation, Workflow Orchestration (Airflow, Step Functions)

Data Warehousing & Databases:
Snowflake, Amazon Redshift, BigQuery, Trino, PostgreSQL, MySQL

Programming Languages:
Python, JavaScript

Backend & API Development:
FastAPI, Django, REST APIs, API Integration, Asynchronous Processing

Business Intelligence & Analytics:
Power BI, Tableau

DevOps & Containers:
Docker, Kubernetes, Podman, Git, GitLab CI/CD

Data Quality & Monitoring:
Schema Validation, Data Validation, Monitoring, Alerting, Pipeline Failure Handling

Certification

  • Microsoft Certified: Power BI Data Analyst Associate
  • Certified Entry Level Python Program

Accomplishments

  • Data Automation Excellence, 2024, Factspan Analytics Pvt. Ltd.
  • Outstanding Performance in Power BI, 2024, Factspan Analytics Pvt. Ltd.

Timeline

Data Engineer

Nielsen
01.2025 - 02.2026

Data Engineer

Factspan Analytics Pvt. Ltd.
09.2023 - 01.2025

Data Analyst

Hills Safari Travels Pvt. Ltd.
01.2020 - 05.2021

MTech. - Construction Technology and Management

NIT Warangal

B.Tech. - Civil Engineering

NIT Arunachal Pradesh
GANESH MALLAH