Summary
Overview
Work History
Education
Skills
Timeline
AssistantManager
Himanshu  Mishra

Himanshu Mishra

Data Engineer | SQL | Python | Apache Spark | AWS

Summary

Data Engineer with a strong software engineering background and a Master’s degree in Data Analytics. Experienced in building scalable data pipelines, lakehouse architectures, and analytics-ready datasets using SQL, Python, Apache Spark, and AWS. Skilled in batch and near real-time data processing, data modeling, and performance optimization. Adept at delivering reliable data platforms that support analytics and business decision-making.

Overview

9
9
years of professional experience
5
5
years of post-secondary education

Work History

Data Engineer

Independent / Project-Based Experience
11.2023 - Current
  • Designed and implemented end-to-end data pipelines using SQL, Python, Apache Spark, and AWS.
  • Built batch and near real-time ingestion workflows using AWS services (S3, Glue, Athena, Lambda, SQS).
  • Implemented Bronze–Silver–Gold lakehouse architecture to convert raw data into analytics-ready datasets.
  • Developed Spark-based ETL jobs for data cleansing, enrichment, aggregation, and validation.
  • Applied schema enforcement and evolution strategies to ensure data consistency.
  • Optimized Spark jobs using partitioning, pruning, and efficient query design.
  • Enabled SQL-based analytics using Athena for reporting and business insights.
  • Followed production-grade practices, including modular code structure, version control, and monitoring.

Software Engineer – Android & Data Analytics

YourStory Media Pvt Ltd
06.2019 - 06.2020
  • Developed and maintained Android applications that generated high-volume, event-driven user interaction data.
  • Instrumented application events and analytics pipelines to capture user behavior, engagement, and feature usage.
  • Wrote Python scripts to extract, clean, and analyze application data for business and product teams.
  • Built SQL-based reports and metrics to track active users, growth trends, and content performance.
  • Collaborated with product, analytics, and business stakeholders to deliver data-driven insights.
  • Ensured data reliability and correctness through structured validation and monitoring.
  • Applied strong software engineering best practices (modular code, version control, testing) across systems.

Software Engineer – Android & Data Systems

Picstorie Technologies Pvt Ltd
06.2017 - 06.2019
  • Developed and maintained Android applications using Java/Kotlin, which powered a content-driven platform.
  • Instrumented user interaction and content events within the Android app, generating structured event data.
  • Worked with event-driven application data flowing into backend systems for analytics and reporting.
  • Built Python and Java-based batch processing logic to process application and content data.
  • Designed and optimized database schemas to support analytical queries and performance reporting.
  • Collaborated with backend, product, and business teams to enable data-backed decision-making.
  • Ensured data correctness and reliability through validation, logging, and structured debugging.

Education

Master of Science - Data Analytics

Dublin Business School
Dublin, Ireland
09.2022 - 10.2023

Bachelor of Science - Computer Science

GGSIPU
Delhi, India
06.2013 - 08.2017

Skills

Data Engineering: ETL/ELT Pipelines, Data Pipelines, Lakehouse (Bronze/Silver/Gold), Batch Processing

Big Data: Apache Spark, PySpark, Spark SQL

Programming: SQL (Advanced), Python, JAVA

Cloud (AWS): S3, Glue, Athena, Lambda, SQS

Data Architecture: Data Modeling, Schema Enforcement, Partitioning, Optimization

Tools: Git, GitHub, Airflow (basic), Docker (basic)

Timeline

Data Engineer

Independent / Project-Based Experience
11.2023 - Current

Master of Science - Data Analytics

Dublin Business School
09.2022 - 10.2023

Software Engineer – Android & Data Analytics

YourStory Media Pvt Ltd
06.2019 - 06.2020

Software Engineer – Android & Data Systems

Picstorie Technologies Pvt Ltd
06.2017 - 06.2019

Bachelor of Science - Computer Science

GGSIPU
06.2013 - 08.2017
Himanshu MishraData Engineer | SQL | Python | Apache Spark | AWS