Himanshu Mishra

Data Engineer | SQL | Python | Apache Spark | AWS

Summary

Data Engineer with a strong software engineering background and a Master’s degree in Data Analytics. Experienced in building scalable data pipelines, lakehouse architectures, and analytics-ready datasets using SQL, Python, Apache Spark, and AWS. Skilled in batch and near real-time data processing, data modeling, and performance optimization. Adept at delivering reliable data platforms that support analytics and business decision-making.

Work History

Data Engineer

Independent / Project-Based Experience

11.2023 - Current

Designed and implemented end-to-end data pipelines using SQL, Python, Apache Spark, and AWS.
Built batch and near real-time ingestion workflows using AWS services (S3, Glue, Athena, Lambda, SQS).
Implemented a Bronze–Silver–Gold lakehouse architecture to convert raw data into analytics-ready datasets.
Developed Spark-based ETL jobs for data cleansing, enrichment, aggregation, and validation.
Applied schema enforcement and evolution strategies to ensure data consistency.
Optimized Spark jobs using partitioning, pruning, and efficient query design.
Enabled SQL-based analytics using Athena for reporting and business insights.
Followed production-grade practices, including modular code structure, version control, and monitoring.

Software Engineer – Android & Data Analytics

YourStory Media Pvt Ltd

06.2019 - 06.2020

Developed and maintained Android applications that generated high-volume, event-driven user interaction data.
Instrumented application events and analytics pipelines to capture user behavior, engagement, and feature usage.
Wrote Python scripts to extract, clean, and analyze application data for business and product teams.
Built SQL-based reports and metrics to track active users, growth trends, and content performance.
Collaborated with product, analytics, and business stakeholders to deliver data-driven insights.
Ensured data reliability and correctness through structured validation and monitoring.
Applied software engineering best practices (modular code, version control, testing) across systems.

Software Engineer – Android & Data Systems

Picstorie Technologies Pvt Ltd

06.2017 - 06.2019

Developed and maintained Android applications in Java/Kotlin that powered a content-driven platform.
Instrumented user interaction and content events within the Android app, generating structured event data.
Worked with event-driven application data flowing into backend systems for analytics and reporting.
Built Python- and Java-based batch jobs to process application and content data.
Designed and optimized database schemas to support analytical queries and performance reporting.
Collaborated with backend, product, and business teams to enable data-backed decision-making.
Ensured data correctness and reliability through validation, logging, and structured debugging.

Education

Master of Science - Data Analytics

Dublin Business School

Dublin, Ireland

09.2022 - 10.2023

Bachelor of Science - Computer Science

GGSIPU

Delhi, India

06.2013 - 08.2017