Overview
Work History
Education
Skills
Certification
Projects
Accomplishments
Timeline
Generic

SOUMYA SAXENA

Allahabad

Overview

4
4
years of professional experience
1
1
Certification

Work History

National Australian Bank
Gurgaon
04.2023 - Current
  • Led the designing, development and maintenance of scalable data pipelines while optimizing complex queries and dashboards to support business decision-making.
  • Refactor existing data models, ETL processes, and codebases to improve performance and maintainability.
  • Automate data workflows, table updates, and reporting processes to reduce manual intervention and ensure timely data delivery.
  • Migration of data tables and views from one framework based on Postgres database to another framework based on Databricks.
  • Work closely with business leaders and stakeholders to understand requirements, address challenges, and deliver solutions that meet organizational goals.
  • Mentored junior engineers, instilling a culture of ongoing growth and knowledge exchange.

Data Engineer

Amazon
Bangalore
06.2022 - 03.2023
  • Hands-on experience with AWS analytics and big data related cloud services
  • Experience in designing and building data pipelines for analytics and business needs
  • Experience with ETL and BI tools - SQL, Redshift, Quicksight
  • Worked as oncall to monitor system and data pipeline health and resolve infrastructure related issues
  • Effective problem-solving and troubleshooting skills
  • Strong collaboration and teamwork skills with excellent written and verbal communication skills.

Cloud Support Engineer

Amazon Web Services
Hyderabad
08.2020 - 05.2022
  • Learned and gained expertise in cutting-edge technologies such as Amazon Redshift, Amazon Quicksight, Amazon Opensearch, Amazon Glue, Amazon Athena and Amazon S3
  • Applied advanced troubleshooting techniques and utilized AWS services to cater to customers' specific requirements.
  • Worked closely with cross-functional teams at Amazon in India and offshore to replicate and address customer concerns.
  • Implemented best practices for security and reliability of cloud environments.

Education

Bachelor of Technology - Computer Science Engineering

Dr. A.P.J. Abdul Kalam Technical University
Lucknow
01.2020

Skills

  • SQL
  • Python
  • Databricks
  • Amazon Redshift
  • Amazon Glue
  • Amazon Athena
  • Amazon EMR
  • Amazon S3
  • Spark - Pyspark and Spark SQL
  • Power BI (DAX and Visualisation)
  • Microsoft Excel
  • Version control tool - Git
  • Data Migration
  • Performance Tuning
  • Refactoring

Certification

- AWS Certified Solutions Architect – Associate Amazon Web Services

- Databricks Data Analyst Associate

Projects

Service Works:

The aim  of this project is to design and implement an end-to-end data pipeline on premises using Python and SQL to provide real-time insights into health and ensure timely issue resolution.

- Automate the extraction of data from data sources 

- Leverage Python for data cleaning and preprocessing and Utilize SQL to transform raw service data into meaningful metrics.

- Develop a comprehensive Power BI dashboard and integrate it with the data pipeline.

Dwell Time Metric Creation:

The aim of this project is to develop a directional business metric to capture the total time a customer spent on Amazon site in a transit.

- Exploring and analyzing data sources and ETL tools

- Writing complex SQL queries to compute transit duration

- Designing and building ETL pipelines to compute dwell time metric.

- Test ETL pipeline and validate the dwell data.

- Modify existing ETL pipelines to introduce dwell time metric.

- Create business reports using BI tool using Tableau.

 Marketplace Launch

 The aim of this project is to update ETL pipelines to introduce the data for new marketplace in prod datasets and business reports.

- Evaluate the impact of new marketplace launch on production datasets and business reports.

- Modify the ETL pipelines and dev code to introduce data for the new marketplace.

- Monitor the ETL pipeline and validate the data on the database.

Accomplishments

  • Demonstrated expertise in AWS services during a competitive AWS Game day event, resolving real world scenarios using EC2, S3, RDS and Lambda.

Timeline

National Australian Bank
04.2023 - Current

Data Engineer

Amazon
06.2022 - 03.2023

Cloud Support Engineer

Amazon Web Services
08.2020 - 05.2022

Bachelor of Technology - Computer Science Engineering

Dr. A.P.J. Abdul Kalam Technical University
SOUMYA SAXENA