Mousam Dey

Kolkata

Summary

Experienced Data Engineer with a strong background in designing, optimizing, and maintaining scalable data pipelines and cloud-based solutions. Proficient in AWS and Azure services, Python, SQL, and Apache Spark. Skilled in data modeling, workflow automation, and building interactive tools using Streamlit. Proven ability to improve performance and efficiency through query optimization and process automation.

Overview

4 years of professional experience
1 Certification

Work History

Data Engineer

UsefulBI Corporation
Remote
11.2024 - Current
  • Built and maintained scalable data pipelines using AWS Glue and Apache Spark, enabling efficient processing and transformation of large datasets.
  • Designed SQL-based data models and performed analysis using AWS Athena to support business intelligence and reporting needs. Optimized existing SQL queries, improving overall performance by 70% and reducing query execution time.
  • Developed and orchestrated automated workflows using AWS Managed Workflows for Apache Airflow (MWAA), improving pipeline reliability and reducing manual intervention.
  • Designed interactive dashboards and internal tools leveraging Python Streamlit for data visualization.
  • Managed code and collaboration through GitHub, implementing version control best practices across multiple data engineering projects.

Data Engineer (Part-time)

Scale AI
Remote
06.2024 - 10.2024
  • Developed an automated system for analyzing text sentiment and large language models, significantly enhancing data processing accuracy.
  • Implemented code-based solutions for complex problems generated on the web, focusing on fine-tuned, functional, and optimized solutions that maintain the prompt-response relation.
  • Streamlined data quality checks, reducing errors and improving dataset reliability.
  • Developed and deployed machine learning models for predictive analytics, utilizing Spark and TensorFlow.

Data Engineer

LTIMindtree
Hyderabad
08.2021 - 05.2024
  • Configured cloud-based storage solutions, improving data access and analysis capabilities.
  • Designed and implemented solutions using Apache Spark, Scala/Python, and Azure Data Lake Storage (Gen1 and Gen2), resulting in a 50% reduction in application response time while increasing system scalability by 75%.
  • Developed an architecture utilizing Apache Spark to improve data processing efficiency by over 60%, reducing the average query execution time from minutes to seconds.
  • Collaborated effectively with teams to understand project requirements, ensuring timely and under-budget project completion.

Education

BTech - Computer Science Engineering

Cooch Behar Government Engineering College
07.2021

Skills

  • Databricks
  • AWS (Glue, Athena, S3, Secrets Manager)
  • Azure (Synapse, Data Factory, Key Vaults, ADLS)
  • GitHub
  • C
  • Python
  • Apache Spark
  • Algorithms and Data Structures
  • Problem-solving abilities
  • SQL and databases
  • Team Collaboration
  • Effective communication

Certification

  • Machine Learning and Deep Learning, Udemy, 01/01/21
  • Intermediate Problem Solving, HackerRank, 01/05/21

Hobbies and Interests

  • Musician
  • Health and Fitness
  • Social Work (angikarparibar.org)

Projects

  • Self-Developed Security Camera (Python OpenCV, External Webcam or System Camera)
  • Computer Vision Face Mask Detection (Convolutional Neural Network (CNN) ML Algorithm, Python PIL Library, Tensorflow-keras, Pandas Library)
  • Online Food Ordering and Medical Diagnostic Machine Learning Chat Bots (Node.js, MongoDB, Dialogflow Essentials)
  • Image Color Detection (Python OpenCV, PIL Library)

Timeline

Data Engineer

UsefulBI Corporation
11.2024 - Current

Data Engineer (Part-time)

Scale AI
06.2024 - 10.2024

Data Engineer

LTIMindtree
08.2021 - 05.2024

BTech - Computer Science Engineering

Cooch Behar Government Engineering College