Summary
Work History
Education
Skills
Timeline
Generic

Pritam Galande

Summary

Highly-skilled Data Engineer with 4 years of experience in developing, implementation and optimization of ETL processes. Seeking a challenging role to leverage my expertise in Python, SQL, and the AWS ecosystem to design and implement efficient data engineering solutions.

Work History

Software Engineer

  • Utilized AWS S3 for storing and managing large volumes of data
  • Cleaning, transforming and sorting the data in S3 using AWS Glue
  • Utilized components of AWS Glue like Data catalog, Crawlers, Sensitive Data Detection, Data quality and CloudWatch for monitoring jobs
  • Utilized Athena for further querying and analysis of transformed data
  • Created Lambda functions for triggering Glue jobs upon S3 object updates
  • Created Python utilities which will be used to connect source for the data processing we have used PySpark
  • Updated existing PySpark code and tried to make them more generic across the platform
  • Responsible for pull, push and committing the code using Github
  • Debugging and fixing the issues by sampling the data
  • Responsible for monitoring and troubleshooting ETL jobs.

  • Responsible for managing data from different sources, including S3 and RDS
  • Experience in reading various file formats (CSV, JSON, Parquet) from AWS S3 using Spark
  • Leveraged AWS EMR for big data processing and analytics
  • Improved performance and optimization of existing queries in EMR Cluster using Spark
  • Designed and developed databases and data schema in AWS Snowflake
  • Transferred large data volumes from S3 to Snowflake
  • Developed and automated airflow pipeline monitoring, alarming, and failure notification system.

  • Utilized AWS S3 for storing and managing large volumes of data
  • Cleaning, transforming and sorting the data in S3 using AWS Glue
  • Utilized components of AWS Glue like Data catalog, Crawlers, Sensitive Data Detection, Data quality and CloudWatch for monitoring jobs
  • Utilized Athena for further querying and analysis of transformed data
  • Created Lambda functions for triggering Glue jobs upon S3 object updates
  • Created Python utilities which will be used to connect source for the data processing we have used PySpark
  • Updated existing PySpark code and tried to make them more generic across the platform
  • Responsible for pull, push and committing the code using Github
  • Debugging and fixing the issues by sampling the data
  • Responsible for monitoring and troubleshooting ETL jobs.

Education

Bachelor Of Engineering -

Savitribai Phule Pune University

Skills

undefined

Timeline

Software Engineer

Bachelor Of Engineering -

Savitribai Phule Pune University
Pritam Galande