Summary
Work History
Education
Skills
Timeline
Generic

Pritam Galande

Summary

Highly-skilled Data Engineer with 4 years of experience in developing, implementation and optimization of ETL processes. Seeking a challenging role to leverage my expertise in Python, SQL, and the AWS ecosystem to design and implement efficient data engineering solutions.

Work History

Software Engineer

  • Utilized AWS S3 for storing and managing large volumes of data
  • Cleaning, transforming and sorting the data in S3 using AWS Glue
  • Utilized components of AWS Glue like Data catalog, Crawlers, Sensitive Data Detection, Data quality and CloudWatch for monitoring jobs
  • Utilized Athena for further querying and analysis of transformed data
  • Created Lambda functions for triggering Glue jobs upon S3 object updates
  • Created Python utilities which will be used to connect source for the data processing we have used PySpark
  • Updated existing PySpark code and tried to make them more generic across the platform
  • Responsible for pull, push and committing the code using Github
  • Debugging and fixing the issues by sampling the data
  • Responsible for monitoring and troubleshooting ETL jobs.

  • Responsible for managing data from different sources, including S3 and RDS
  • Experience in reading various file formats (CSV, JSON, Parquet) from AWS S3 using Spark
  • Leveraged AWS EMR for big data processing and analytics
  • Improved performance and optimization of existing queries in EMR Cluster using Spark
  • Designed and developed databases and data schema in AWS Snowflake
  • Transferred large data volumes from S3 to Snowflake
  • Developed and automated airflow pipeline monitoring, alarming, and failure notification system.

  • Utilized AWS S3 for storing and managing large volumes of data
  • Cleaning, transforming and sorting the data in S3 using AWS Glue
  • Utilized components of AWS Glue like Data catalog, Crawlers, Sensitive Data Detection, Data quality and CloudWatch for monitoring jobs
  • Utilized Athena for further querying and analysis of transformed data
  • Created Lambda functions for triggering Glue jobs upon S3 object updates
  • Created Python utilities which will be used to connect source for the data processing we have used PySpark
  • Updated existing PySpark code and tried to make them more generic across the platform
  • Responsible for pull, push and committing the code using Github
  • Debugging and fixing the issues by sampling the data
  • Responsible for monitoring and troubleshooting ETL jobs.

Education

Bachelor Of Engineering -

Savitribai Phule Pune University

Skills

PyCharm, Jupyter Notebook, Databricks and JiraOracle and MySQLGitRedshift and SnowflakeIAM, EC2, S3, EMR, Glue, Athena, SNS, Lambda, Redshift, Cloud watch, etc

Timeline

Software Engineer

Bachelor Of Engineering -

Savitribai Phule Pune University
Pritam Galande