Satyajit Ligade

GCP DATA ENGINEER
Bengaluru

Summary

  • I aim to leverage my expertise as a GCP Data Engineer to contribute to impactful data projects, drive innovation, and achieve strategic goals while advancing my skills in a growth-oriented organization.
  • A skilled Data Engineer with 4.2 years of experience working with a range of data technologies.
  • Collaborated with cross-functional teams to gather business requirements and translate them into scalable data solutions.
  • Experienced in working extensively with Google Cloud Platform (GCP) services such as BigQuery, Cloud Storage, Cloud Composer (Apache Airflow), Cloud Dataflow and Cloud Dataproc.
  • Proficient in loading and transforming data efficiently into BigQuery for analytics and downstream consumption. Working knowledge of Snowflake and Databricks.
  • Extensive experience in designing, implementing, and optimizing data warehouse solutions using BigQuery for analytical workloads.
  • Hands-on experience in Big Data processing using Cloud Dataproc to build scalable and distributed data solutions.
  • Worked with various Spark components such as Spark Core, Spark SQL, DataFrame API, and different file formats including CSV, JSON, Parquet, Avro, and ORC.
  • Proficient in SQL concepts including DRL statements, Joins, Aggregate & Analytical functions, and Windowing functions for efficient querying and data manipulation.
  • Familiar with Python, from core concepts to advanced topics such as Object-Oriented Programming (OOP).
  • Experienced in using Git as a version control tool for maintaining scripts in a repository.
  • Followed Agile methodologies in the SDLC to ensure best practices, continuous delivery, and efficient project execution.
  • Actively participated in daily scrum calls, sprint planning, sprint reviews, sprint retrospectives, and backlog grooming sessions for efficient Agile execution.
  • Strong team player with excellent communication, documentation, analytical, problem-solving, and interpersonal skills to effectively collaborate across teams and deliver high-quality solutions.

Overview

4 years of professional experience

Work History

GCP Data Engineer

Manas Solutions
12.2020 - Current
  • Company Overview: Farfetch is a British e-commerce company focused on luxury clothing and beauty products. It operates as a digital marketplace that sells products from several hundred brands, boutiques and department stores from around the world.
  • Managing data ingestion from multiple sources, primarily in formats like CSV, JSON, and Parquet, using Cloud Storage as a centralized data lake.
  • Ensuring data quality through cleansing and deduplication before further processing.
  • Developing PySpark and SQL scripts on Cloud Dataproc to perform data transformations and aggregations based on business requirements.
  • Processing batch data using Cloud Dataflow (Apache Beam) for scalable and distributed transformations.
  • Implementing ETL pipelines using Cloud Composer (Apache Airflow) to orchestrate data workflows efficiently.
  • Testing and validating data transformation scripts to ensure accuracy and reliability.
  • Loading the refined Parquet/ORC data from Cloud Storage to BigQuery/Snowflake, ensuring data consistency and integrity.
  • Maintaining Data Warehouse solutions on BigQuery and Snowflake, implementing incremental loads and Slowly Changing Dimensions (SCD Type 2).
  • Client: Farfetch
  • Client: Spirit Airlines, Inc.
  • Client: TDS Telecommunications LLC.

GCP Data Engineer

Spirit Airlines, Inc.
12.2020 - Current
  • Company Overview: Spirit Airlines, Inc. is an American ultra-low cost airline headquartered in Dania Beach, Florida, in the Miami metropolitan area. Spirit operates scheduled flights throughout the United States, the Caribbean, and Latin America.
  • Responsible for managing data coming from different sources stored in Cloud Storage.
  • Writing PySpark scripts to perform efficient data transformations on Dataproc clusters.
  • Using Cloud Storage to hold the data lake, the staging area, and the refined data.
  • Loading the refined data from Cloud Storage to BigQuery to build a data warehouse while ensuring data quality.
  • Creating tables in BigQuery and maintaining the relationships between them as per client requirements, then loading data from Cloud Storage.
  • Automating data extraction, transformation, and loading using Cloud Composer (Apache Airflow) to create job workflows.
  • Modifying existing business logic where required, after validating the data between source and target.

GCP Data Engineer

TDS Telecommunications LLC
12.2020 - Current
  • Company Overview: TDS Telecommunications LLC delivers high-speed internet, TV entertainment, and phone services to a mix of small to mid-sized urban, suburban and rural communities throughout the U.S.
  • Designing and implementing Python utilities for ETL scripts.
  • Writing SQL queries to fetch data from a database.
  • Writing complex SQL queries with attention to the factors that influence query performance.
  • Efficiently organizing data extracted from an Oracle database and optimizing it for querying.
  • Debugging ETL scripts that are used for performing data cleaning using various checks.
  • Using Git for version control: pulling, committing, and pushing code to the repository.
  • Testing the accuracy and reliability of the processed data after transformations.

Education

Bachelor of Engineering - B.E

Solapur University
Sangola

Skills

  • Programming Languages: PySpark, Python, SQL
  • Databases: Oracle
  • GCP Storage: GCS, Cloud SQL
  • Data Warehouse: BigQuery
  • Data Processing: Dataproc, Dataflow
  • Orchestration & Automation: Cloud Composer (Apache Airflow)
  • Development Tools: PyCharm, Jupyter Notebook, Visual Studio
  • Version Control: Git, GitHub
  • Operating Systems: Windows, Linux (basic)
