Overview
Work History
Education
Skills
Certification
Projects
Timeline
Generic

Omkar Shitole

GCP Data Engineer
Pune,Maharashtra

Overview

6
6
years of professional experience
16
16
years of post-secondary education
2
2
Certifications

Work History

Data Engineer

BBI
Pune
10.2022 - Current
  • Data Pipeline Development, Data Processing, Data Warehousing, Real-time Data Streaming, Data Integration and Transformation, Monitoring and Optimization

Senior Software Engineer

MediaAgility
12.2021 - 10.2022
  • Data Pipeline development, Data Security and Compliance, Collaboration and Documentation, Big data Processing

Data Analyst

ADP
05.2019 - 12.2021
  • Analyze large amounts of service request to discover trends as well as patterns and perform the exploratory data analysis using tableau
  • Present information using data visualization libraries like pandas, seaborn
  • Proposed model/automation and strategies to business challenges which leads to reducing 65% manual work
  • Cleansing, and verifying the integrity of data used for analysis
  • Deployment of the software model using Jenkins
  • Manage MySQL database systems via SQL Server Management Studio (SSMS) tool, maintain the client data on Prod and lower environments.

Windows Server Administrator

Softenger
11.2018 - 02.2019
  • Install, troubleshoot, maintain, and configure Microsoft Windows desktop
  • Patching

Education

B.E E&Tc -

JSPM's Imperial College Of Engineering Pune
08.2014 - 04.2018

HSC - undefined

Bhaskaracharya Junior College, Aurangabad
07.2012 - 05.2014

SSC - undefined

K.T.M.S, Aurangabad
06.2002 - 05.2012

Skills

Cloud Composer

undefined

Certification

Google Cloud Certified Professional Data Engineer (Google Cloud Platform), 09/01/22, www.credential.net/3f2eeb3e-8ec1-4ebb-a7...

Projects

Transunion: Event-Driven Data Pipeline integrating Cloud Pub/Sub with On-Premise SQL Server at Transunion, 10/01/23, Present

 This data pipeline uses events in a Pub/Sub topic to trigger processing. Cloud Functions read Parquet files, transform them, and land the data as CSVs in a GCS bucket. A Composer pipeline then processes these CSVs in chunks and stores them in real-time within an on-premise MS SQL Server. Responsibilities included collaborating on the architecture, utilizing Google Cloud services, and ensuring seamless integration.


Transunion: Cloud Function & Jenkins Integration for Seamless Data Uploads at Transunion, 05/01/23, 07/01/23

In this innovative initiative, we are revolutionizing our data upload process by implementing a Cloud Function that seamlessly integrates with Jenkins. The objective is to automate the triggering of Jenkins pipelines whenever new data is uploaded to GCS buckets. This streamlined approach enhances efficiency, reduces manual intervention, and accelerates our data processing workflows.


Transunion: Across Google Cloud Projects using Cloud Composer, 01/01/23, 05/01/23,

In this ground breaking initiative, we are building a sophisticated data pipeline to efficiently transfer data from GCS buckets in one Google Cloud project to BigQuery tables in another project. The implementation leverages the power of Cloud Composer, ensuring a scalable, orchestrated, and automated workflow for smooth data integration.


Google: New York State DEC + Google Data Pipeline (Data Warehouse to BigQuery), 02/01/21, 08/01/22

Implemented a robust data pipeline using Google Cloud services for the New York State Department of Environmental Conservation (DEC). The pipeline leveraged key components such as Cloud Run, Google Cloud Storage (GCS), Cloud Storage, Artifact Registry, and Cloud Composer. Orchestrated daily runs with Cloud Composer, seamlessly integrating it with Cloud Run for enhanced automation. Scheduled Cloud Composer to trigger Cloud Run for daily data extraction, sending GET requests to the client API and transforming raw JSON data into newline-delimited JSON format. Utilized Cloud Composer to efficiently load processed data into BigQuery, contributing to a streamlined workflow and improved accessibility of environmental data for DEC.

Timeline

Data Engineer

BBI
10.2022 - Current

Senior Software Engineer

MediaAgility
12.2021 - 10.2022

Data Analyst

ADP
05.2019 - 12.2021

Windows Server Administrator

Softenger
11.2018 - 02.2019

B.E E&Tc -

JSPM's Imperial College Of Engineering Pune
08.2014 - 04.2018

HSC - undefined

Bhaskaracharya Junior College, Aurangabad
07.2012 - 05.2014

SSC - undefined

K.T.M.S, Aurangabad
06.2002 - 05.2012
Omkar ShitoleGCP Data Engineer