Kenil Shah

Pune

Summary

A competent professional with 5+ years of experience in data analysis and data engineering. A keen data scientist with a flair for adapting quickly to dynamic business environments and a pragmatic approach to improvising solutions and resolving complex business issues. Strong understanding of machine learning techniques and algorithms, with experience in data visualization tools. Solid knowledge of business and data analysis (As-Is, To-Be), process design, application-based process reengineering, process optimization, cost control, and revenue maximization. Adept at supporting data-driven decisions and working with different teams to gather data, measure business performance, and produce conclusions that help companies make informed decisions about the future of the business. Experienced in using data from diverse information systems to build tools and forecasting models that improve organizational decision-making. Skilled in data acquisition and extraction, quantitative and qualitative data analysis, and data presentation and reporting.

Overview

3 years of professional experience

Work History

Data Engineer

Cognizant
  • Working with high-performance data integration solutions that connect multiple data sources to extract and rapidly transform data for loading into pre-designed data warehouses
  • Building scalable databases that support ETL processes using SQL and Spark
  • Using PySpark to create an end-to-end pre-processing automation pipeline for multiple vendors
  • Using PySpark to store data in HDFS and applying Spark for faster data processing
  • Iterating on and improving existing pipeline features as well as adding new ones
  • Creating tables, partitioned tables, join conditions, subqueries, and nested queries for application development in GCP
  • Participating in the full development life cycle, including requirement analysis, design, development, deployment, and operations support
  • Building data engineering pipelines on GCP

Data Engineer

Infosys
- 12.2024
  • Worked with high-performance data integration solutions that connect multiple data sources to extract and rapidly transform data for loading into pre-designed data warehouses
  • Built scalable databases that support ETL processes using SQL and Spark
  • Used PySpark to create an end-to-end pre-processing automation pipeline for multiple vendors
  • Used PySpark to store data in HDFS and applied Spark for faster data processing
  • Iterated on and improved existing pipeline features as well as added new ones
  • Created tables, partitioned tables, join conditions, subqueries, and nested queries for application development
  • Participated in the full development life cycle, including requirement analysis, design, development, deployment, and operations support
  • Built data engineering pipelines on GCP

Data Engineer

Comscore LLC
04.2019 - 03.2020
  • Built end-to-end data engineering pipelines for many vendors using PySpark and Python
  • Involved in all stages from ingestion to deployment of the pipelines, both on premises and on GCP

Data Engineer

Analytix India Pvt Ltd.
01.2018 - 04.2019
  • Performed EDA on client-provided data and applied pre-processing such as removing extra punctuation and question marks
  • Created an end-to-end pre-processing automation pipeline for the different vendors of Dunkin' Donuts using PySpark and Python
  • Used PySpark to store data in an HDFS cluster in Parquet format
  • Created ETL jobs for some of the vendors using SSIS
  • Implemented Hive partitioning and bucketing on the client's data
  • Worked on data engineering pipelines on a cloud platform as well

Data Engineer

PRGX INDIA
01.2017 - 12.2017
  • Built end-to-end data engineering pipelines for many vendors using the Talend ETL tool

Education

MCA -

MIT Pune
01.2017

BSC(IT) -

Ganpat University
01.2014

Skills

  • Data Analysis & Mining
  • HDFS
  • Hive
  • Spark
  • SQL
  • Airflow
  • Azure Data Factory
  • ETL & Data Warehousing
  • Data Engineering Pipeline
  • Team Building
  • GCP: BigQuery, Pub/Sub, Dataflow, Dataproc, IAM

DOB

04/17/94
