Summary
Overview
Work History
Education
Skills
Projects
Certification
Timeline
Generic

Punitkumar Y Harsoor

Bangalore

Summary

Skilled Data Science with 2.5 years of experience in designing and implementing efficient ETL processes. Looking for transition and challenging position in Data engineering/Big data or data similar domains where I can apply and enhance skills to solve analytical and data related problems.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Data Science Consultant- Tech Product

TimesPro
06.2022 - Current
  • Migrated ~10GB of on-premises data to Azure Cloud using Azure Data Factory, Databricks, and ADLS Gen2, and developed an executive-level LMS dashboard by collaborating with 12+ client
  • Enhanced data processing by transitioning extract creation from Hive to PySpark, increasing speed by 63%, and optimizing Spark queries to improve extract performance by 20%
  • Automated data loading processes using shell scripts, reducing manual effort by 60% and errors by 70%, while ensuring data accuracy with custom PySpark transformations
  • Collaborated with clients to understand requirements and deliver tailored solutions while contributing to Data Science and Big Data L&D by developing training content and resources

Junior Data analyst

Jigsaw U-next
02.2021 - 05.2022

Graduate Student Research Assistant - Intern

IoTLabs-UVCE
12.2019 - 11.2020

Education

M.Tech - CSE

UVCE (Bengaluru University)
01-2021

B.E - CSE

RNS Institute of Technology
01-2017

12th - Science

Gurukul PU College
01-2013

Skills

  • Python
  • SQL
  • PySpark
  • Azure Data Factory
  • Azure Data Bricks
  • PowerBI
  • Tableau
  • Machine learning
  • Data Analysis
  • GitHub

Projects

Credit Lending Project, Executed comprehensive data cleaning on the Lending Club dataset, eliminating 95% of duplicates, null values, and datatype issues, enhancing analysis accuracy by 40%., Applied optimization techniques in PySpark to improve data processing performance and efficiency., Investigated and implemented compression techniques, reducing storage space and optimizing data transfer and processing times by 30%.

Certification

  • Basic to Advance PowerBI and Tableau certification, Udemy, 2023
  • Apache Spark 3 - Spark programming in python certification, Udemy, 2023

Timeline

Data Science Consultant- Tech Product

TimesPro
06.2022 - Current

Junior Data analyst

Jigsaw U-next
02.2021 - 05.2022

Graduate Student Research Assistant - Intern

IoTLabs-UVCE
12.2019 - 11.2020

M.Tech - CSE

UVCE (Bengaluru University)

B.E - CSE

RNS Institute of Technology

12th - Science

Gurukul PU College
Punitkumar Y Harsoor