
JAY P BHOYAR

Mumbai

Summary

Data Engineer with 2.5 years of experience at IKS Health, designing, building, and optimizing complex data pipelines and ETL processes. Strong in SQL, Python, and cloud platforms, with a focus on seamless data integration and robust data solutions. Works well in collaborative environments, adapts quickly to evolving needs, and contributes to team success.

Overview

2 years of professional experience
1 Certification

Work History

Data Engineer

IKS Health
05.2022 - Current
  • Developed and managed big data ingestion and processing applications on GCP, utilizing BigQuery, Dataflow, Composer, Cloud Storage, and Dataproc
  • Built and optimized BigQuery data models and datasets for enhanced performance and scalability, leading to improved query response times
  • Continuously monitored and optimized data processing and storage resources on GCP, resulting in cost savings and enhanced system performance
  • Diagnosed and resolved data pipeline issues and performance bottlenecks, ensuring seamless operations and minimal downtime
  • Tuned and optimized Spark and Apache Beam applications, reducing processing time by 30%
  • Implemented real-time data processing solutions using Kafka and Pub/Sub, enhancing data ingestion efficiency and system responsiveness
  • Managed the development life-cycle for multiple agile projects, ensuring timely delivery and adherence to project requirements
  • Collaborated with both technical and business stakeholders to identify needs, provide solutions, and ensure successful project outcomes
  • Created comprehensive documentation for data engineering processes, best practices, and technical specifications, facilitating knowledge sharing and onboarding
  • Migrated legacy systems to modern big-data technologies, improving performance and scalability while minimizing business disruption
  • Automated routine tasks using Python scripts, increasing team productivity and reducing manual errors
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability
  • Optimized data processing by implementing efficient ETL pipelines and streamlining database design
  • Enhanced data quality by performing thorough cleaning, validation, and transformation tasks
  • Strengthened communication skills through regular interaction with technical and business stakeholders

Education

Post Graduate Diploma - Big Data Analytics

IACSD
Pune
09.2022

Bachelor of Engineering - Mechanical Engineering

Sinhgad Academy of Engineering
Pune
01.2017

Skills

  • Python
  • PySpark
  • SQL
  • Google Cloud Platform (GCP)
  • Cloud Storage
  • BigQuery
  • Dataflow
  • Dataproc
  • Cloud Composer
  • ETL development
  • Data Pipeline Design

Certification

  • HackerRank SQL (Intermediate) Certificate, https://www.hackerrank.com/certificates/12684f4ba14b
  • Python Specialization Certificate (University of Michigan), https://www.coursera.org/account/accomplishments/specialization/certificate/K49PCUMUGFEL
  • Tableau Certification, https://www.udemy.com/certificate/UC-7ed925e9-bf72-4667-ad7b-c218618d92f0/
  • Microsoft Certified: Azure Data Fundamentals, https://learn.microsoft.com/en-us/users/jaybhoyar-5285/credentials/2d7ff792d03f6eff?ref=https%3A%2F%2Fwww.linkedin.com%2F

Projects

Data Pipeline

  • Spearheaded the creation of a GCP data pipeline from scratch as part of a pilot project
  • Developed a Python Data Extraction Application
  • Implemented Pub/Sub messaging to trigger a Cloud Function
  • Launched Dataflow jobs using Flex Templates for data validation
  • Successfully loaded clean, transformed, and validated data into BigQuery
  • Technologies: Apache Beam, Dataflow, Python, GCS, Docker, Artifact Registry, Pub/Sub, BigQuery, SQL, Apache Spark, Dataproc
  • Result: Successfully migrated an on-premises data pipeline to GCP, enhancing data processing efficiency by 40% and reducing errors by 95%

Schema Sync

  • Developed a Python Google Cloud Function to update BigQuery table schemas
  • Technologies: Python, Google Cloud Storage, Google Cloud BigQuery, Storage Trigger
  • Result: Reduced manual effort by 50% in updating table schemas

Data Extractor Utility

  • Created a utility to automate large data extraction processes from client EPMs
  • Technologies: Python, GCS API, BigQuery API, Pub/Sub API
  • Result: Reduced manual effort by 40% in extracting data from various client EPMs
