Summary
Overview
Work History
Education
Skills
Timeline
Generic

Shubham Khadake

Data Engineer
Bangalore

Summary

Big Data Engineer around 5 years of experience in designing, developing, and optimizing high-performance data pipelines and architectures. Experienced data engineer with a specialization in Big Data processing and distributed computing. Proficient in designing and optimizing ETL/ELT workflows for seamless data integration using Python, SQL, Apache Spark and Azure, AWS

Overview

6
6
years of professional experience

Work History

Senior Software Engineer

Trinity Mobility Private
11.2022 - Current

Project Name: Emergency Respond Data Warehouse Analysis


  • Designed and implemented a complete data pipeline, integrating various data sources and leveraging technologies such as Apache Spark, Hive, Kafka, MySQL, HBase, NIFI, and Python.
  • Enhanced data processing efficiency by 30% through Spark-Hive and Spark-HBase integrations, reducing latency in real-time and batch processing scenarios.
  • Implemented daily jobs using Databricks to refresh data and maintain up-to-date information. I worked on the latest Azure Databricks feature, the Delta Live Table.
  • Secured a 40% increase in Kafka data ingestion speed, optimizing the streaming pipeline, and reducing overall processing time by 25% across various data pipeline components.
  • Increased data processing efficiency by 40% through parallel processing and Spark jobs, and provided solutions for production issues.

Associate Engineer

TCTSL
08.2019 - 12.2021

Project Name — Telecom Infra Data Analysis


  • ETL from different sources to different destinations, and transformed using SQL stored procedures and scheduled triggered jobs.
  • Proficient in writing complex SQL queries with multiple table joins and stored procedures, depending on requirements.
  • Have a good understanding of SQL functions for data preparation and transformation.
  • Created SQL jobs, tested, and monitored the same.
  • Transformed SQL complex queries into a readable format for better analysis.

Education

BE in Electronic And Telecommunication Engineering -

Solapur University of Engineering And Technology
Solapur, India
04.2001 -

Diploma In Electronics And Telecommunication Engi -

Government Polytechnic of KARAD
Karad, India
04.2001 -

Skills

Big Data Technologies: Hadoop, Apache Spark, Python, SOL, Hive, PySpark, Yarn, Hdfs, Map Reduce

Cloud Technologies: Azure(ADF, Synapse, Azure Databricks), AWS(S3,RDS,ATHENA,EMR)

Databases and Tools: Oracle, MySQL, NoSQL, PostgreSQL, HBase

Languages: Python, SQL

Scheduling: Apache Airflow, Cron Tab

Data Modeling: Data Warehousing , Data Lake

Data Pipelines: ETL, ELT, Streaming, Batch Processing, Data Strategy, Data Source, Production Issues, Data Transformation

Data Governance: Data Quality, Data Security, Data Lineage , Metadata Management

Version Control: Git, Jira , Agile, Scrum

Platforms: Windows, Linux

Timeline

Senior Software Engineer

Trinity Mobility Private
11.2022 - Current

Associate Engineer

TCTSL
08.2019 - 12.2021

BE in Electronic And Telecommunication Engineering -

Solapur University of Engineering And Technology
04.2001 -

Diploma In Electronics And Telecommunication Engi -

Government Polytechnic of KARAD
04.2001 -
Shubham KhadakeData Engineer