Romil Joshi

Bengaluru

Summary

Diligent, results-driven Data Engineer with three years of hands-on experience designing, implementing, and maintaining scalable data pipelines. Proficient in the programming languages and technologies used for data processing and analysis. Demonstrated expertise in building reliable data solutions that support business analytics, machine learning, and decision-making.

Overview

3 years of professional experience
1 Certification

Work History

Data Engineer

Tekion Corp
10.2022 - Current
  • Designed, implemented, and maintained data pipelines to process large volumes of data from diverse sources, ensuring data quality and reliability.
  • Reduced S3 storage and read/write costs by up to 40% by optimizing Apache Spark and Delta Lake usage.
  • Developed a job orchestration tool for Apache Spark jobs in Java, ensuring efficient scheduling and management of data processing workflows.
  • Worked on the Audit Change Streaming Pipeline for ingesting change events into the warehouse, reducing the cost incurred by 60%.
  • Led a schema registry project that provided a single view of the schemas of 90% of the tables across different databases.

Associate Data Engineer

Tekion Corp
07.2021 - 10.2022
  • Developed and optimized ETL processes to efficiently extract, transform, and load data into data warehouses and data lakes.
  • Optimized raw data and fact creation pipelines, improving ingestion time by 50%.
  • Implemented encryption/decryption for 100% of PII data, improving security and lowering the risk of a data breach.
  • Worked on a data validation project to detect mismatches between copies of data stored in different places.
  • Implemented data security measures and ensured compliance with regulatory requirements such as GDPR and CCPA.
  • Worked on performance tuning and optimization of data processing workflows to improve efficiency and reduce processing times.

Intern

Tekion Corp
01.2021 - 07.2021
  • Assisted in designing and developing data pipelines for processing and analyzing large datasets using Python and Apache Spark.
  • Contributed to the development of data models and schema designs for various data sources and use cases.

Education

BTech - Information and Communication Technology

Dhirubhai Ambani Institute of Information and Communication Technology
Gandhinagar, Gujarat

Skills

  • Languages: C, Python, Java, SQL
  • Frameworks and Tools: Apache Spark, Spring Boot, Delta Lake, Kafka, Microsoft Azure, Snowflake, Databricks, EMR, Athena, Iceberg, Hadoop, Hive, Presto, Trino, AWS, Git
  • Databases: PostgreSQL, MongoDB, MySQL

Certification

Data Warehousing on AWS
