Summary
Overview
Work History
Education
Skills
Languages
Certification
Accomplishments
Timeline
Generic
Rishi Dhariwal

Rishi Dhariwal

Surat

Summary

Senior Engineer with expertise in Databricks and data pipeline design, led a successful migration initiative at Fractal Analytics resulting in $196K in annual savings. Demonstrated proficiency in problem-solving and automation, engineered fault-tolerant data ecosystems that optimized storage costs by 70%. Enhanced analytics accessibility and reliability through innovative solutions.

Overview

8
8
years of professional experience
1
1
Certification

Work History

Senior Engineer

Fractal Analytics Pvt. Ltd.
Mumbai
07.2021 - Current
  • Client: Skechers
  • Elvis Data Framework: Engineered a fault-tolerant data ecosystem using Databricks (Delta format), Confluent Kafka, and Qlik, adhering to enterprise data engineering best practices. Centralized reporting data into a Delta Lake, enhancing accessibility and reliability for downstream analytics.
  • File Ingestion Framework: Designed and deployed an end-to-end data ingestion pipeline with PySpark, Shell Scripting, Databricks, and Airflow to load structured files from multiple SFTP sources into Delta Lake, creating a unified source of truth for business analytics.
  • Talend to Databricks Migration: Led the automation of Talend job migration to Databricks, leveraging intelligent mapping and scalable design patterns to enable faster processing and performance gains across pipelines.
  • Cost Optimization Strategy: Reduced S3 storage costs by 70% for Delta tables via bucketing strategies and implementation of efficient vacuum techniques across Bronze and Silver layers, improving both storage efficiency and query performance.
  • Snowflake Migration Initiative: Headed a phased migration of Fact and Dimension tables from Snowflake to Databricks, translating complex SQL logic into scalable Databricks code. Delivered $196K+ in estimated annual savings, while modernizing the BI platform for performance and cost efficiency.
  • Company Code-Level Parallelization: Re-architected wholesale data pipelines to process data independently by Company Code (CC). Developed modular DAGs in Airflow to eliminate cross-dependencies, cutting multi-hour delays and streamlining data movement across Bronze, Snowflake, and MSTR layers.

Senior Software Engineer

Impetus Technologies India Pvt. Ltd.
Indpre
10.2020 - 07.2021

Client: Bank of America

  • Migrated VERTICA/TERADATA workloads to Hadoop using Spark, Hive, Shell Scripting.
  • Built Spark-SQL scripts for data transformations; applied optimization techniques.
  • Developed CI/CD pipeline using Jenkins; integrated Autosys for job scheduling.
  • Processed unstructured application logs to extract DDL/DML using Spark.
  • Created shell utilities for log processing; automated workflows end-to-end.

Senior Software Engineer

Clairvoyant India Pvt. Ltd.
Pune
09.2019 - 10.2020

Client: PayPal

  • Subledger Platform Recon: Built automated ETL pipelines and aggregation logic in Spark.
  • Generic Data Quality Framework: Designed parallel processing rules, handled batch/streaming data, and enabled multi-format data profiling.
  • Common Exception Framework: Developed end-to-end Spark Java logging system, integrated with CI/CD, and loaded Hive data into HBase.

System Engineer

Tata Consultancy Services (TCS)
Pune
08.2017 - 09.2019

Client: Morgan Stanley

  • Developed automated revocation system using Spark and Scala for UIOLI analytics.
  • Built parallel JSON parsers and optimized data validation and small file compaction utilities.
  • Designed scheduling and monitoring scripts using Autosys.

Education

Master of Computer Applications - Computer Science

Department of Computer Science, Rollwala Computer
Ahmedabad
03-2017

Bachelor of Computer Applications - Computer Science

C.B. Patel Computer College
Surat
03.2014

Skills

  • Technical Skills:
  • Big Data & Frameworks: Hadoop, Spark, Hive, HDFS, HBase, Impala, Solr
  • Languages: Java, Scala, Python, Shell Scripting, PL/SQL
  • Data Visualization: Tableau, Zeppelin
  • Databases: Oracle, PostgreSQL
  • CI/CD & Scheduling: Jenkins, Autosys
  • Platforms & OS: Unix/Linux, Windows
  • Data pipeline design
  • Databricks development
  • ETL automation
  • Data ingestion
  • Workflow orchestration
  • Soft Skills:
  • Time Management
  • Problem-Solving
  • Requirement Gathering
  • Technical Documentation
  • Client communication

Languages

Hindi
First Language
English
Proficient (C2)
C2

Certification

  • Spark Fundamentals I and II: July 24, 2022
  • GenAI for Everyone : December 24, 2024
  • Data Analyst (Tableau) Associate: March 09, 2025
  • Big Data Essentials: HDFS, MapReduce and Spark RDD: August 24, 2021

Accomplishments

  • TCS LIREL - November, 2018
  • TCS On-Spot - April, 2019

Timeline

Senior Engineer

Fractal Analytics Pvt. Ltd.
07.2021 - Current

Senior Software Engineer

Impetus Technologies India Pvt. Ltd.
10.2020 - 07.2021

Senior Software Engineer

Clairvoyant India Pvt. Ltd.
09.2019 - 10.2020

System Engineer

Tata Consultancy Services (TCS)
08.2017 - 09.2019

Master of Computer Applications - Computer Science

Department of Computer Science, Rollwala Computer

Bachelor of Computer Applications - Computer Science

C.B. Patel Computer College
Rishi Dhariwal