Summary
Overview
Work History
Education
Skills
Timeline
AccountManager

Sasikala Kumar

Data Engineer
Bangalore

Summary

Accomplished Software Engineer with expertise in ETL development and Spark optimization, previously at Inadev India. Enhanced data pipeline performance by 30% through innovative Delta Lake techniques. Proficient in Azure Data Factory, with a strong focus on collaboration and problem-solving to deliver efficient, scalable solutions.

Overview

3
3
years of professional experience
2
2
Languages
4
4
years of post-secondary education

Work History

Software Engineer

INADEV
07.2023 - 07.2024
  • Designed and developed end-to-end data ingestion pipelines using Azure Data Factory.
  • Built scalable data transformation workflows using PySpark in Databricks.
  • Implemented Delta Tables for ACID transactions, schema evolution, and incremental data loads.
  • Optimized Spark jobs by tuning partitions, caching strategies, and broadcast joins.
  • Developed real-time streaming pipelines using Spark Structured Streaming.
  • Improved pipeline performance by reducing job execution time by 20%.
  • Collaborated with analytics teams to deliver clean, transformed datasets.

Consultant

Capgemini Information Technology
04.2021 - 05.2023
  • Developed batch processing pipelines using Apache Spark.
  • Performed complex SQL transformations and data validations.
  • Implemented incremental loading strategies for large datasets.
  • Assisted in migrating legacy ETL workflows to Azure-based architecture.
  • Improved ETL pipeline performance by 30% by optimizing Spark execution plans, leveraging Delta Lake optimizations (Z-Ordering, OPTIMIZE), and tuning cluster configurations in Databricks.
  • Processed 50–200 GB of batch data daily using PySpark in Databricks.
  • Handled ~100 GB structured and semi-structured data from multiple Azure sources.

Education

B.E - ECE

Sri Ramakrishna Institute Of Technology
Coimbatore, India
08.2012 - 07.2016

Skills

SQL Development

PySpark & Python

Distributed Data Processing

Performance Optimization Techniques

Spark Structured Streaming

Apache Spark (Core & SQL)

Databricks Platform

Data Ingestion & ETL Pipelines

Azure Data Factory

Timeline

Software Engineer

INADEV
07.2023 - 07.2024

Consultant

Capgemini Information Technology
04.2021 - 05.2023

B.E - ECE

Sri Ramakrishna Institute Of Technology
08.2012 - 07.2016
Sasikala KumarData Engineer