Summary
Overview
Work History
Education
Skills
Timeline
Generic

Sanjay Bhakta K

Bengaluru

Summary

Results-driven Data Engineer with 4+ years of experience in building scalable, production-grade data pipelines for large-scale data migration and transformation projects. Proficient in designing end-to-end ETL workflows using Apache Spark, Java, and AWS Glue, with hands-on expertise in handling data movement from Hadoop and MFTS sources to cloud and analytical platforms like Teradata and Cassandra. Experienced in managing over 70 scheduled jobs using Ctrl-M, and validating data integrity using Java-based utilities. Strong command over SQL-based transformations, CDC logic implementation, and cloud data ingestion with a focus on performance, reliability, and business requirements. Adept at working in cross-functional teams to deliver high-quality data solutions, aligned with stakeholder goals.

Overview

4
4
years of professional experience

Work History

Software Engineer

Mphasis
Bengaluru
02.2023 - Current

Project: UDS/Hadoop Exit
Role: Data Engineer
Description:

  • Designed and implemented 20 end-to-end data pipelines for migrating data from Hadoop to AWS ecosystem and Teradata warehouse.
  • Automated file ingestion from Dropbox into AWS Glue tables using Spark and Glue ETL workflows.
  • Used Java and Spark to develop data transformation and validation logic ensuring data consistency between Hadoop and AWS sources.
  • Leveraged Ctrl-M for job scheduling and Jenkins for CI/CD to automate deployment and pipeline orchestration.
  • Loaded and provisioned cleansed data into Teradata, then performed detailed data comparison to validate successful migration.
  • Generated comparison reports to confirm data integrity and completeness post-migration.
  • Collaborated closely with data scientists and stakeholders to deliver reliable, accurate datasets for analytics.

Technologies: AWS Glue, Hadoop, Teradata, Spark, Java, Ctrl-M, Jenkins, Dropbox

Associate Software Engineer

Mphasis
Bengaluru
02.2021 - 02.2023

Project: MBODS Exit
Role: Data Engineer
Description:

  • Built and managed data pipelines for MBODS exit, ingesting data via MFTS routes into Amazon S3 buckets.
  • Triggered CDC (Change Data Capture) logic to generate full and delta files, followed by LRF (Load Ready File) creation for downstream use.
  • Used Spark to write and execute SQL queries based on business logic for data transformation.
  • Utilized a Java-based validation utility to verify record counts, business date accuracy, and sum field checks.
  • Loaded validated and processed data into Cassandra for downstream systems.
  • Maintained and monitored ~70 Ctrl-M jobs, ensuring reliable and timely execution of all workflows.
  • Collaborated with platform, QA, and support teams to resolve job failures, optimize performance, and uphold SLAs.

Technologies: Spark (SQL), AWS S3, MFTS, Java (utility usage), Cassandra, Ctrl-M, Linux, Shell Scripting

Education

M. Tech - Data Science And Engineering

BITS PILANI WILP
Rajasthan
05-2027

Bachelor in Engineering - Computer Science and Engineering - 7.02 CGPA

• Sai Vidya Institute of Technology
Bengaluru
2020

12th Grade - Computers - 88%

Sri Vani Vidya Kendra PU College
Bengaluru
04-2016

10th Grade - 86 %

Vidyanikethan Public School
Bengaluru
04-2014

Skills

  • Data pipeline management
  • Data quality assurance
  • ETL development
  • Data modeling
  • Data migration
  • Effective communication
  • Collaboration skills
  • Data warehousing

Timeline

Software Engineer

Mphasis
02.2023 - Current

Associate Software Engineer

Mphasis
02.2021 - 02.2023

M. Tech - Data Science And Engineering

BITS PILANI WILP

Bachelor in Engineering - Computer Science and Engineering - 7.02 CGPA

• Sai Vidya Institute of Technology

12th Grade - Computers - 88%

Sri Vani Vidya Kendra PU College

10th Grade - 86 %

Vidyanikethan Public School
Sanjay Bhakta K