Vanaja R

Chennai

Summary

Data Engineering professional with 11+ years of overall IT experience, including 8+ years of hands-on expertise in designing, developing, and optimizing data pipelines and ETL workflows.
Skilled in using Python (including Pandas), PySpark, Hive, and SQL to build scalable, high-performance data solutions that support analytics, reporting, and business intelligence initiatives.
Working expertise in data modeling and performance tuning, with a focus on delivering reliable, quality-driven data solutions.
Strong foundational knowledge of AWS cloud services such as S3, Glue, Redshift, Athena, and Lambda, gained through self-study, with an active interest in integrating cloud capabilities into data engineering workflows.

Overview

11+ years of professional experience

Work History

Software Engineer II B

BA Continuum Pvt Ltd - Bank of America

Chennai, India

01.2021 - Current

Project Summary:
Working as a Senior Data Engineer on the Wholesale Credit Risk Platform (CRP) for a leading global bank, developing scalable, fault-tolerant data pipelines using PySpark and Python to process, cleanse, and transform financial risk data for regulatory reporting and advanced analytics.

Key Responsibilities:

Designed and implemented large-scale PySpark-based ETL pipelines on distributed data platforms to process high-volume credit risk data from upstream systems.
Developed modular, reusable Python code for data transformation, enrichment, validation, and aggregation across multiple risk categories and geographies.
Tuned PySpark jobs for optimal performance and memory management using techniques like caching, broadcast joins, and partitioning strategies.
Integrated with Oracle DB for source ingestion and final reconciliations, using JDBC connectors within PySpark workflows.
Implemented robust data quality checks, exception handling, and logging mechanisms to ensure data integrity and traceability across all pipeline stages.
Scheduled and monitored data jobs using Autosys, ensuring timely execution and alerting in case of job failures or SLA breaches.
Used Bitbucket for version control and code collaboration, following best practices for branching, pull requests, and peer reviews.
Worked closely with risk analysts, QA, and business users to gather requirements, validate output, and support UAT in production environments.

Data Engineer

Infosys Technologies

Chennai, India

01.2018 - 01.2021

Project Summary:

Worked as a Data Engineer for a global banking client, building robust data pipelines using Hive on Hadoop to support compliance reporting, customer account analytics, and operational dashboards.

Key Responsibilities:

Designed and developed end-to-end ETL workflows using HiveQL on the Hadoop ecosystem for processing large-scale structured data from multiple banking domains.
Built complex Hive tables (managed and external), performed partitioning and bucketing, and optimized queries for faster reporting and downstream consumption.
Developed Unix shell scripts for data ingestion, job automation, error logging, and file-level validations as part of the ingestion layer.
Created and scheduled Control-M workflows for batch processing, ensuring job dependencies and SLAs were consistently met.
Participated in data quality checks, performed source-to-target mapping, and supported regression testing across multiple releases.
Used Bitbucket for version control and SonarLint to enforce code quality standards and reduce technical debt in SQL and shell code.
Interacted with business users and analysts to understand data requirements and translated them into scalable Hive-based data models.

Software Engineer

Mphasis

Chennai, India

02.2014 - 12.2017

Project Summary:
Worked on a large-scale retail data processing system for a leading US-based client, focusing on the design, development, and maintenance of high-volume transactional and batch data pipelines in a mainframe environment.

Key Responsibilities:

Designed and implemented high-performance batch data pipelines using JCL, COBOL, and DB2, handling millions of daily retail transactions.
Created and optimized SQL queries and DB2 procedures to support real-time and batch data processing for reporting and analytics needs.
Built and managed VSAM datasets and CICS transaction screens for inventory, sales, and pricing modules, ensuring data integrity and consistency.
Automated repetitive data workflows and reporting tasks using REXX scripts, reducing manual intervention and improving productivity.
Orchestrated scheduled jobs and complex batch chains using Autosys, ensuring reliable and timely data processing with proper dependencies and error handling.

Education

Bachelor of Electrical and Electronics Engineering

Tagore Engineering College

Chennai

05-2013

HSC - 12th

Velammal Matriculation, Higher Secondary School

Chennai

05-2009

SSLC - 10th

Velammal Matriculation, Higher Secondary School

Chennai

05-2007

Skills

ETL
Python
Pandas
PySpark
SQL
Git

Languages

Tamil

Bilingual or Proficient (C2)

English

Bilingual or Proficient (C2)

Soft Skills

Strong Collaboration & Cross-Team Communication
Analytical Thinking & Complex Problem Solving
Proactive Stakeholder Engagement
Adaptability in Fast-Paced Environments
Ownership & Accountability of Deliverables
Agile Methodology & Sprint Planning Participation

Accomplishments

Received Silver, Gold, and Platinum awards for delivering key projects.
Received Insta Awards from Infosys for exceptional contribution to the project.
Awarded the “SUMMIT AWARD” for peak performance in the team.
Awarded the organization's “PAT ON THE BACK” award for automation.
