
Detail-oriented, organized and meticulous employee. Works at fast pace to meet tight deadlines. Enthusiastic team player ready to contribute to company success.
Forward-thinking Software Engineer / Data Engineer / AWS Cloud Developer with background working productively in dynamic environments. Fluent in Big Data Technologies such as Apache Spark, Hadoop etc.. , Python | Scala programming languages and AWS Cloud Technologies. Proud team player focused on achieving project objectives with speed and accuracy.
PAYROLL PANEL:
Payroll panel identifies payroll transaction from ~5000 employers using Payroll Identification Engine (PIE). Apart from payroll specific rules, PIE also internally invokes recurring algorithm (ARIS) to identify recurring patterns, payroll cadence and few other enrichments like anomalous payroll transactions, etc. PIE module also consumes PRISM model to identify transactions to SMB accounts in the data.
FALCON:
RECOMMENDATION SYSTEM & INTELLIGENT CREDIT ENRICHMENT
CLOUD MIGRATION:
RCIS Data Ingestion and Active‐Copy Framework:
Scope of this project was to rollout Attunity Replicate tool as a new Data Integration Platform, leverage its capability to capture real time data (cdc) and finally to integrate the tool with the existing Enterprise Hadoop Data Lake.
Secondly, the project was also about developing a new framework to handle incremental data changes and to create hive process to make the data in Enterpise Data Lake in‐sync with the Source Data.
Predictive Analytics Data Architecture:
The scope of this project was to setup Enterprise Data Lake and Integrated Analytics platform in Hadoop and to provide support for various uses cases including Analytics, processing, storing and Reporting of voluminous, rapidly changing, structured, semi structured and unstructured data.
The Data Platform provided end to end Data Lineage, transformation Lineage, Metadata management throughout the Data Lifecycle such as Ingestion, ETL, Curation and Models.
Multiple data sources like Relational Databases/Tables, Mainframe Off‐Loads, Structured Data Files (ASCII/Binary), Semi Structured Data Files (XML/JSON), and Un‐Structured Data Files (Raster/Shape Files/PDF/Images) were integrated with Enterprise Hadoop Data Lake.
Apache Spark
Hadoop
Python
Scala
AWS
Git, Kubernetes, Terraform, Docker
MongoDB, Redis, Shell Scripting