
Adaptable professional with 12 years background in developing and maintaining data in retail, banking and healthcare industries seeking to transition into new field. Seven years of expertise with AWS products, including S3, Redshift, Glue, Athena, EMR and Lambda to optimize performance and streamline data workflows. Proven ability to work with large datasets and collaborate with cross-functional teams to meet business needs. Committed to driving data-driven decision-making, enhancing data quality, and improving system performance. Competent in Python, SQL, and cloud computing platforms such as AWS, with an emphasis on delivering reliable, scalable solutions. Excellent problem-solving abilities and a talent for converting intricate specifications into useful insights.
FINTELLA MIGRATION:
Led end-to-end upgrade of a mission-critical APTransformation pipeline from EMR 6.7 → 7.3 and Hudi 0.11 → 0.15, driving platform modernization under a strict timeline aligned with upstream VP-level goals resolving cross-version compatibility issues, aligning DAG lineage in Airflow, and ensuring zero downstream disruption.
Responsibilities:
CONFLUENCE:
Designed and Developed the integration of VPS (Vendor Payment System) with Confluence Data Lake, architecting a solution to optimize data ingestion for a 4-petabyte enterprise data lake serving 530+ AP/AR datasets. Designed and implemented a centralized streaming architecture utilizing AWS SNS, Kinesis Firehose, and DMS, replacing individual database connections with a scalable event-driven system.
Responsibilities:
JIGSAW:
Jigsaw is designed to decrease the number of disputes, increase the accuracy of vendor payments, and assist vendors in understanding what was and was not paid and why. By creating additional ways to reimburse vendors for overages and, more crucially, giving them visibility into Amazon's payment-related decisions and the reasons behind the treatment they received, it will eventually decrease contacts for short-pays.
Responsibilities:
NEXT:
Selection Guidance –> New Products: Goal of this project is to find business opportunities and help sellers to expand their portfolio for new products in same marketplaces.
Selection Guidance –> Cross listing: Goal of this project is to recommend sellers to sell his fast-moving goods across other marketplaces.
Responsibilities:
DEBT MANAGER (DM9):
Debt Manager is a FICO product which through its recovery tactics and strategies allows BOA associates to efficiently recover charge off and delinquent Loans. The main objective of this project is to receive account information which is in the state of recovery from DM9, process the data and push them into tables for End Users.
Responsibilities:
Highmark is a health insurance company and we are involved in storing its claim and member data in data warehouse. We use Extract-Transform-Load (ETL) process where we receive data from source, transform it as required and finally load it into target using Mainframes and Teradata.
Responsibilities:
Demonstrated sustained high performance at Amazon with two consecutive Exceeds ratings, preceded by multiple years of top-band Meets, reflecting consistent delivery and ownership across complex initiatives.
End-to-end ownership of large-scale data platform migrations (EMR, Hudi, Airflow).
Strong judgment in making high-risk production decisions (snapshot vs incremental migration strategy).
Expertise in EMR, Spark, Hudi, Iceberg, AWS Data Lake architecture.
Designed scalable ingestion patterns (SNS, Firehose, DMS) in a 4PB+ data lake.
Experience across e-commerce, fintech, insurance, payments, and recovery systems.
Transitioned successfully from Mainframe/Teradata → Big Data → AWS Cloud → Modern Lakehouses (Hudi/Iceberg).
Trusted as a deep-dive expert for diagnosing and resolving critical data accuracy problems, ensuring reliability and correctness across end-to-end pipelines.