Results-oriented ETL and Data Engineer with over 7 years of hands-on experience delivering scalable data solutions across on-premise and cloud environments. Proven track record of driving ETL modernization using PySpark on AWS EMR, resulting in up to 40% faster batch performance. Adept at building reusable data ingestion frameworks, implementing CDC logic, and automating data validation workflows. Strong expertise in debugging complex data pipelines, optimizing SQL and PL/SQL queries, and delivering business-aligned reporting solutions in Tableau. Passionate about enabling teams through mentorship and technical leadership, with a foundation in QA engineering and deep skills in data validation, pipeline monitoring, and issue resolution using Snowflake, Splunk, and AWS services.
PySpark
Analytics Data Mart (ANM).