
Data Analyst with proven skills in ETL processes, data visualization, and team management. Expertise in fraud-detection analysis of banking transactions and in delivering actionable insights through advanced analytics.
◦ Extract, Transform, Load (ETL) Development and Optimization: Designed and implemented robust ETL
pipelines for transaction monitoring at PNC Bank. Utilized PySpark to handle large-scale data extraction,
transformation, and loading, ensuring data accuracy and integrity across all workflow stages.
◦ Data Extraction and Ingestion: Automated data extraction and ingestion processes from diverse sources, including
relational databases, APIs, and HDFS storage systems. Leveraged PySpark and Hive to streamline
data integration, reduce latency, and supply the data used to detect money laundering and identify suspicious activity.
◦ Detection of Fraudulent Transactions: 5+ years of experience as a Data Engineer designing and implementing data pipelines, ETL workflows, and analytics solutions for financial crime detection, including money laundering. Skilled in leveraging big data technologies and data query learning models to ensure compliance with Anti-Money Laundering (AML) regulations across categories such as cash/card, checks, and electronic funds transfers for a financial client across Canada and the
United States.
◦ Insight Deck Delivery: Developed comprehensive quarterly insight decks by processing and analyzing large datasets
using PySpark. Collaborated with business stakeholders to identify key metrics and deliver actionable insights, enabling
data-driven decision-making aligned with organizational goals.
◦ Performance Tuning: Optimized data processing performance by implementing best practices in PySpark and ETL
tools. Tuned configurations to handle large data volumes efficiently, reducing processing times and improving overall
system performance.
◦ Collaboration and Reporting: Worked closely with cross-functional teams, including data analysts and business
leaders, to understand requirements and deliver insights aligned with business objectives.
Technologies: PySpark, Hive, Hadoop, Tableau, HDFS
Programming Languages: Python, MySQL, Pandas, NumPy
Expertise: ETL Processes, Data Visualization, Analytics for Fraud Detection
Soft Skills: Team Management, Collaboration, Mentorship, Client Communication