Seasoned Data Engineer with over 15 years of experience, including 6+ years in fintech and investment banking, and prior exposure to the telecom domain. Proven expertise in building scalable Big Data solutions, real-time data processing pipelines, and ETL frameworks using Python, PySpark, Hadoop, Hive, HBase, and Kafka. Skilled in data lake architecture, streaming analytics, and machine learning pipeline integration for use cases such as fraud detection, trade surveillance, and regulatory reporting. Adept at handling large-scale structured and unstructured data, optimizing query performance, and ensuring data quality, lineage, and compliance across enterprise systems. Certified Scrum Master (CSM) experienced in Agile/Scrum methodologies, with a strong focus on cross-functional collaboration and delivery excellence.
Working as a Data Engineer in the Equity Division, delivering scalable Big Data and Machine Learning solutions for fraud detection and trade surveillance.
1. Cognitive Digital Intelligence
Digitized B2B financial documents (PDFs, invoices) using OCR, Python, and machine learning. Implemented anomaly detection using Random Forest models.
Tech Stack: Python, ML, OCR, Shell Scripting, PDF parsing, Random Forest
2. CNEM – Customer Network Experience Management
Analyzed telecom complaints (call drops, speed issues) using PySpark, Hive, HBase, and PhoenixDB. Delivered actionable insights to network engineers to resolve cell-level issues.
Tech Stack: Python, PySpark, Hive, HBase, PhoenixDB, MySQL, Flask, Shell Scripting
Project: Vodafone Essar – Oracle CRM and Amdocs Transformation (AMS Support and Rollout)
Project: Vodafone CRM Operations – Application Support
Apache Hadoop
Apache Spark / PySpark
Data Engineering
Machine Learning (ML)
Hive / HBase / PhoenixDB
Big Data Analytics
SQL / PL/SQL Development
Agile and Scrum Methodologies (Certified Scrum Master)
Java and Shell Scripting