
Senior Data Engineer with 10+ years of experience building scalable data platforms using Databricks, Snowflake, and Informatica. Skilled in developing PySpark-based data pipelines and Delta Lake architectures, and modernizing legacy ETL systems into cloud-native solutions.
Strong expertise in performance optimization, data modeling, and pipeline orchestration, supporting large-scale analytics in the insurance domain.
Hands-on experience with GenAI, including a RAG-based POC on Databricks for natural language data querying.
Databricks: PySpark, Delta Lake, Workflows, Unity Catalog
Cloud & Warehousing: Snowflake, AWS (S3, Glue)
ETL Tools: Informatica PowerCenter, IICS/CDI, SSIS
Programming: Python, SQL (Advanced), PL/SQL
Data Engineering: Data Pipelines, Medallion Architecture, Data Modeling
Optimization: Query Tuning, Partitioning, Caching
GenAI: RAG, LLM Integration, Vector-Based Retrieval
Databricks Certified Data Engineer Associate
Databricks Certified Data Engineer Associate
Databricks Certified Generative AI Engineer Associate
Cloud Data Integration for PowerCenter Developers – Foundation Certificate
Microsoft Certified: Fabric Analytics Engineer Associate