AI
1. Designed and implemented data ingestion pipelines using Azure Data Factory (ADF) to ingest data from multiple ERP and file-based sources such as Oracle, SQL Server, SFTP, and flat files into Azure Data Lake Storage (ADLS).
2. Built and maintained Bronze, Silver, and Gold layer architectures in Azure Databricks using PySpark and SQL, ensuring scalability, modularity, and performance optimization for downstream analytics.
3. Applied complex data transformation logic, implemented business rules, and performed joins, aggregations, and validations using Databricks notebooks and Delta Lake architecture.
4. Ensured data quality, lineage, and compliance by integrating validations, logging, and monitoring across ADF and Databricks pipelines; followed best practices for governance and security.
5. Collaborated with business and BI teams to expose curated Gold layer data for Power BI dashboards, reporting solutions, and Azure Synapse Analytics for enterprise-wide consumption.
1. Built big data pipelines to process monthly payments for retailers and distributors selling HP's Instant Ink subscription-based service globally, using customer database insights to generate payment reports.
2. Migrated existing NiFi pipelines to Databricks ETL workflows as part of optimization efforts, streamlining processes and developing reusable data pipeline components to support the team’s framework evolution.
3. Contributed to overall data architecture design, applying data modeling, storage, and retrieval expertise for efficient and scalable solutions.
4. Created data products for business analysts and data scientists, developing automated jobs to support their analysis and collaborating with stakeholders to define requirements and deliver insights.
5. Guided and mentored junior data engineers, fostering knowledge sharing and supporting professional development within the team.
6. Built data pipelines and implemented efficient ETL/ELT proce
PySpark
AI
Chess
Swimming