
Senior Data Engineer & Solution Architect with 7+ years of experience designing large-scale data platforms, real-time analytics systems, and cloud-native ETL/ELT pipelines. Proven expertise in AWS (S3, Glue, Athena, Lambda, Kinesis, Step Functions, EMR) and Databricks, with a strong command of PySpark, SQL, Delta Lake, and Medallion Architecture. Skilled at building reliable, high-performance data pipelines, implementing data quality frameworks, and optimizing big data workloads. Known for strong team collaboration, stakeholder management, and delivering scalable solutions that meet evolving business needs. Experienced in driving end-to-end architecture for enterprise-grade data ecosystems.
AWS Services: S3, Glue, Athena, EMR, Lambda, Kinesis, DynamoDB, RDS
Leading Beverages Company – AWS Data Hub (Completely AWS)
Role: Data Hub Architecture Lead | Stack: AWS S3, Glue ETL, Athena, Lambda, MSK Kafka, Databricks on AWS
• Architected a 100% AWS-native Data Hub on S3 with Delta Lake tables.
• Built ingestion using MSK Kafka + Lambda + Auto Loader.
• Implemented data quality (DQ) checks, reconciliation, and anomaly detection.
• Enabled analytics via Athena + QuickSight.
Leading Fashion & Sports Manufacturer – AWS ETL Modernization (Fully AWS)
Role: AWS Data Engineering Lead | Stack: AWS Glue, S3, Athena, Step Functions
• Migrated SSIS workloads to AWS Glue PySpark ETL.
• Built incremental ingestion and schema evolution workflows.
• Enabled SQL analytics through Athena.
• Implemented CI/CD via CodePipeline & CodeBuild.
Leading Electrical Appliances Manufacturer | Azure Data Migration Lead |
Azure Databricks (1 Year)
Client Name: Havells India
• Migrated legacy SSIS-based ETL workflows to a modern Lakehouse architecture
using Databricks and ADF.
• Ingested data from SAP RFCs and SQL Server into Delta Lake with incremental load
and schema auto-merge.
• Designed pipelines for transformation and exposed curated data to Synapse
Analytics and Power BI.
• Acted as end-to-end solution architect and developer, driving CI/CD implementation
and performance optimization.
Petronas Lubricants International | Sales Analytics Engineer | Databricks &
AWS (1 Year)
• Designed data pipelines to derive base oil and lubricant sales metrics supporting
EBITDA and margin analysis.
• Managed Medallion architecture (Bronze/Silver/Gold) on S3 for CSV and XLSX
datasets.
• Built CI/CD deployment pipelines with Azure DevOps and Databricks Asset Bundles
for multi-environment release.
• Collaborated with the OutSystems team to populate PostgreSQL for UI display and
reporting.
Global Payroll Company | Workforce Predictive Analytics (ML Project) | Azure
Databricks (1 Year)
Client Name: Papaya Global
• Developed ML pipelines to predict employee attrition and detect payroll anomalies
across 40+ countries.
• Engineered features from HR and payroll datasets, achieving ~18% improvement in
model accuracy.
• Trained classification models (Random Forest, XGBoost) using MLflow; automated
retraining via ADF.
• Published predictions to Synapse dashboards for HR insights and workforce
planning.
Medical Equipment Manufacturer | Sales & Operations | Data Engineer | Azure
+ Qlik (1+ Year)
Client Name: Fortec Medical
• Architected an end-to-end data warehouse integrating flat files, SQL Server, and JSON
feeds.
• Defined Logical Data Models and developed stored procedures for ETL processing
and KPI modeling.
• Led client communication and data validation sessions, ensuring on-time
deliverables.
IT & Services | HR Data Migration Engineer | Azure (1 Year)
Client Name: HCL
• Migrated legacy on-prem systems to Azure Data Platform using ADF and SQL stored
procedures.
• Designed incremental load frameworks and implemented automated alerts for
pipeline health.
• Acted as client liaison for status tracking, risk management, and issue resolution.