Summary
Overview
Work History
Education
Skills
Certification
HIGHLIGHTS & ACHIEVEMENTS
Projects
Timeline
Generic

Vikas Singh

Data Solution Architect
Delhi

Summary

Senior Data Engineer & Solution Architect with 7+ years of experience designing large-scale data platforms, real-time analytics systems, and cloud-native ETL/ELT pipelines. Proven expertise in AWS (S3, Glue, Athena, Lambda, Kinesis, Step Functions, EMR) and Databricks, with strong command over PySpark, SQL, Delta Lake, and Medallion Architecture. Skilled at building reliable, high-performance data pipelines, implementing data quality frameworks, and optimizing big data workloads. Known for strong team collaboration, stakeholder management, and delivering impactful, scalable solutions that meet evolving business needs. Adaptable, dependable, and experienced in driving end-to-end architecture for enterprise-grade data ecosystems.

Overview

8
8
years of professional experience
4
4
Certifications

Work History

Data Engineering Lead & Solution Architect

Polestar Analytics Pvt. Ltd.
04.2018 - 11.2025

Education

B.Tech - Electronics & Communication

Maharaj Surajmal Institute of Technology
Delhi
04.2001 -

Executive Programme - undefined

IIM Calcutta

Skills

AWS Services: S3, Glue, Athena, EMR, Lambda, Kinesis, DynamoDB, RDS

Certification

Databricks Data Engineer Professional

HIGHLIGHTS & ACHIEVEMENTS

  • Delivered 7+ major data engineering programs with zero escalations
  • Designed DQ frameworks reducing QA effort by 40%
  • Saved major costs for clients by improving DE processes and SLA

Projects

Leading Beverages Company – AWS Data Hub (Completely AWS)

Role: Data Hub Architecture Lead | Stack: AWS S3, Glue ETL, Athena, Lambda, MSK Kafka, Databricks on AWS

• Architected a 100% AWS-native Data Hub on S3 with Delta Lake tables.

• Built ingestion using MSK Kafka + Lambda + Auto Loader.

• Implemented DQ checks, reconciliation, anomaly detection.

• Enabled analytics via Athena + QuickSight.


Leading Fashion & Sports Manufacturer – AWS ETL Modernization (Fully AWS)

Role: AWS Data Engineering Lead | Stack: AWS Glue, S3, Athena, Step Functions

• Migrated SSIS workloads to AWS Glue PySpark ETL.

• Built incremental ingestion and schema evolution workflows.

• Enabled SQL analytics through Athena.

• Implemented CI/CD via CodePipeline & CodeBuild.


Leading Electrical Appliances Manufacturer | Azure Data Migration Lead |

Azure Databricks (1 Year)

Client Name: Havells India

• Migrated legacy SSIS-based ETL workflows to a modern Lakehouse architecture

using Databricks and ADF.

• Ingested data from SAP RFCs and SQL Server into Delta Lake with incremental load

and schema auto-merge.

• Designed pipelines for transformation and exposed curated data to Synapse

Analytics and Power BI.

• Acted as end-to-end solution architect and developer, driving CI/CD implementation

and performance optimization.


Petronas Lubricants International | Sales Analytics Engineer | Databricks &

AWS (1 Year)

• Designed data pipelines to derive base oil and lubricant sales metrics supporting

EBITDA and margin analysis.

• Managed Medallion architecture (Bronze/Silver/Gold) on S3 for CSV and XLSX

datasets.

• Built CI/CD deployment pipelines with Azure DevOps and Databricks Asset Bundles

for multi-environment release.

• Collaborated with Outsystems team to populate PostgreSQL for UI display and

reporting.


Global Payroll Company | Workforce Predictive Analytics (ML Project) | Azure

Databricks (1 Year)

Client Name: Papaya Global

• Developed ML pipelines to predict employee attrition and detect payroll anomalies

across 40+ countries.

• Engineered features from HR and payroll datasets, achieving ~18% improvement in

model accuracy.• Trained classification models (Random Forest, XGBoost) using MLflow; automated

retraining via ADF.

• Published predictions to Synapse dashboards for HR insights and workforce

planning.


Medical Equipment Manufacturer | Sales & Operations| Data Engineer | Azure

+ Qlik (1+ Year)

Client Name: Fortec Medical

• Architected end-to-end data warehouse integrating Flat Files, SQL Server, and JSON

feeds.

• Defined Logical Data Models and developed stored procedures for ETL processing

and KPI modeling.

• Led client communication and data validation sessions ensuring on-time

deliverables.


IT&Services | HR Data Migration Engineer | Azure (1 Year)

Client Name: HCL

• Migrated legacy on-prem systems to Azure Data Platform using ADF and SQL stored

procedures.

• Designed incremental load frameworks and implemented automated alerts for

pipeline health.

• Acted as client liaison for status tracking, risk management, and issue resolution.

Timeline

Data Engineering Lead & Solution Architect

Polestar Analytics Pvt. Ltd.
04.2018 - 11.2025

B.Tech - Electronics & Communication

Maharaj Surajmal Institute of Technology
04.2001 -

Executive Programme - undefined

IIM Calcutta
Vikas SinghData Solution Architect