Summary
Overview
Work History
Education
Projects
Tech Stacks
Timeline
Generic

SHAKTI PRASAD TRIPATHY

NOIDA

Summary

Azure Data Engineer with 2.5 years of experience in building and optimizing data pipelines using Azure Data Factory, Databricks (PySpark), and Synapse Analytics. Skilled in ETL, data modeling, performance tuning, and real-time analytics. Expertise in Delta Lake, SQL, cloud data security (RBAC, Key Vault). Passionate about scalable, cost-efficient data solutions and analytics-driven businesses.

Overview

8
8
years of professional experience

Work History

Data Engineer

Dalmia Bharat Group
08.2021 - Current
  • Batch Data Ingestion & ETL Development: Built Azure Data Factory (ADF) pipelines to extract and ingest structured & semi-structured data from SAP HANA, PLMS, Freight Tiger, and other systems into Azure Data Lake Storage Gen2 (ADLS)
  • Data Transformation & Processing: Developed PySpark-based transformations on Azure Databricks and optimized processing using Delta Lake for high-volume batch processing
  • Data Warehousing & Modeling: Designed fact and dimension tables in Azure Synapse Analytics, implementing partitioning, indexing, and materialized views for optimal query performance
  • Orchestration & Automation: Configured Azure Data Factory Triggers, Logic Apps, and Event Grid to automate batch workflows and ensure real-time monitoring
  • Data Quality & Governance: Integrated Azure Purview and Data Quality Services to manage metadata, ensure data consistency, and validate ingestion processes
  • Security & Compliance: Implemented RBAC (Role-Based Access Control), Private Endpoints, Managed Identities, and Azure Key Vault encryption for SOC 2 and GDPR compliance
  • Performance Optimization & Cost Efficiency: Optimized Azure Synapse SQL Pool performance using distribution strategies, auto-scaling, and workload isolation, reducing query execution time by 40%
  • Scalability & Reliability: Ensured system reliability with Azure Monitor, Log Analytics, and Synapse Query Store for proactive performance tuning and alerting
  • Tools used: Azure Data Factory (ADF), Azure Databricks (PySpark), Azure Synapse Analytics, Delta Lake, Azure Key Vault

Support Engineer

Thyssenkrupp Industries India Limited (TKIL)
Pune
07.2021 - 12.2023
  • Monitor, troubleshoot, and maintain web applications, ensuring seamless functionality and minimal downtime
  • Manage web servers, databases, and cloud infrastructure while optimizing performance and security
  • Provide user support, document issues, and assist with application updates and deployments
  • Collaborate with developers and IT teams to integrate systems enhance overall efficiency
  • Tools used: MySQL, Excel, MS Word, Linux

Electrical Engineer

CREW Pvt. LTD (Adani Power Mundra Limited)
12.2016 - 07.2021
  • Managed the operation, maintenance, and troubleshooting of electrical systems in the Coal Handling Plant (CHP), ensuring seamless functionality
  • Performed preventive and corrective maintenance on electrical equipment, minimizing downtime and improving system reliability
  • Conducted calibration, testing, and safety compliance checks, adhering to industry regulations and standards.
  • Collaborated with cross-functional teams to implement upgrades, maintain accurate records, and optimize plant operations.
  • Tools used: MS Excel, MS Word, SAP HANA, E-CAD

Education

Bachelor of Engineering - Electrical and Electronics Engineering

ROLAND INSTITUTE OF TECHNOLOGY
BERHAMPUR
07.2014

Projects

Enterprise-Scale Batch Data Processing 

 Designed and implemented a scalable batch data processing pipeline on Microsoft Azure for a fintech/logistics enterprise, ensuring efficient data ingestion, transformation, and reporting from multiple sources like SAP HANA, PLMS, Freight Tiger, Mobility, HR, and SDMS. Built a cost-efficient, high-performance architecture using Azure Data Factory, Azure Synapse Analytics, and Power BI to enable real-time analytics and business intelligence., 50% faster ETL processing with optimized pipelines., 40% improved query performance using indexing & partitioning., 99.9% availability with proactive monitoring & auto-scaling.

Tech Stacks

Azure Data Lake Storage Gen2,Delta Lake, Azure Data Factory (ADF), Azure Databricks (PySpark),Azure Synapse Analytics , Fact & Dimension Tables, ADF Triggers, Logic Apps, Event Grid, Azure Active Directory (AAD), Key Vault, Azure Monitor, Log Analytics, Application Insights, 

Timeline

Data Engineer

Dalmia Bharat Group
08.2021 - Current

Support Engineer

Thyssenkrupp Industries India Limited (TKIL)
07.2021 - 12.2023

Electrical Engineer

CREW Pvt. LTD (Adani Power Mundra Limited)
12.2016 - 07.2021

Bachelor of Engineering - Electrical and Electronics Engineering

ROLAND INSTITUTE OF TECHNOLOGY
SHAKTI PRASAD TRIPATHY