SOUMYA RANJAN DASH

Navi Mumbai, Maharashtra

Summary

Dynamic and results-driven Data Engineer with expertise in designing and implementing scalable, high-performance data pipelines and real-time analytics solutions. Proficient in cloud platforms like Azure, Databricks, and AWS, with hands-on experience in Big Data technologies including Apache Spark, Kafka, and SQL-based systems. Skilled in developing metadata-driven workflows, optimizing ETL processes, and integrating machine learning models for fraud detection and predictive analytics. Strong focus on data security, governance, and cost-efficient architectures, ensuring compliance and operational excellence. Proven ability to collaborate across cross-functional teams, solve complex data challenges, and deliver impactful solutions that enhance business insights and drive growth in high-scale environments.

Overview

3 years of professional experience
1 Certification

Work History

Senior Software Engineer

Eclerx
Navi Mumbai, Maharashtra
04.2022 - Current

Clickstream Analysis and Fraud Detection Pipeline

  • Designed and developed a scalable, metadata-driven data pipeline for clickstream and impression data analysis.
  • Utilized Azure Data Factory (ADF) for orchestrating dynamic workflows and seamless data integration.
  • Implemented real-time stream processing in Azure Databricks using Spark Structured Streaming and ML-based anomaly detection for fraud prevention.
  • Integrated APIs to ingest live clickstream data, third-party impression data, and fraud detection services, ensuring real-time updates.
  • Designed and optimized Azure SQL Database schemas (fact and dimension tables) for efficient querying and data analysis.
  • Secured credentials and sensitive data with Azure Key Vault, and applied row-level security and encryption for compliance.
  • Configured Logic Apps for automated anomaly notifications, system alerts, and stakeholder reporting.
  • Achieved a 95% fraud detection success rate and enhanced click-through rate (CTR) analysis accuracy by 30%.
  • Reduced operational costs by 40% through auto-scaling clusters and optimized resource utilization in Databricks.

Education

B.TECH - Electrical And Electronics Engineering

Silicon Institute of Technology
Bhubaneswar, Odisha
07.2022

Skills

  • PySpark
  • Databricks
  • Microsoft Azure
  • PostgreSQL
  • Python
  • Hadoop
  • Linux
  • Hive
  • Airflow

Certification

  • AWS Certified Solutions Architect – Associate (SAA-C03)
  • Completed a 1-year Cloud-based Data Engineering course from TrendyTech

Project

Airline Data Pipeline Automation and Incremental Processing with CI/CD Integration

  • Designed and implemented an incremental data processing pipeline for airline data using Azure Data Factory (ADF), ensuring seamless data integration between Azure Data Lake Storage (ADLS) and Azure Synapse Analytics for efficient analytics.
  • Established a robust CI/CD process in Azure DevOps, including the setup of repositories, agent pools, and pipelines, enabling automated deployment and minimizing downtime in production.
  • Streamlined production releases by automating the deployment of ARM templates for ADF pipelines, ensuring consistency and reducing manual intervention during updates.
  • Optimized data workflows and system performance by leveraging Logic Apps for orchestration and seamless integration of various Azure services.
  • Collaborated with cross-functional teams to ensure scalability, maintainability, and adherence to best practices in cloud-based data engineering.

Tech Stack: Azure Data Lake Storage (ADLS), Azure Data Factory (ADF), Azure Synapse Analytics, Logic Apps, GitHub, Azure DevOps
