Summary
Overview
Work History
Skills
Education
Skills
Timeline
Generic
Venkatesh Babu Menda

Venkatesh Babu Menda

Visakhapatnam

Summary

Experienced Azure Data Engineer with 5+ years of expertise in designing and implementing scalable, high-performance data pipelines using Azure Data Factory, Azure Databricks, and Spark (PySpark, Spark SQL). Proficient in building end-to-end ETL/ELT workflows, data lakehouse solutions, and enterprise-grade data platforms using Azure Data Lake, Blob Storage, and Synapse. Strong hands-on skills in SQL, data modeling, validation, and compliance with a focus on data quality and governance (Unity Catalog, Delta Lake). Adept in CI/CD automation via Azure DevOps and Agile delivery practices. Proven ability to collaborate cross-functionally and translate business needs into reliable, secure, and scalable data solutions.

Overview

5
5
years of professional experience

Work History

Azure Data Engineer

Innova Solutions
09.2020 - Current

Project: PVH.

Role: Data Engineer.

Environment: Azure Data Factory, Azure Databricks, PySpark, Spark SQL, Azure Synapse, Log Analytics Workspace, and EventHub.

Duration: Sep 2022 โ€“ Present.

Roles and Responsibilities:

  • Extracted, transformed, and loaded data from source systems to Azure storage services using Azure Data Factory, T-SQL, Spark SQL, and U-SQL in Azure Data Lake Analytics.
  • Ingested data into Azure Data Lake, Azure Storage, Azure SQL, and Azure DW; processed data in Azure Databricks.
  • Developed Spark applications using PySpark and Spark SQL to transform data from multiple file formats for analytical insights.
  • I wrote automation SQL scripts for pipeline orchestration and data validation.
  • Managed streaming data ingestion using Event Hub connection strings in Databricks.
  • Estimated cluster sizing, and monitored and troubleshot Spark Databricks clusters.
  • Applied the Spark DataFrame API for in-session data manipulation.
  • Demonstrated deep knowledge of Spark architecture, including Spark Core, SQL, Streaming, Executors, Tasks, and Deployment modes.
  • Implemented security and data governance policies using Databricks Unity Catalog.
  • Served on a technical committee for wastewater treatment system design initiatives.

Project: Optum.

Role: Associate.

Environment: Azure Data Factory, Azure Databricks, PySpark, Spark SQL, Azure Data Lake, and Azure Blob Storage.

Duration: Sep 2020 โ€“ Aug 2022.

Roles and Responsibilities:

  • Provisioned Hadoop and Spark clusters to support an on-demand data warehouse, and enable data access for data scientists.
  • Built data pipelines and processed data in Azure Databricks using PySpark and Spark SQL.
  • Imported data from MySQL and other systems into Azure Data Lake and Azure Blob Storage.
  • Created tables and performed data validation using Spark SQL in Azure Databricks.
  • Loaded and transformed structured, semi-structured, and unstructured data for advanced analytics.
  • Cleaned and parsed data for ingestion into Azure Databricks environments.
  • Monitored system health, handled warning/failure logs, and optimized job execution.
  • Reviewed application logs within Databricks, and managed storage-level logging.
  • Managed ingestion pipelines from cloud storage (Azure Blob, ADLS) to Databricks.
  • Enabled downstream consumption of refined data by data scientists and analysts.

Skills

Cloud & Azure Ecosystem
  • Azure Data Factory (ADF)
  • Azure Databricks
  • Azure Synapse Analytics
  • Azure Data Lake ADLS Gen2
  • Azure Blob Storage
  • Azure DevOps
  • Azure Key Vault
  • Azure Event Hub
  • Azure Logic Apps
  • Azure Dataflows
  • MS Fabric
  • Delta Live Tables
๐Ÿ”น Big Data Engineering & ETL
  • ETL/ELT Pipelines
  • Databricks & Hadoop
  • Structured Streaming
  • Delta Lake
  • Spark DataFrame API
  • Spark SQL
  • Spark Programming
  • Data Modeling
  • Data Governance (Unity Catalog)
๐Ÿ”น Programming & Scripting
  • PySpark
  • Python for Data Science
  • Data Analysis with Python
  • SQL (T-SQL/PL-SQL)
  • SQL for Data Analysis
๐Ÿ”น Other Technical Skills
  • CI/CD Pipelines
  • Performance Tuning (SQL, ADF, Spark)
  • Log Analytics / Monitoring
  • Agile (Scrum) Delivery

Education

Bachelor of Technology (B.Tech) -

Indian Institute of Technology Madras (IIT Madras)
Chennai, India

Skills

    Cloud & Azure Ecosystem
  • Azure Data Factory (ADF)
  • Azure Databricks
  • Azure Synapse Analytics
  • Azure Data Lake ADLS Gen2
  • Azure Blob Storage
  • Azure DevOps
  • Azure Key Vault
  • Azure Event Hub
  • Azure Logic Apps
  • Azure Dataflows
  • MS Fabric
  • Delta Live Tables
  • ๐Ÿ”น Big Data Engineering & ETL
  • ETL/ELT Pipelines
  • Databricks & Hadoop
  • Structured Streaming
  • Delta Lake
  • Spark DataFrame API
  • Spark SQL
  • Spark Programming
  • Data Modeling
  • Data Governance (Unity Catalog)
  • ๐Ÿ”น Programming & Scripting
  • PySpark
  • Python for Data Science
  • Data Analysis with Python
  • SQL (T-SQL/PL-SQL)
  • SQL for Data Analysis
  • ๐Ÿ”น Other Technical Skills
  • CI/CD Pipelines
  • Performance Tuning (SQL, ADF, Spark)
  • Log Analytics / Monitoring
  • Agile (Scrum) Delivery

Timeline

Azure Data Engineer

Innova Solutions
09.2020 - Current

Bachelor of Technology (B.Tech) -

Indian Institute of Technology Madras (IIT Madras)
Venkatesh Babu Menda