VAMSHI KRISHNA BANDA

Azure Data Engineer
Hyderabad

Summary

Results-driven Data Engineer with 4 years of experience designing and optimizing scalable data pipelines using Azure, Databricks, and PySpark. Improved data processing efficiency by 35% and reduced pipeline errors by 60% through automation and Delta Lake optimizations. Skilled in building HIPAA-compliant ETL workflows for healthcare analytics, with a passion for translating raw data into actionable business insights. Certified in Azure (DP-203) and committed to continuous learning in distributed systems and data governance.

Overview

5 years of professional experience
2 Certifications

Work History

Azure Data Engineer

Tech Mahindra (Client: GSK)
05.2025 - 09.2025
  • Built Spark applications for managing complex workflows of extraction, transformation, and aggregation from source systems and stored the processed data into ADLS using Azure Databricks Notebooks.
  • Migrated existing Spark Scala applications to PySpark so that Geo Tracker-related functionality runs more smoothly and is easier to maintain.
  • Handled Databricks code deployments, promoting changes from one branch to another.

Azure Data Engineer

Infosys (Client: Molina Healthcare)
02.2022 - 11.2022
  • Optimized data processing by implementing efficient ETL pipelines and streamlining database design.
  • Migrated 15 legacy SSIS workflows (on-prem SQL Server) to Azure Data Factory with PySpark and U-SQL scripts, reducing Medicaid claims processing time from 8 hours to 35 minutes (22K claims/hour throughput).
  • Eliminated 12,000+ duplicate claims/month using Spark SQL window functions (row_number() over partition by MemberID, ClaimDate), saving $850K/month in erroneous payments.
  • Applied Z-ordering on MemberID and EffectiveDate, reducing MCO reporting query time from 12 minutes to 9 seconds (1.2B rows scanned/day).
  • Implemented PHI masking using PySpark UDFs (e.g., mask_ssn(), hash_email()) to comply with HIPAA audits, redacting 200+ fields across 15 clinical datasets.
  • Integrated with CMS Blue Button 2.0 API using ADF REST connectors to validate 4M+ Medicare member records against federal benchmarks.
  • Collaborated on ETL tasks, maintaining data integrity and verifying pipeline stability.
  • Developed and optimized Azure data pipelines to enhance data processing efficiency, improving overall data accessibility.
  • Collaborated with cross-functional teams to implement data integration solutions, ensuring compliance with healthcare regulations and standards.
  • Designed and maintained SQL databases, facilitating accurate data retrieval and reporting for healthcare analytics.
  • Automated data workflows using Azure Data Factory, significantly reducing manual processing time and improving data accuracy.

Azure Data Engineer

Infosys Technologies (Client: Optum)
01.2021 - 01.2022
  • Automated routine tasks using Python scripts, increasing team productivity and reducing manual errors.
  • Built 10+ ETL pipelines in Azure Data Factory to process 5TB/month of patient data, reducing manual workflow errors by 75% and accelerating daily reporting by 2 hours.
  • Optimized slow-running Spark SQL queries by tuning joins and partitioning strategies, cutting query runtime from 10 minutes to 90 seconds for 1M+ row datasets.
  • Collaborated on a FHIR-compliant data model to track 500K+ patient records, ensuring HIPAA compliance via column-level encryption in Databricks Unity Catalog.
  • Reduced cloud storage costs by 25% by archiving raw files to Cool Tier storage after Delta Lake ingestion.
  • Enabled faster audits for Medicare compliance teams.

Education

B.Tech - Mechanical Engineering

JNTU Anantapur
07.2021

Skills

Cloud & Services: Azure Data Factory (ADF), Azure Databricks, Azure Synapse Analytics, Azure Data Lake Gen2, Azure Blob Storage

Certification

Microsoft Certified: Azure Data Engineer Associate (DP-203)
