BHARGAVI MANEPALLI

Bangalore

Summary

Experienced IT professional with 7 years in the industry, including 3+ years specializing in designing, building, and optimizing scalable data pipelines using Databricks, PySpark, SQL, and cloud platforms. Proficient in monitoring DAGs, troubleshooting pipeline failures, and ensuring data integrity through validation and testing in Databricks and Snowflake. Has 1 year of experience developing machine learning models for regression, classification, and clustering, and 4+ years in incident resolution and defect management. Skilled in automation with UiPath to improve operational efficiency. A collaborative team player with a strong Agile mindset, dedicated to delivering high-quality data solutions.

Overview

13 years of professional experience
1 Certification

Work History

Data Engineer

GSPANN Technologies (Nike – Client)
11.2021 - Current
  • Designed and developed ETL pipelines to ingest data from Oracle, Teradata, Snowflake, and Databricks Delta Lake into AWS S3.
  • Worked with system architects and design analysts to understand business and industry requirements.
  • Utilized Databricks for compute and Apache Airflow for orchestration, ensuring reliable and automated workflows.
  • Optimized AWS S3 and EMR workloads, resulting in significant cost savings.
  • Fine-tuned query performance and optimized database structures, resulting in a 70% improvement in ingestion and reporting execution times.
  • Implemented data quality frameworks, reducing pre-production errors by 80%.
  • Migrated existing code and Airflow pipelines to Databricks workspace, enabling workflow execution within Databricks.
  • Leveraged the Databricks AI tool Genie for in-depth data insights and visualization through simple prompts.
  • Developed Databricks dashboards for monitoring data quality and analyzing trends over time.
  • Worked on Beat and Spark Expectation Framework testing prior to go-live, ensuring requirements were met and advising on code standards. Assisted with Spark Expectation tool configuration and documented frequent failures to support the development team and end users.
  • Provided technical guidance and mentorship to junior team members, fostering a collaborative learning environment.
  • Created comprehensive documentation and Lucidchart diagrams in Confluence, and conducted team demos for future reference.
  • Automated data quality checks using the Spark Expectation framework in both Airflow and Databricks workflows, deployed in the production environment to reduce manual validations.

Systems Engineer

Tata Consultancy Services (SFI – Client)
03.2012 - 01.2016
  • Handled critical incidents and defect management, ensuring resolution within SLA.
  • Performed root cause analysis and coordinated with cross-functional teams for permanent fixes.
  • Ensured smooth data operations and received TCS Gems Award for critical issue resolution.

Education

Post Graduate Program - Data Science

Purdue University
08.2021

B.E - Electrical & Electronics Engineering

Sir CRR College of Engineering
01.2011

Skills

  • Programming: Python (Pandas, NumPy, PySpark), SQL
  • Big Data & ETL: Apache Spark, Hadoop, Airflow
  • Databases: MySQL, Snowflake
  • Cloud: Databricks, AWS (S3, EC2, Athena)
  • Visualization: Tableau, Databricks Notebooks and Databricks Dashboards
  • Other Tools: Git, MobaXterm, Jira, Confluence, PyCharm

Accomplishments

  • Awarded "Pat on the Back" certificate for outstanding performance on client projects.
  • Received "Ace Alliance" recognition for close collaboration and achieving team goals.
  • Honored with client appreciation and "HiFi" for timely deliverables.
  • Awarded "TCS Gems" for resolving critical issues within SLA.

Certification

  • Microsoft Certified: Azure Data Fundamentals (DP-900)
  • Microsoft Certified: Azure AI Fundamentals (AI-900)
  • Microsoft Certified: Azure Fundamentals (AZ-900)
  • Microsoft Certified: Azure Data Scientist Associate (DP-100)
  • AWS Technical Essentials – Simplilearn
  • Python 101 for Data Science – IBM

Timeline

Data Engineer

GSPANN Technologies (Nike – Client)
11.2021 - Current

Systems Engineer

Tata Consultancy Services (SFI – Client)
03.2012 - 01.2016

B.E - Electrical & Electronics Engineering

Sir CRR College of Engineering

Post Graduate Program - Data Science

Purdue University

Career Break & Upskilling

  • 2016 – 2021
  • Completed Post Graduate Program in Data Science, Purdue University (2021).
  • Upskilled in Cloud Data Platforms (AWS, Azure), Big Data (PySpark, Databricks), and Machine Learning techniques.
  • Worked on personal projects and advanced training to transition into Data Engineering.