Venkat Phani Geeth Bodaballa

Hyderabad

Summary

Innovative Senior Data Engineer with 6+ years of experience in building scalable data pipelines, cloud-based data solutions, and AI-driven automation. Passionate about leveraging AI and cloud technologies to optimize data workflows, enhance efficiency, and drive business insights.

  • AI-Powered Data Engineering Assistant Lead: Leading the development of an intelligent AI agent that automates on-prem to cloud migrations, schema discovery, query execution, and real-time AI-driven data assistance.
  • Expertise in Google Cloud Platform (GCP), BigQuery, Apache Airflow, API Development, and AI-based query optimization.
  • Proficient in Java, Python, SQL, ETL, and Google Cloud Platform services such as Dataflow, Cloud Composer, BigQuery, Cloud Build, Pub/Sub, and GCS.
  • Skilled in LLMs, Retrieval-Augmented Generation (RAG), Redis caching, and real-time logging to enhance data engineering processes.
  • Proven ability to design high-performance ETL pipelines, automate workflows, and optimize database operations in cloud environments.

Overview

7 years of professional experience
1 Certification

Work History

Senior Data Engineer

AIonOS
09.2024 - Current
  • Designed and developed a framework for creating user cohorts in GCP BigQuery based on business requirements.
  • Built APIs using Cloud Run, Cloud Functions, and Python to retrieve customer-specific cohort data.
  • Developed an interactive interface that lets clients query the API for seamless data retrieval.

Senior Data Engineer

MassMutual
09.2023 - 09.2024
  • Spearheaded the design of custom frameworks for optimal data processing, reducing the effort of building new data pipelines with Apache Airflow and SQL.
  • Implemented automated scripts for impact analysis across SQL scripts and data marts using Airflow and Python, resolving user requests and enhancing SQL queries.
  • Designed and developed a Docker image to reduce preprocessing time and production errors caused by Python library mismatches.

GCP Data Engineer

Fractal Analytics
08.2021 - 09.2023
  • Designed real-time frameworks to migrate on-prem IBM Streams to Google Cloud, utilizing Apache Beam Python and Java with Dataflow as a runner
  • Created a Java framework for building Dataflow pipelines, incorporating design patterns such as Factory and Singleton, along with reflection and interface implementations, to keep the framework cloud-agnostic
  • Developed custom Dataflow Flex templates for enhanced data processing and management
  • Implemented automated mail alerts for deprecated methods in Java projects, enhancing code quality and efficiency
  • Led the successful deployment of Dataflow pipelines using Jenkins for CI/CD, improving pipeline management and efficiency

Programmer Analyst

Cognizant Technology Solutions
04.2018 - 08.2021
  • Worked in an Agile environment, clarifying requirements, estimating effort, and enhancing existing code
  • Upgraded Spotfire dashboards with new features for improved data visualization and analysis

Education

Bachelor of Technology

GITAM University
01.2017

Skills

  • Cloud Solutions Architecture
  • Data Transfer Management
  • Real-Time Data Processing
  • Data Pipeline Architecture
  • Agile Methodology Implementation
  • Automation & Framework Design
  • Google Cloud Platform, Dataflow, Cloud Composer, BigQuery
  • SQL, Python, Java, Shell Scripting
  • Jenkins, Docker
  • Apache Beam, Apache Airflow, ETL
  • BI Tools (TIBCO Spotfire, Power BI, Google Data Studio)
  • AI-Driven Process Optimization

Certification

  • Google Cloud Certified Professional Data Engineer
  • Google Cloud Certified Associate Cloud Engineer
  • Fractal Certified Data Engineer
  • Google Data Analytics Professional Certificate
  • Databricks Certified Data Engineer Professional

Languages

English
Telugu
Hindi

Personal Information

Date of Birth: 11/01/95

Additional Information

Personal Project:

AI-powered Data Engineering Assistant (Ongoing)

  • Developing an AI-driven data assistant to automate on-prem to cloud migrations, schema discovery, and query execution.
  • Utilizing LLMs (Google Gemini) for smart schema mapping and query generation.
  • Implementing Retrieval-Augmented Generation (RAG) to improve query accuracy and ensure contextual relevance.
  • Designing an interactive chatbot-style UI for real-time data interaction and query assistance.
  • Enabling seamless integration with PostgreSQL, BigQuery, and other databases.
  • Implementing modular AI agents for database discovery, query optimization, and execution automation.
  • Incorporating Redis caching to enhance performance and reduce redundant query generation.
  • Establishing real-time logging mechanisms to track query execution and debugging.
  • Automating deployment and updates using Docker and CI/CD pipelines.
