Rajesh Mandapati

Data Engineer

Bengaluru, KA

Work Preference

Job Search Status

Open to work

Desired start date: Open to discussion

Desired Job Title

Data Engineer, Machine Learning Engineer, AI Developer

Work Type

Full Time

Location Preference

On-Site, Remote, Hybrid

Open to relocation: Yes

Salary Range

45000/yr - 200000/yr

Summary

Data Engineer with 3 years of experience building scalable ETL and data processing pipelines across GCP and Hadoop environments. Strong in SQL, Python, BigQuery, Spark, and microservice-based workflow orchestration, with hands-on experience in privacy-compliant data workflows for PII scanning, masking, and deletion. Proven impact in reducing Spark runtime by 20%+, lowering scanning and deletion costs by $5,000+ per month, and enabling self-service onboarding across 100+ datasets. Experienced in FastAPI-based microservices, event-driven architectures, CI/CD-supported releases, data quality, production support, and stakeholder collaboration.

Work History

Data Engineer

Lowe’s India

06.2025 - Current

Redesigned scanning and deletion pipelines by introducing a lookup-based delete strategy aligned with stakeholder workflows, reducing costs by $5,000+/month.
Optimized Spark jobs using tier-based configurations, reducing runtime by 20%+.
Designed a multi-factor scoring model using weighted parameters (SLA, request state, submission date) to dynamically prioritize DSR requests, improving processing efficiency by 40%.
Led the migration of the privacy compliance platform from on-prem Hadoop to GCP (Dataproc Serverless, BigQuery, GCS).
Designed scalable BigQuery data models supporting PII workflows across 100+ datasets.
Built an event-driven architecture using Kafka producers and consumers to handle asynchronous request processing.
Developed microservice-based ETL workflows for PII scanning, masking, and deletion across distributed Hadoop and GCP environments.
Built configuration and computation services backed by MongoDB to manage datasets, PII columns, Spark configs, and the core computation logic.
Owned end-to-end production releases including deployment validation and pipeline testing, ensuring zero SLA breaches.
Took ownership of sprint execution and JIRA workflow management during the lead’s absence, ensuring on-time delivery of planned work.
Acted as the primary data engineer in a lean team, handling critical deliveries after team transitions.
Mentored junior engineers through knowledge transfer (KT) sessions and onboarding support.
Led bi-annual audit processes, ensuring compliance through documentation and validation.
Resolved high-priority production issues, ensuring system stability and business continuity.
Enabled platform adoption by onboarding and guiding 100+ data custodians on KAIZEN workflows, including scanning, masking, and deletion processes.
Supported containerized deployment and release validation of microservices using Docker, YAML-based deployment configs, internal CI/CD tooling, and Kubernetes pod monitoring.

Associate Data Engineer

Lowe’s India

05.2023 - 05.2025

Automated Airflow DAG workflows, reducing manual effort by 60% and freeing up roughly 2 FTEs of bandwidth.
Built end-to-end automation for request processing (task grouping, preprocessing, scanning, deletion).
Implemented data masking solutions aligned with privacy policies, avoiding hard deletions.
Optimized Spark configurations across datasets, improving processing efficiency.

Programmer Analyst Trainee Intern

Cognizant

10.2022 - 12.2022

Trained in Python and AWS fundamentals (EC2, S3, Lambda, SNS).

Projects

LLM-based Data Operation Chatbot

Built a hybrid chatbot using a RAG architecture with a vector database (Pinecone).
Enabled natural language querying for operational insights (DSR status, failures, cost metrics).
Implemented metadata-based filtering to improve retrieval relevance.
Integrated SQL and MongoDB querying with agent-based routing.

Education

B.Tech -

JNTUK University College of Engineering

Vizianagaram, India

04.2001 -

Skills

Languages: Python, SQL, Java (Intermediate)

Cloud & Big Data: GCP (BigQuery, GCS, Dataproc Serverless), Hadoop, Hive, Apache Spark, AWS (EC2, S3, Lambda, SNS)

Databases: BigQuery, PostgreSQL, MongoDB, Pinecone, Weaviate, FAISS

Data Engineering: ETL Pipelines, Data Processing, Data Modeling, Dataframe-Based Transformations, Microservices

API & Frameworks: FastAPI, Apache Airflow

Messaging & Distributed Systems: Kafka

DevOps & Deployment: Docker, Kubernetes, Git, CI/CD, YAML-based deployment configuration, Postman, JIRA

Analytics & Visualization: Pandas, NumPy, Tableau, Power BI, Matplotlib, seaborn, scikit-learn, PyTorch

AI/ML: Machine learning, Deep learning, NLP, LLMs, RAG Pipelines, LangChain, LangGraph, Vector Databases

Accomplishments

Team Excellence Award
Spot Award (3 times)

Certification

Ekeeda - School of Data Science

Timeline

Data Engineer

Lowe’s India

06.2025 - Current

Associate Data Engineer

Lowe’s India

05.2023 - 05.2025

Ekeeda - School of Data Science

02.2023

Programmer Analyst Trainee Intern

Cognizant

10.2022 - 12.2022

Data Analytics with Python

04.2022

B.Tech -

JNTUK University College of Engineering

04.2001 -

Awards & Certifications

Team Excellence Award, Spot Award (3 times), Data Science training (Ekeeda School of Data Science), Data Analytics with Python (NPTEL), B and C certificates (NCC)

Dashboards

Customer Analysis Dashboard | Tableau

Built a dashboard to analyze customer segments, business trends, and performance metrics.
Link: dashboard link

HR Analytics Dashboard | Power BI

Built a dashboard to analyze attrition, hiring, and workforce trends.
Link: dashboard link