Saiteja Bachu

Data Engineer
Warangal

Summary

Data Engineer with ~4 years of experience delivering enterprise-scale data engineering solutions in Legal Compliance, Security, Audit, Fraud Detection, Financial Crime Control, AML, sanctions screening, transaction surveillance, and regulatory reporting domains. Proven expertise in designing scalable ETL/ELT pipelines, PySpark performance optimization, Airflow orchestration, Kafka streaming, S3-based ingestion frameworks, and data mart development for executive dashboards. Hands-on experience delivering GenAI-based analytics solutions for regulator reporting workflows. Strong background in SQL, Python, PySpark, workflow automation, and high-volume payment data processing.

Overview

4 years of professional experience
8 Certifications
3 Languages

Work History

Senior Officer

DBS
07.2022 - 08.2023
  • Worked on Tableau-based data visualization solutions for operational and management reporting.
  • Supported foundational ingestion frameworks and built metadata and transformation jobs across multiple data sources.

Analyst

DBS
09.2023 - Current

GenAI Data Engineering Use Case: (2025 – 2026)

  • Built feature mart jobs for GenAI-based analytics workflows to reduce manual effort for analytics users.
  • Integrated data from multiple sources to generate screening-ready datasets for automated narrative/report creation.
  • Enabled analyst review workflows before submission to regulators such as MAS and RBI.
  • Built, tested, deployed, and supported the end-to-end GenAI use case on a containerized CBD platform.
  • Improved reporting turnaround time and reduced analyst toil through automation.

Built Data Marts for Various Use Cases: (2023 – 2025)

Worked on strategic enterprise data solutions supporting Fraud Detection, Board Risk Management Committee, Financial Crime Control Board, Transaction Surveillance Units, and Macro Payments Dashboard.

Key Contributions:

  • Designed and developed scalable ETL/ELT pipelines to process high-volume payment and transaction data from multiple upstream systems.
  • Built enterprise data marts for Macro Payments Dashboard and Board Risk Management Dashboard to support payment-level and transaction-level analytics.
  • Enabled cross-border transaction monitoring frameworks to detect unusual flows of funds.
  • Delivered data solutions supporting Anti-Money Laundering (AML), sanctions compliance, and financial crime surveillance.
  • Expanded customer screening capabilities from the B2B to the B2C customer base, significantly increasing compliance coverage.
  • Optimized PySpark compute jobs and reduced BAU processing runtime from ~2 hours to ~15 minutes through partition tuning, code refactoring, and efficient transformations.
  • Designed and implemented custom Apache Airflow DAGs for special-day/source-delay handling, reducing production failures and operational incidents.
  • Built ingestion pipelines across multiple source types: file-based, API-based, database, and real-time Kafka streaming.
  • Managed varying refresh frequencies including intraday, daily, and scheduled batch loads into AWS S3.
  • Developed Tableau dashboards and analytical reporting solutions for business stakeholders.
  • Supported production incidents, root cause analysis, SLA adherence, and operational stability improvements.

Education

Secondary School Certificate

Tejaswi High School
Warangal
05-2016

Bachelor of Technology - Computer Science Engineering

Kakatiya Institute of Science And Technology (KITSW)
Warangal
04-2026

Intermediate - MPC

SR Junior College
Warangal
04-2018

Skills

Languages: Python, SQL

Big Data: Apache Spark, PySpark

Orchestration: Apache Airflow

Streaming: Apache Kafka

Storage: AWS S3

Visualization: Tableau

Monitoring: Elasticsearch, Kibana, Spark UI

Platforms: Containerized Platforms / CBD

Concepts: SCD2, Incremental Loads, Snapshot Loads, Real-time Pipelines, Data Marts

Tools: Git

Additional Information

Data Engineer, PySpark, Spark, SQL, Airflow, Kafka, AWS S3, ETL, ELT, AML, Sanctions, Fraud Detection, Regulatory Reporting, Tableau, GenAI, Data Mart, Performance Tuning, Transaction Monitoring, Cross Border Payments, Financial Crime, Compliance Analytics.

Interests

Solving data challenges and actively learning the latest tech stack

Playing chess and badminton, and regular workouts

Mentoring juniors in Data Engineering and AI-powered Data

Hosting Tech Meetups, Panel Moderation, Knowledge Sharing Sessions

Certification

Big Data in Financial Sector and Fundamentals

Timeline

Advanced Prompt Engineering Techniques

04-2026

DevOps Foundations: Containers

03-2026

Avaloq Basic and Customisation

12-2025

Microservices Architecture

05-2024

Python Data Essentials

05-2024

LinkedIn Learning: Generative AI

12-2023

Analyst

DBS
09.2023 - Current

Data Management Foundations

08-2022

Big Data in Financial Sector and Fundamentals

07-2022

Senior Officer

DBS
07.2022 - 08.2023
