Summary
Overview
Work History
Education
Skills
Certification
Tools & Technologies
Timeline
Generic

Gopikrishna Nallappareddi

Bangalore

Summary

Over 16 years of IT experience, including 9 years as a Big Data Engineer with a focus on Hadoop and Spark application development. Skilled in the Hadoop ecosystem, leveraging HDFS, Hive, Sqoop, and Spark for optimal data processing. Proven track record in performance management and memory tuning, successfully integrating Spark with Hive to manage datasets up to 40 TB. Developed comprehensive data pipelines and implemented optimization strategies to significantly enhance processing efficiency.

Overview

19
19
years of professional experience
1
1
Certification

Work History

PySpark Developer

Tamanna Solutions pvt ltd
Hyderabad
01.2024 - Current
  • Architected real-time data processing pipeline using PySpark Structured Streaming and Delta Lake, achieving under 2 minutes data latency.
  • Spearheaded migration from legacy Hadoop infrastructure to cloud-native Databricks Lakehouse, decreasing costs by 42% and enhancing job reliability to 99.7%.
  • Led cross-functional team of 8 engineers in deploying ML-powered anomaly detection on 15TB of transaction data, uncovering $3.2M in potential fraud during first quarter.

Spark/Big Data Engineer

BDO, Capgemini
Bengaluru
07.2021 - 12.2023
  • Refactored PySpark code and implemented dynamic partition pruning, reducing daily processing time by 68% and saving over 230 compute hours monthly.
  • Deployed a metadata-driven framework for data quality validation, automatically detecting schema drift and integrity issues across 200 datasets.
  • Collaborated with data scientists to productionize ML models, decreasing deployment time from weeks to two days while achieving 99.5% prediction accuracy.

Spark/Big Data Consultant

TravisMathew, Ameri100
Bengaluru
08.2017 - 09.2020
  • Developed reusable PySpark components for data transformation, standardizing code quality across six project teams.
  • Resolved performance bottlenecks in Spark SQL queries, enhancing job completion times by 45%.
  • Contributed to internal PySpark training program, onboarding 12 junior developers and reducing ramp-up time by 40%.

Senior Hadoop Consultant

Citigroup
Tampa, Florida
10.2015 - 07.2017

AML Cards & Optimization is a compliance project handling all credit card transactions, both retail and consumer. The main goal is to detect fraudulent transactions and generate alerts for such transactions over a data set of about 400 GB per month for the USA and Canada alone. The project is divided into two parts.

  • Segmentation (12-month historical data is provided to analysts.)
  • Transaction monitoring (alerts are generated on 12 months, and recurring feed data). This is a rule-based alert generation model.
  • Assisted in Agile adoption within the Big Data & Analytics team.
  • Led daily stand-ups and sprint planning for the development and analytics team.
  • Coordinated across multiple teams to deliver data-driven insights and solutions.
  • Facilitated Agile retrospectives to improve collaboration and remove blockers for development teams.

Hadoop Consultant

HSBC
New York City, New York
09.2014 - 10.2015
  • Developed Big Data Solutions that enabled the business and technology teams to make data-driven decisions..
  • Led Agile adoption within the development team to enhance sprint efficiency.
  • Implemented daily workflow automation for extraction, processing, and analysis of large data sets.

Programmer Analyst

Citigroup, HCL America
Jersey City, NJ
09.2012 - 09.2014

Citi QA UAT Support - This application handles all the trading procedures and takes the data from various source systems. After that, it processes and stores the data in different databases. These systems transmit data to the next level.

  • Handling all the trade-related issues while trades are moving from EDTS/COMET (FO) to ZTS (MO).
  • Sending daily checkouts mail to business users for all the tools that are being used in the trading life cycle.
  • Automation program developed for morning checkouts.
  • Server maintenance.

Technical Lead

HCL Technologies
Bangalore
04.2011 - 08.2012

Citi Equities DMOS Application Support.

  • Involved in the requirement study.
  • Developing components for the Middle Tier as per various business rules.
  • Designing of User interface.
  • Testing of Modules and Code review.
  • Performance Monitoring (CPU, Memory, Paging, Network Latency, HTTP Response)

Project Engineer

DRDO
Hyderabad
03.2006 - 04.2011

The Defence Research and Development Organisation (DRDO) is an agency under the Department of Defence Research and Development in the Ministry of Defence of the Government of India, charged with military research and development, headquartered in Delhi, India.

  • Programming in PL/SQL in both Linux and Windows platforms.
  • Software Configuration activities.

Contributed in maintenance of hardware in Loop Simulation activities. Prepare the documents as per requirements.

Education

B.Tech - Electrical & Electronics Engineering

Sree Vidyanikethan Engineering College
A.Rangampeta, India
04-2003

Skills

  • PySpark and Spark SQL optimization
  • Distributed computing architectures
  • Machine learning deployment in Spark
  • Data pipeline design and ETL automation
  • Real-time stream processing with Kafka
  • Data governance and security in Spark
  • Agile project management
  • Complex problem solving
  • Technical communication and stakeholder management
  • Continuous learning and technology adaptation

Certification

  • Certified Scrum Master (CSM) - Scrum Alliance
  • SAFe 5.1 Certified - Scaled Agile Framework
  • Certified Developer for Apache Hadoop (CCDH410)
  • Oracle Certified Associate (OCA)

Tools & Technologies

  • JIRA, Confluence, Azure DevOps
  • Agile Release Trains (ARTs)
  • SAFe Agile, Kanban
  • CI/CD, Jenkins, Git
  • SQL, Big Data Technologies
  • Tableau, Power BI for Agile Metrics

Timeline

PySpark Developer

Tamanna Solutions pvt ltd
01.2024 - Current

Spark/Big Data Engineer

BDO, Capgemini
07.2021 - 12.2023

Spark/Big Data Consultant

TravisMathew, Ameri100
08.2017 - 09.2020

Senior Hadoop Consultant

Citigroup
10.2015 - 07.2017

Hadoop Consultant

HSBC
09.2014 - 10.2015

Programmer Analyst

Citigroup, HCL America
09.2012 - 09.2014

Technical Lead

HCL Technologies
04.2011 - 08.2012

Project Engineer

DRDO
03.2006 - 04.2011

B.Tech - Electrical & Electronics Engineering

Sree Vidyanikethan Engineering College
Gopikrishna Nallappareddi