Summary
Overview
Work History
Education
Skills
Timeline
Generic

Prakash B

Big Data Engineer
Bangalore

Summary

Diligent Big Data Engineer with a robust background in designing and implementing scalable big data solutions, demonstrating a strong track record of optimizing data pipelines to enhance processing efficiency. Expertise in leveraging Apache Hadoop and Spark facilitates advanced data analysis and real-time processing capabilities. Committed to driving innovation through data-driven insights and solutions, consistently delivering high-quality results in fast-paced environments. A proactive approach to problem-solving enables effective tackling of complex challenges and contributes significantly to team objectives.

Overview

4
4
years of professional experience

Work History

Big Data Engineer

Consultants to Government and Industry
05.2024 - Current
  • Automated data quality validation processes using Python, reducing manual checks by 30%.
  • Conducted Spark job performance tuning, achieving 20–25% faster execution of critical workloads.
  • Collaborated with stakeholders on solution design, production support, and deployment.
  • Ensured system reliability by troubleshooting production issues and optimizing cluster usage.
  • Automated routine tasks through scripting languages, reducing manual effort and human error risks.

Big Data / Hadoop Developer

Tata Consultancy Services Ltd
04.2021 - 03.2024
  • Delivered data pipelines for PNC Bank (US) to process high-volume financial data across multiple sources.
  • Developed end-to-end ETL workflows using Spark, Hive, Sqoop, Oozie, Impala, improving reporting efficiency.
  • Migrated and integrated data from RDBMS systems (MySQL, Oracle, Teradata) into HDFS for enterprise-wide use.
  • Built Hive & Impala models enabling faster queries and supporting regulatory compliance reporting.
  • Partnered with business analysts and data teams to design optimized data warehousing solutions.
  • Maximized resource utilization within a multi-node cluster environment through effective job scheduling using tools like Oozie.
  • Implemented data partitioning strategies to optimize storage usage and query performance.

Education

Associate of Arts - Electronics & Communication

MVJ College of Engineering
Bengaluru, India
04.2001 -

Skills

Spark development

PySpark

Big data analytics

Hive query optimization

Proficient in Hadoop

Python

SQL

Azure Databricks

Linux command line proficiency

Service Now

Git

Apache Kafka

Experienced with Impala

Cloudera platform experience

Timeline

Big Data Engineer

Consultants to Government and Industry
05.2024 - Current

Big Data / Hadoop Developer

Tata Consultancy Services Ltd
04.2021 - 03.2024

Associate of Arts - Electronics & Communication

MVJ College of Engineering
04.2001 -
Prakash BBig Data Engineer