Prabir Sahoo

Bangalore

Summary

Seasoned data engineering professional with over 12 years of experience in designing and implementing data-intensive applications, enterprise-grade data warehouses, and scalable data lake solutions. Expertise includes building robust ETL data pipelines using modern technologies and tools across the Big Data Hadoop ecosystem (Hive, Apache Spark), and leveraging cloud platforms to optimize performance and scalability.

Experience Highlights:

  • Solid understanding of data engineering principles and best practices.
  • In-depth understanding of the Big Data ecosystem and Spark architecture.
  • Familiarity with building simple GenAI use cases, leveraging GenAI tools.
  • Extensive experience in various database technologies, like RDBMS, SQL, Oracle, NoSQL, Hive, and HBase.
  • Hands-on experience in building Hive queries for data processing and performance tuning of Hive SQL.
  • Exposure to cloud technologies like MS Azure, AWS, and Google Cloud.
  • Exposure to ETL and BI reporting tools such as Informatica and SSIS.
  • Exposure to Snowflake and Databricks for data integration and analytics.
  • Strong experience with Unix/Shell scripting and Python scripting, and exposure to scheduling tools like Autosys, Automic, and Control-M.
  • Experience in data modeling, including ER and dimensional modeling.
  • Experience with CI/CD (Jenkins) and version control (Git).
  • Experience with Agile methodologies, including Scrum and Kanban, and sprint-based delivery.
  • Experience in managing and leading a team of 5 to 6 software engineers across global locations.

Overview

15 years of professional experience
1 Certification

Work History

Senior Data Engineer (Sr. Associate Vice President)

Wells Fargo
09.2022 - Current
  • Designed and developed TSDS (Strategic Data Store) for marketing and trade surveillance applications on the Big Data Hadoop platform.
  • Spearheaded end-to-end development, including SUNRISE, data ingestion, transformation, and storage pipelines, ensuring high scalability and reliability.
  • Managed and mentored a high-performing Big Data engineering team, promoting collaboration and a focus on delivery.
  • Productionized the TSDS platform, facilitating real-time data delivery to critical surveillance systems.

Associate Manager, Software Development

IQVIA
12.2017 - 08.2022
  • Developed and optimized robust data pipelines using Apache Spark and Scala for seamless integration of internal and external data sources.
  • Managed data ingestion processes on Amazon S3, loading flat and Parquet files into stage tables.
  • Executed data mastering techniques for loading customer data into Reltio platform, enhancing quality and accessibility.
  • Collaborated with cross-functional teams to extract and publish client reporting data from CDW Hive tables.
  • Led improvements in data processing layers, including Stage, ETL, and publish fact tables to optimize system performance.
  • Directed offshore development team, performed root cause analysis for production issues, and implemented fixes.
  • Maintained comprehensive design documentation, conducted code reviews, and mentored junior team members.

Software Engineer

Collabera
03.2017 - 11.2017
  • Managed the enterprise data warehouse, ensuring data availability, integrity, and timely reporting.
  • Designed a robust SSIS ETL pipeline to optimize extraction, transformation, and loading processes.
  • Identified ETL performance bottlenecks and implemented solutions to enhance system efficiency.
  • Automated manual reporting tasks, improving productivity and reducing operational overhead.
  • Monitored production systems to maintain consistent data availability for reporting applications.
  • Performed performance tuning of complex Oracle SQL queries, delivering solutions that accelerated data processing.

PL/SQL and ETL Developer (Assistant Manager)

Meena Agency Limited
01.2013 - 02.2017
  • Developed efficient stored procedures and packages to enhance data retrieval and application response time.
  • Automated tasks using shell scripts to increase team productivity.
  • Streamlined data migration by creating SSIS ETL tools for project-specific needs.
  • Maintained comprehensive database documentation for accurate record-keeping and reference.
  • Contributed to successful project completion by meeting strict deadlines and client requirements.

Automation Program Engineer

TRL Krosaki Ltd.
07.2010 - 12.2012

Worked as an electronics and automation program engineer, developing PLC and SCADA programs.

Education

B.Tech - Electronics & Telecommunication

BPUT
ODISHA
08-2009

Skills

Data engineering and big data computing

PySpark and Scala

Python programming

Advanced SQL and databases

ETL processes and RDBMS

Snowflake and Databricks

Data lake solutions

Data warehousing strategies

Generative AI applications

Unix and shell scripting skills

Cloud data platforms expertise

Certification

  • GCP Associate Cloud Engineer
  • Azure Data Engineer

Timeline

Senior Data Engineer (Sr. Associate Vice President)

Wells Fargo
09.2022 - Current

Associate Manager, Software Development

IQVIA
12.2017 - 08.2022

Software Engineer

Collabera
03.2017 - 11.2017

PL/SQL and ETL Developer (Assistant Manager)

Meena Agency Limited
01.2013 - 02.2017

Automation Program Engineer

TRL Krosaki Ltd.
07.2010 - 12.2012

B.Tech - Electronics & Telecommunication

BPUT