Summary
Overview
Work History
Education
Skills
Certification
Languages
Personal Information
Hobbies and Interests
Projects
Hobbies and interests
Interests
Timeline
Generic
Prachi Jain

Prachi Jain

Gurugram

Summary

Senior IT professional with 11+ years of experience in Big Data engineering, delivering scalable and high-performance data solutions using Scala, Python, Apache Spark, Streamsets, and MySQL. Proven expertise in AWS cloud and architecting end-to-end data pipelines to support complex business needs. Hands-on experience with Palantir Foundry, developing Workshop dashboards to accelerate clinical trial insights and enable data-driven decision-making. Known for driving efficiency, optimizing data workflows, and continuously adapting to emerging technologies.

Overview

12
12
years of professional experience
1
1
Certification

Work History

Senior Software Engineer

EPAM Systems
05.2021 - Current
  • Developed and deployed interactive dashboards using Workshop enabling data-driven decision-making for the client.
  • Created data models and set Data Ontology for various dashboards.
  • Wrote 2D & 3D aggregations using Foundry customized Typescript functions.
  • Built data pipelines and transformed raw files into useful datasets using Pyspark.
  • Implemented changes and bug fixes via change management framework.
  • Debugged problems across a full stack of Foundry and code based on Pyspark, Java, Elasticsearch, SQL.
  • Collaborated with stakeholders to understand business requirements and translate them into efficient solutions.
  • Acted as third level support for critical applications in Palantir and resolved complex incidents.
  • Monitored datasets and their build status to maintain sustainable data availability.
  • Documented technical work and mentored new team members.
  • EPAM India Pvt Ltd, Pharma, US Pharma company. Working as Senior Software Engineer.

Senior Associate Data Engineering

Axtria India Pvt. Ltd.
Gurgaon
02.2020 - 05.2021
  • Fetch source data from Oracle using Streamsets (Streamsets as unified data ingestion and transformation tool), storing data in AWS S3 (in parquet format) as base layer (Landing zone for raw data).
  • Transforming data from Base layer to curated layer (Delta Processing using SCD type one) using Streamsets.
  • Again, transforming data from curated layer to data product layer (Data consumption layer for downstream systems) with required details.
  • Using AWS S3 as storage, Using transformations like filter, union, group by, aggregation etc.
  • Used Matillion jobs to load the data from Source to target. The Source is database in Oracle, the target is staging tables in Redshift.
  • Then load the data from Staging to Publish tables. This is a delta load and rows are updated/inserted/inactivated into the Publish tables through Matillion jobs. These will act as Publish tables to business users and common area views/tables.
  • Axtria India Pvt. Ltd., Pharma, US Pharma company. Working as Senior Associate.

Consultant I

EXL Services
Gurgaon
04.2019 - 01.2020
  • Used Sqoop to import data from Oracle/Teradata into HDFS.
  • Developed ETL framework in Spark using Dataframes/SQL.
  • Developed Oozie workflow jobs to execute Hive, Sqoop and Spark actions.
  • Working to create one view using Spark to make it available for visualization for the BI team.
  • Fetch data from Oracle (OLTP data) using Kafka for monitoring purpose and storing the data into HBase. Using Hive on top of HBase to view the data in HBase.
  • EXL Services, Banking and Financial Service. Working for US leading Bank as Senior Business Analyst.

Big Data Developer

IBM India Pvt. Ltd.
Bangalore
05.2015 - 10.2018
  • Developed Spark-Scala transformations for large-scale datasets.
  • Built event-based data processing pipelines using Hive and Hadoop.
  • Implemented data ingestion frameworks using Sqoop and Oozie.
  • Worked in Agile environment with sprint-based delivery cycles.
  • Handled complex data formats including XML transformations.
  • Performing validation on data provided from source and ingesting that to Hive.
  • Running Hive queries to check the data in hive is same as the data in source and performing basic Data Integrity Check on them.
  • IBM India Pvt. Ltd., Banking and Financial Service.

Java Developer

Pratham Software
Jaipur
08.2014 - 02.2015
  • Development of SOAP/REST web services using advanced Java.
  • BDD stories/scenarios for automated testing using JBehave framework.
  • Writing unit tests using JUnit and performing component integration testing.
  • Pratham Software, Jaipur.

Education

B. Tech - CSE

Rajasthan Technical University
Jaipur, Rajasthan
01.2014

HSC -

H.G International School
Abu road, Rajasthan
01.2010

SSC -

St. Anselm’s School
Abu road, Rajasthan
01.2008

Skills

  • Apache Spark
  • Apache Hive
  • Sqoop
  • Hadoop
  • MapReduce
  • Redshift
  • EMR
  • EC2
  • Athena
  • Secret Manager
  • S3
  • ETL Tools-Matillion
  • Streamsets
  • Palantir
  • Scala
  • Python
  • TypeScript
  • WinSCP
  • Put
  • GitHub
  • Git
  • Gerrit
  • Agile

Certification

b6vtce6jp5uu, Palantir Certified Foundry Application Developer

Languages

English
Hindi

Personal Information

Father's Name: Pradeep Kumar Jain

Hobbies and Interests

  • Cooking
  • Gardening
  • Dancing

Projects

  • EPAM India Pvt Ltd, Pharma, US Pharma company, Senior Software Engineer, Developed and deployed interactive dashboards using Workshop enabling data-driven decision-making for the client., Created data models and set Data Ontology for various dashboards., Wrote 2D & 3D aggregations using Foundry customized Typescript functions., Built data pipelines and transformed raw files into useful datasets using Pyspark., Implemented changes and bug fixes via change management framework., Debugged problems across a full stack of Foundry and code based on Pyspark, Java, Elasticsearch, SQL., Collaborated with stakeholders to understand business requirements and translate them into efficient solutions., Acted as third level support for critical applications in Palantir and resolved complex incidents., Monitored datasets and their build status to maintain sustainable data availability., Documented technical work and mentored new team members.
  • Axtria India Pvt. Ltd., Pharma, US Pharma company, Senior Associate, Fetch source data from Oracle using Streamsets, storing data in AWS S3 as base layer., Transforming data from Base layer to curated layer using Streamsets., Transforming data from curated layer to data product layer with required details., Using AWS S3 as storage, using transformations like filter, union, group by, aggregation etc., Used Matillion jobs to load the data from Source to target.
  • EXL Services, Banking and Financial Service, US leading Bank, Senior Business Analyst, Used Sqoop to import data from Oracle/Teradata into HDFS., Developed ETL framework in Spark using Dataframes/SQL., Developed Oozie workflow jobs to execute Hive, Sqoop and Spark actions., Working to create one view using Spark for visualization for the BI team., Fetch data from Oracle using Kafka for monitoring purpose and storing the data into HBase.
  • Banking and Financial Service, Banking Client, United Kingdom, Big Data Developer, 3.5 years, Developed Spark-Scala transformations for large-scale datasets., Built event-based data processing pipelines using Hive and Hadoop., Implemented data ingestion frameworks using Sqoop and Oozie., Worked in Agile environment with sprint-based delivery cycles., Handled complex data formats including XML transformations.
  • Pratham Software, Trainee Software Developer - Java developer, Development of SOAP/REST web services using advanced Java., BDD stories/scenarios for automated testing using JBehave framework., Writing unit tests using JUnit and performing component integration testing.

Hobbies and interests

  • Dancing, Cooking, Badminton, Table Tennis, Basketball

Interests

  • Dancing
  • outdoor games (cricket, basketball, badminton, swimming)
  • Cooking

Timeline

Senior Software Engineer

EPAM Systems
05.2021 - Current

Senior Associate Data Engineering

Axtria India Pvt. Ltd.
02.2020 - 05.2021

Consultant I

EXL Services
04.2019 - 01.2020

Big Data Developer

IBM India Pvt. Ltd.
05.2015 - 10.2018

Java Developer

Pratham Software
08.2014 - 02.2015

B. Tech - CSE

Rajasthan Technical University

HSC -

H.G International School

SSC -

St. Anselm’s School
Prachi Jain