Summary
Overview
Work History
Education
Skills
Certification
Timeline
Hi, I’m

Rohan Srivastava

Lead Data Engineer (Azure Certified)
Noida
Rohan Srivastava

Summary

I am a Lead Data Engineer with experience of more than 7.8 years in Hadoop and Azure Cloud . I have worked with Healthcare,Insurance,Retail and Financial projects extensively and currently employed with EPAM. My Tech stack includes Hive, Spark, Scala, Spark SQL, Kafka, Sqoop, Oozie ,Shell scripts ,Java , Azure sql Db, Azure synapse , Azure Data factory, Azure Cosmos , Azure Streaming , Azure HDInsight , Azure Databricks etc

Overview

8
years of professional experience
6
years of post-secondary education
1
Certification

Work History

EPAM
Noida

Lead Software Engineer
09.2021 - Current

Job overview

Project 1 :

  • Helped a client to move data from different sources (SDTP/FTP/Db2 etc) to ADLS Gen2 using ADF pipeline .
  • Basic transformation was done in pyspark using databricks activity
  • End to End pipeline was automated and we used Azure Devops Pipeline for CICD .

Project 2 :

  • Helped a client to move 14 use cases originally build in spark scala to migrate from Hadoop to Azure Databricks .
  • The data was ingested using Qlik in ADLS and which act as a source of all the 14 use cases .
  • Jenkins was used as CICD tool which migrated the project to uat & prod .

Metlife
Noida

Team Lead
10.2019 - 09.2021

Job overview

  • Worked upon development of Spark based logics for aggregating,designing,cleansing, transforming terabytes of structured & semi structured data available in HDFS .
  • Parametrized shell scripting code was written which was responsible for automating and triggering spark & hive scripts.
  • Experience in Ingesting the data from different RDBMS using spark framework and converting the same into appropriate views and tables .
  • Developed a framework to perform monitoring over different Data Science models and report them in case the model performance degrades .
  • Decompressed non supportable formats of data around 4 TB in hdfs and made it handy to use for Data Science team using various Big data technologies

EPAM
Noida

Senior Big Data Engineer
08.2017 - 09.2019

Job overview

  • Hadoop was used for ETL to ingest, aggregate and parse the data from about 20 data sources .
  • Developed Sqoop scripts in order to make the interaction between Hive and SQL Database .
  • Built hive logics which has been further used in creating predictive model variables .
  • Converted GBM class file into Pojo models using Java .GBM files are the one generated using H2O model (ML algorithm).
  • Aggregated Pojo model and logistic regression code with Hive logics using shell scripting .
  • End to End automation were done and it was made sure that the script daily delivers reports to clients in correct format .
  • Scheduled scripts daily using TWS jobs

Tata Consultancy Service
Noida

Big Data Engineer
02.2016 - 07.2017

Job overview

  • Developed analytics at the point of Business Sales in a Pharmacy Based chain .
  • The Daily sales were pulled using Sqoop from DB2 server and saved as hive tables .
  • Business Logics were converted using hive .The Daily sales after transformation was visualized using Tableau .
  • The final tables were optimized and saved into appropriate formats using partitioning and bucketing .

Education

Jaypee Institute of Information & Technology
Noida

Bachelor of Technology from Electronics And Communications Engineering
07.2011 - 07.2015

University Overview

Graduated with 77 %

GN National Public School
Gorakhpur

Senior Secondary from Science & Maths
07.2009 - 07.2010

University Overview

Graduated with 80 %

GN National Public School
Gorakhpur

High School from Science & Maths
07.2007 - 07.2008

University Overview

Graduated with 86.4 %

Skills

    Spark

undefined

Certification

Micorsoft Azure Data Engineer Associste (DP 200 & DP 201)

Timeline

Lead Software Engineer
EPAM
09.2021 - Current

Micorsoft Azure Data Engineer Associste (DP 200 & DP 201)

04-2021
Team Lead
Metlife
10.2019 - 09.2021
Senior Big Data Engineer
EPAM
08.2017 - 09.2019
Big Data Engineer
Tata Consultancy Service
02.2016 - 07.2017
Jaypee Institute of Information & Technology
Bachelor of Technology from Electronics And Communications Engineering
07.2011 - 07.2015
GN National Public School
Senior Secondary from Science & Maths
07.2009 - 07.2010
GN National Public School
High School from Science & Maths
07.2007 - 07.2008
Rohan SrivastavaLead Data Engineer (Azure Certified)