Summary

Overview

Work History

Education

Skills

Timeline

Nitin Chaudhary

Noida,India

Summary

● Developed Python scripts to transform raw data into intelligent data as specified by business user

● Worked closely with data modelers to model new incoming data sets

● Processed data into HDFS by developing solutions, analyzed the data using Pyspark, Hive/Impala/Kudu and produce results to downstream systems

● Developed Shell, Python scripts to automate the manual monitoring process for ETL jobs

● Used Spark API over Cloudera Hadoop YARN to perform analytics on data in Hive.

● Reading ETL jobs data from Oracle using Pyspark and store into Hive/Impala

● Skilled in interacting with clients and coordinating with multiple stakeholders for data exchange

● Written Python script for Kafka Producer Consumer

● Possess functional knowledge of designing and developing applications in Spark using Scala to compare the performance of Spark with Hive

● Knowledge on Hadoop Ecosystem such as HDFS, Resource Manager, Node Manager, Name Node, Data Node and MapReduce program paradigm

● Insightful knowledge of loading and transforming large sets of Structured, Semi-Structured and Unstructured data and analyzed them by running Hive queries and Python Scripts

● Possess strong analytical and problem-solving skills; an effective leader with excellent skills in motivating individual employee performance

Overview

years of professional experience

years of post-secondary education

Work History

Teach Lead Data Engineer

Birlasoft

Noida, Uttar Pradesh

05.2020 - Current

Title: Falcon (Birlasoft - JNJ)

Role: Development, Coding, Testing

Domain: Big Data (Pharma)

Technologies: Spark, Python, Cloudera, Hive/Impala/Kudu, Oozie, Oracle

Synopsis:

Capturing logs using Pyspark/Shell Script from different applications like GSDL, GSDR, Safety etc. and storing in Hive/Impala. Also getting server availability and ETL job status to show data in Tableau Dashboard.

Logging.

Data Engineer

Condeco Software

Gurgaon, Haryana

01.2021 - 04.2021

Title: Galaxy (Condeco)

Role: Development, Coding, Testing

Domain: Big Data (Sales)

Technologies: Spark, Scala, Azure Synapse Studio, Azure Data Pipeline, SQL

Synopsis:

Build data pipelines for data ingestion from source (SQL) to destination (blob) and automate it. Applied transformations using spark and store into data warehouse for reporting. Also wrote Stored Procedure for custom

Logging.

Big Data Consultant

Capgemini

Gurgaon, Haryana

12.2019 - 01.2021

Title: AML Reconciliation (Capgemini – Citi Bank)

Role: Development, Coding, Testing

Domain: Big Data (Financial)

Technologies: Spark, Scala, Hive, CDH5, Oracle

Synopsis:

Data ingested to HDFS in parquet format and we get the data from HDFS and process it to different layers using Spark Dataframe/SQL API by applying some business rules. In last layer, creating reconciliation report for valid transactions.

Associate Programmer

Clavax Technologies

Gurgaon, Haryana

02.2017 - 10.2019

Title: Novus Loyalty Analytics (Clavax Technologies)

Role: Development, Coding, Testing

Domain: Big Data Analytics

Technologies: Java, Hadoop, MongoDB, PowerBI

Synopsis:

Novus is a loyalty program for RuPay debit card. In this project providing vouchers, points and other loyalty benefits to our customers on using RuPay card. Our clients are Banks, NPCI, Tempe Golf Range, BookMyShow and others. Data comes in different formats (TEXT, CSV, TSV, JSON, and XML) and stored in HDFS. Processed data using MapReduce and Hive. Power BI is used to prepare the report from data.

Software Developer

MBD Alchemie

Delhi, Delhi

01.2017 - 02.2019

Education

No Degree - Master of Computer Application (MCA)

IGNOU

Delhi

04.2001 -

Bachelor of Science - Computer Application

Apex Institute

Meerut

07.2007 - 07.2010

Skills

Spark, MapReduce, HDFS, Hive/Impala, Kudu, Oozie, Sqoop, Yarn, Python, Scala, Java , SQL, Cassandra, Java, Spring MVC, Hibernate, Flask, Eclipse, VSCode, NetBeans, IntelliJ, Azure, AWS

Timeline

Data Engineer

Condeco Software

01.2021 - 04.2021

Teach Lead Data Engineer

Birlasoft

05.2020 - Current

Big Data Consultant

Capgemini

12.2019 - 01.2021

Associate Programmer

Clavax Technologies

02.2017 - 10.2019

Software Developer

MBD Alchemie

01.2017 - 02.2019

Bachelor of Science - Computer Application

Apex Institute

07.2007 - 07.2010

No Degree - Master of Computer Application (MCA)

IGNOU

04.2001 -

Nitin Chaudhary

Summary

Overview

Work History

Teach Lead Data Engineer

Data Engineer

Big Data Consultant

Associate Programmer

Software Developer

Education

No Degree - Master of Computer Application (MCA)

Bachelor of Science - Computer Application

Skills

Timeline

Data Engineer

Teach Lead Data Engineer

Big Data Consultant

Associate Programmer

Software Developer

Bachelor of Science - Computer Application

No Degree - Master of Computer Application (MCA)

Similar Profiles

KRISHNA KRISHNA null

Srikanth Reddy ChandaSrikanth Reddy Chanda

ALI SAFDAR NAIFALI SAFDAR NAIF

Gagan MandapatiGagan Mandapati