RAJESH DASH

SDE-2
Bangalore

Summary

Results-oriented and passionate professional with 9 years of overall experience in application development and Big Data engineering, with knowledge of batch and stream processing, RESTful services and cloud. Curious and keen to keep learning and growing.

Overview

16 years of professional experience
5 years of post-secondary education
2 Certifications
3 Languages

Work History

Data Engineer

Expedia
Bangalore
2021.06 - Current

Meteor Pipeline

  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Analyzed complex data and identified anomalies, trends and risks to provide useful insights to improve internal controls.
  • Captured data from the sim capture database and applied various transformations to it.
  • Extensively used DataFrame operations.
  • Implemented various rules in Spark to meet business requirements.
  • Implemented the raw, bronze and silver layers of the pipeline.
  • Wrote Scala unit tests for custom functions.
  • Wrote an open source library for data anonymization.
  • Built parts of this project from scratch, including the POC.
  • Wrote data-quality test cases and displayed the results on a SQL analytics dashboard.
  • Technologies used: Apache Spark (Batch), Scala, Databricks, Postgres, DMS, Git, SQS, SNS, Lambda, S3

Adaptive Learning

  • Developed an API to collect user behavior event data from 7 products.
  • Wrote a pipeline to cleanse the data and flatten complex columns into easily queryable data for data scientists.
  • Wrote data quality tests for different test cases.
  • Developed, implemented and maintained data analytics protocols, standards and documentation.
  • Stored profiles (derived from event-store data) in a MongoDB database for querying.
  • Implemented a simple Lambda-based API with API Gateway for querying data from MongoDB.
  • Technologies used: FastAPI, Lambda, S3, Firehose, AWS Glue, Apache Spark (Batch), Scala, MongoDB, Git

Module Lead

American Express
Bangalore
2020.12 - 2023.06

CISO

  • Built an ETL pipeline to scale up the data processing flow to meet rapid data growth, exploring technologies such as NiFi and Elasticsearch.
  • Pulled data from an Isilon share location and derived the end-to-end flow of modifying customer data, processing it and storing unique results in Hive.
  • Worked on ingestion using Apache NiFi and scheduled Spark jobs through it.
  • Wrote a Spark job that checks file hashes and reports those flagged as malicious, using Spark and CrowdStrike.
  • Created a NiFi flow that runs queries against Elasticsearch and generates a report by storing unique records in Hive whenever there is an attack.
  • Technologies used: Scala, Spark, Elasticsearch, Hive, Shell, NiFi, Bitbucket, CrowdStrike, SIEM

Technical Lead

Huawei
Bangalore
2018.05 - 2020.11

N-Viva

  • Built an ETL pipeline to scale up the data processing flow to meet rapid data growth, exploring Spark and improving the performance of the existing Hadoop algorithm using SparkContext, Spark SQL, DataFrames, pair RDDs and Spark on YARN.
  • Pulled data from mediation and derived the end-to-end flow of modifying customer data, processing it and rewarding eligible customers.
  • Worked on the rewarding process, storing reward details in Cassandra through the reward loader.
  • Logged the status of each job to DynamoDB using the AWS SDK.
  • Defined parameters for API and data acquisitions.
  • Technologies used: Apache Spark, S3, AWS, Scala, Cassandra, DynamoDB, GitHub, Jenkins, ScalaTest

HCL Sametime

  • Technologies used: Apache Hadoop, HDFS, Java, MongoDB, React.js, GitHub

Senior Software Engineer

NXP Semiconductors
Bangalore
2012.10 - 2018.01

EDA (Electronic Design Automation)

  • Worked with software development and testing team members to design and develop robust solutions to meet client requirements for functionality, scalability and performance.
  • Tested the methodology by writing and executing test plans and by debugging and testing scripts and tools.
  • Technologies used: Apache Hadoop, HDFS, Java, Hive, GitHub, Jenkins, Ant, Maven

CDK (Configuration Development Kit)

  • Technologies used: Java 1.8, Equinox, OSGi, JavaFX, MATLAB, JAXB, SonarQube, Black Duck, NSIS, Jenkins, EMF, Xtext

Software Engineer

NXP Semiconductors
Bangalore
2012.10 - 2016.10

IREC (Intelligent Radio Configuration Tool)

  • Technologies used: Java 1.7, Equinox, OSGi, JFace, C, EMF, SonarQube, Black Duck, NSIS, Jenkins

Education

PGD (Big Data & Analytics) - Big Data

Birla Institute of Technology, Pilani
London
2020.11 - 2021.11

BTECH - Computer Engineering

Biju Patnaik University Of Technology, RKL
Odisha
2006.07 - 2010.07

Skills

  • A Big Data enthusiast working as a Data Engineer with technologies like Apache Spark, Scala, Databricks, API Gateway, Python Flask and FastAPI


Software

Apache Spark (Batch/Stream)

Scala

Java

Python

AWS (RDS, S3, DMS, SQS, SNS, Lambda, API Gateway)

Apache NiFi

Apache Hive

Kinesis Firehose

Kafka

FastAPI and Flask

Docker

Cloudera

Databricks

SQL (Postgres/MySQL)

NoSQL (Cassandra, MongoDB)

Certification

Databricks Spark Certification

Timeline

Databricks Certified Delta Lake Developer

2022-11

Databricks Spark Certification

2021-08

Data Engineer

Expedia
2021.06 - Current

Module Lead

American Express
2020.12 - 2023.06

PGD (Big Data & Analytics) - Big Data

Birla Institute of Technology, Pilani
2020.11 - 2021.11

Technical Lead

Huawei
2018.05 - 2020.11

Senior Software Engineer

NXP Semiconductors
2012.10 - 2018.01

Software Engineer

NXP Semiconductors
2012.10 - 2016.10

BTECH - Computer Engineering

Biju Patnaik University Of Technology, RKL
2006.07 - 2010.07