Summary
Overview
Work History
Education
Skills
Languages
Timeline
Generic

Akshay Balasaheb Nikam

Flat No 104 A, Gini Sanskriti Society, Hadapsar,Pune

Summary

  • Currently employed at Genpact India, Pune as a Big Data Developer since Feb 2024.
  • I have 6+ years of experience in various domains within the IT industry, specializing in the Hadoop Ecosystem, PySpark APIs, and NoSQL Databases such as Hbase and AWS services
  • Proficient in handling large datasets and utilizing core components of the Big Data ecosystem, including Spark, Hive, Sqoop, HDFS, Hbase and Kafka
  • Proficient in AGILE methodologies and adept at troubleshooting and optimizing performance issues

Overview

6
6
years of professional experience

Work History

Lead Consultant (Data Engineer)

Genpact India
Pune
02.2024 - Current

Project :- Risk Canvas

Client :- Amp bank, Australia

  • Risk Canvas is a comprehensive financial crime risk management platform.
  • This product is designed to help financial institutions manage various aspects of financial crime prevention, such as Anti-money laundering(AML), fraud detection, transaction monitoring, risk scoring, customer due diligence.
  • For this project, we used technologies like Python, PySpark, AWS Glue, AWS Athena, DynamoDB, AWS Lambda, Elasticsearch, Step Functions, EC2, EMR, S3, HBase, and RestAPI.
  • For this, pipelines are created to process the data from the S3 location to the Risk Canvas UI.

Big Data Developer

Amdocs Development Center LLP
Pune
07.2022 - 02.2024

Project :- Legal Data Retention

Client :- Vodafone Ireland

  • Designed and implemented a data pipeline to provide legal customer data to Ireland Government based on complaint registers for specific mobile numbers. For this client, where we are dealing with large amount of data which is in hourly basis, daily basis, weekly basis and monthly basis.
  • As well as we are pulling data using API’s, then we are encrypting the data with gpg encryption and moving it to GCS(GCP) using Apache NiFi.
  • Again read the file from GCS and decrypt it then transforms it into parquet file and if any Pii data is present then pseudonymize the pii data and again store it on GCS.
  • Created Pyspark jobs to transform data as per business requirement with gpg encryption. Developed a Python application to move files from SFTP location to HDFS.
  • Utilized API integration and encryption techniques for secure data transfer to GCP using Apache NiFi.
  • Transformed and pseudonymized sensitive data, created PySpark jobs, and shared reports with the business team. Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability

Big Data Developer

Mobile Programming India Pvt Ltd
08.2021 - 07.2022

Project :- Network Monitor System (Vodafone Ireland)

Client :- Amdocs

  • Built an end-to-end data pipeline for real-time customer data using HDFS, Apache-Spark, Hbase, Python, and Kafka.
  • Created Spark jobs to process and store data in Kafka and Hbase.
  • Developed Python API (Flask) to display data on a web application.
  • Dockerized the application for efficient deployment.

Data Engineer

MyTech Solutions
Pune
07.2018 - 07.2021

Project :- Retail Store Data Pipeline

Client :- Mahyco Pvt. Ltd. (Pune)

  • Ingested supermarket data from the mainframe system to the data warehouse in real-time.
  • Developed an end-to-end data pipeline using HDFS, Apache Spark, MySQL, and PowerBI for data cleaning, processing, and analysis.
  • Collaborated with the business team to understand requirements and implemented data transformations accordingly.
  • Managed JIRA tickets, conducted data validation, quality checks, and profiling.

Education

Master of Computer Application -

Savitribai Phule Pune University
Pune
06-2018

Bachelor of Computer Science -

Dr. Babasaheb Ambedkar Marathwada University
Aurangabad
01.2015

Skills

  • PySpark
  • SQL
  • Spark
  • Python
  • Hive
  • Hbase
  • Kafka
  • RestAPI
  • Hadoop
  • JIRA
  • Jenkins
  • Github
  • GCP Pubsub
  • GCP Bigtable
  • Docker
  • AWS Glue
  • Elastic Search
  • AWS-Lambda
  • EC2
  • S3
  • Step Function
  • Athena
  • Dynamo db

Languages

  • English
  • Marathi
  • Hindi

Timeline

Lead Consultant (Data Engineer)

Genpact India
02.2024 - Current

Big Data Developer

Amdocs Development Center LLP
07.2022 - 02.2024

Big Data Developer

Mobile Programming India Pvt Ltd
08.2021 - 07.2022

Data Engineer

MyTech Solutions
07.2018 - 07.2021

Master of Computer Application -

Savitribai Phule Pune University

Bachelor of Computer Science -

Dr. Babasaheb Ambedkar Marathwada University
Akshay Balasaheb Nikam