Summary
Overview
Work History
Education
Skills
Certification
Work Availability
Timeline
Generic
Kajal Mahajan

Kajal Mahajan

Mumbai

Summary

Highly-motivated employee with desire to take on new challenges. Strong worth ethic, adaptability and exceptional interpersonal skills. Adept at working effectively unsupervised and quickly mastering new skills.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

Bank of America
Mumbai
03.2021 - Current
  • Implemented Linux shell script for CDH clusters on various environments which will do cleanups, and take backups of data stage hive, Vertica, and grid jobs
  • Implemented Python scripts to fetch the risk reports and health check reports through Selenium and load them into the Oracle database
  • Collaborated on ETL (Extract, Transform, and Load) tasks, maintaining data integrity and verifying pipeline stability using Pyspark
  • Automated metrics flow line process and generated dashboards through Tableau
  • Supported cluster upgrade on CDH (Cloudera distributed Hadoop) Clusters
  • Implemented
  • JIL Autosys script for scheduling jobs in Prod and Lower Lanes environments
  • Supported Hashicorp vault migration
  • Developing an application to efficiently process on-demand requests from an SQS queue by reading data from Delta Lake in delta data frames.Leveraging Spark SQL for seamless query data, followed by storing the output in an S3 bucket for data processing
  • Efficiently worked and maintained the Hadoop cluster enterprise platform in production
  • Worked on Python-based script which reduced 90% of manual effort on resiliency project in
  • DataStage for identifying particular server routine jobs
  • Automated monthly openshift cve id‘s report from the Redhat side using Python, and
  • Selenium, which saved a lot of manual efforts and quick turnaround for the team
  • PROJECTS Sentiment analysis on Amazon product reviews
  • Role: Student Performed sentiment analysis on Amazon product reviews to decide overall market sentiment about the product
  • Project involved the use of multiple open-source Python libraries like Selenium for web scrapping, and Textblob for sentiment analysis
  • Machine learning algorithm logistic regression was used to predict the sentiment of unknown data
  • Tableau was used to present results in a presentable and understandable format.

Education

Bachelor of Engineering - Big Data technologies

CDAC Sunbeam Institute
02.2021

Diploma - Electronics and Telecommunication Engineering

Gokhale Education Society College of Engineering
06.2019

Electronics and Telecommunication Engineering

Cusrow Wadia Institute of technology
06.2016

Skills

  • Python, Linux shell scripting, MySQL, Oracle DB
  • Hive, Hadoop, Kafka, Alteryx, Apache Spark
  • IBM DataStage, Geneos, Autosys
  • JIRA,CI/CD, Docker,Databricks, Snowflake, and AWS EC2
  • Data analysis
  • Solution development

Certification

  • Microsoft Azure AZ-900 Certifications
  • Data analysis using Python powered by Coursera

Work Availability

monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Timeline

Data Engineer

Bank of America
03.2021 - Current

Bachelor of Engineering - Big Data technologies

CDAC Sunbeam Institute

Diploma - Electronics and Telecommunication Engineering

Gokhale Education Society College of Engineering

Electronics and Telecommunication Engineering

Cusrow Wadia Institute of technology
Kajal Mahajan