Highly motivated employee with a desire to take on new challenges. Strong work ethic, adaptability, and exceptional interpersonal skills. Adept at working effectively unsupervised and quickly mastering new skills.
Overview
3 years of professional experience
1 Certification
Work History
Data Engineer
Bank of America
Mumbai
03.2021 - Current
Implemented Linux shell scripts for CDH clusters across various environments to perform cleanups and take backups of DataStage, Hive, Vertica, and grid jobs
Implemented Python scripts to fetch the risk reports and health check reports through Selenium and load them into the Oracle database
Collaborated on ETL (Extract, Transform, and Load) tasks, maintaining data integrity and verifying pipeline stability using PySpark
Automated the metrics flow-line process and generated dashboards in Tableau
Supported a cluster upgrade on CDH (Cloudera Distribution of Hadoop) clusters
Implemented Autosys JIL scripts for scheduling jobs in Prod and lower-lane environments
Supported the HashiCorp Vault migration
Developing an application that processes on-demand requests from an SQS queue by reading data from Delta Lake into DataFrames, querying it with Spark SQL, and storing the output in an S3 bucket for further processing
Worked on and maintained the enterprise Hadoop cluster platform in production
Worked on a Python-based script that reduced manual effort by 90% on a DataStage resiliency project by identifying particular server routine jobs
Automated the monthly OpenShift CVE ID report from Red Hat using Python and Selenium, which saved significant manual effort and gave the team a quicker turnaround
Projects
Sentiment analysis on Amazon product reviews
Role: Student
Performed sentiment analysis on Amazon product reviews to determine overall market sentiment about the product
Project involved the use of multiple open-source Python libraries, such as Selenium for web scraping and TextBlob for sentiment analysis
A logistic regression model was used to predict the sentiment of unseen data
Tableau was used to present the results in a clear, understandable format
Education
Bachelor of Engineering - Big Data technologies
CDAC
Sunbeam Institute
02.2021
Diploma - Electronics and Telecommunication Engineering
Gokhale Education Society College of Engineering
06.2019
Electronics and Telecommunication Engineering
Cusrow Wadia Institute of Technology
06.2016
Skills
Python, Linux shell scripting, MySQL, Oracle DB
Hive, Hadoop, Kafka, Alteryx, Apache Spark
IBM DataStage, Geneos, Autosys
JIRA, CI/CD, Docker, Databricks, Snowflake, and AWS EC2
Data analysis
Solution development
Certification
Microsoft Azure AZ-900 Certification
Data analysis using Python powered by Coursera