Expertised in Python, Pyspark, AWS and advanced SQL having 2+ years of experiance in developing data model and implementaion of ETL process from end to end. Implemented data driven solution which gives high 80-90% accuracy.
Overview
4
4
years of professional experience
1
1
Certification
Work History
Data Engineer
Systech
07.2022 - Current
Build a pipeline to extract data from Gsheets through Fivetran and load into redshift and transform data by using DBT.
Build a pipline to extract dara from Redshift tables and transform data in aggregated level and loads into AWS S3 through databricks.
On top of S3 parquet/csv files created AWS athena tables by using AWS services like S3, Glue Catalouge and Glue crawlers.
Developed a model to do fuzzy match process to return respective mapped id with confidence score on basis of given input informations.
Developed a Airflow DAGS to run models on given time.
Data Engineer (Intern)
Altimetrik
02.2022 - 06.2022
Data scraping from online sources by using Python packages like Selenium, Beautiful Soup.
Converting raw data into meaning format, data preprocessing with Pandas and Numpy.
Finding out artifacts from dataset by visualizing with dash-plotly, matplotlib and seaborn.
Implementation of dashboards for the businesses to solve problems for sales team or marketing team.
Connecting and implementing AI-ML functionality and making dashboards actionable.
Data Scientist
Chingari
12.2021 - 01.2022
Worked on data annotation for training and predicting the custom objects in the images with using DL (Deep Learning) Transfer learning models such as YOLOV5.
Worked on implementation of telegram chat-bots to automate the process to extract wanted text with Python RegEx then remove respective data from SQL database.
Software Developer
KreativSARG Technologies
02.2020 - 02.2021
Worked on OpenERP system and odoo to implement CRM, HR and Sales app models with Python and OOPS concepts.
Managing and maintaining postgres database and UI of product life cycle from manufacturing, production to sale.
Worked on implementation of tables, forms, view, and roles from backend.
Worked on creating fields with relational database like one2one, one2many and many2one.
Education
BE Computer
Yadavrao Tasgaonkar Institute of Engineering & Tec
Mumbai, MH
07.2019
Skills
Python, C, Pyspark, SQL
AWS S3, Athena, Redshift Glue and Lambda
Fivetran, Looker, Airflow, DBT and databricks
Tableau, Excel and Dash-Plotly,
Github, SVN
Certification
PG Diploma in Data Science – DataTrained Education Mar 2021 – Sep 2021
Languages
English
Upper intermediate (B2)
Hindi
Upper intermediate (B2)
Telugu
Upper intermediate (B2)
Marathi
Upper intermediate (B2)
Timeline
Data Engineer
Systech
07.2022 - Current
Data Engineer (Intern)
Altimetrik
02.2022 - 06.2022
Data Scientist
Chingari
12.2021 - 01.2022
Software Developer
KreativSARG Technologies
02.2020 - 02.2021
BE Computer
Yadavrao Tasgaonkar Institute of Engineering & Tec