Shubham Naik

Data Analyst
Pune

Summary

Analytically minded data science enthusiast, proficient in handling large datasets and in using statistics and machine learning to find complex data patterns that drive meaningful business impact. Looking for an opportunity to build a challenging career and apply these skills to innovate and simplify processes, with a team-oriented attitude.

Overview

5 Certifications

Work History

Data Analyst

TwarIT Mobility Solutions Pvt. Ltd.
Pune
10.2024 - Current
  • Responsible for gathering, cleaning, and managing data related to operations of MSRTC's electric bus fleet. Utilized Python and SQL for data cleaning, preprocessing, and transformation to ensure data accuracy and reliability.
  • Developed and maintained interactive dashboards in Power BI/Tableau to provide real-time insights into key operational metrics such as energy consumption, trip duration, and total trip kilometers. The dashboards offer comprehensive views that support data-driven decisions, enhance fleet performance, optimize routes, and improve overall energy efficiency.
  • Improved decision-making processes with accurate data analysis and visualization techniques.
  • Managed large-scale databases to ensure timely access to critical information for key stakeholders.

Data Analyst

Medulla Recruitment Services
Pune
07.2023 - 08.2024
  • Established a culture of continuous improvement by fostering open communication channels and encouraging feedback on data analysis methodologies
  • Data visualization expert, experienced in creating dashboards and compelling reports using advanced DAX
  • Gathered requirements from the business
  • Loaded data from various sources, including SQL Server, Synapse DWH, and flat files
  • Analyzed data and identified KPIs
  • Data transformation, data modelling, and UI and wireframe design
  • Applied analytical thinking to translate data into informative visuals and reports using advanced DAX
  • Implemented Incremental Refresh in the reports
  • Advised technical teams and business stakeholders on project-related matters
  • Knowledge of Object-Level Security (OLS), Row-Level Security (RLS), Page-Level Security (PLS), and Paginated Reports

Data Science Trainee

Almabetter
Bangalore
05.2021 - 06.2023
  • Learned technical skills such as Feature Engineering, EDA, and Data Visualization, along with soft skills such as teamwork, time management, and problem-solving
  • Gained hands-on experience with tools such as Tableau and Scikit-learn
  • Gained proficiency in Python programming
  • Learned and gained proficiency in new technologies such as Tableau and NLTK
  • Converted an EDA capstone project, Airbnb Booking Analysis, into an interactive Tableau dashboard
  • Worked as a Subject Matter Expert, answering data science questions and queries on the Doubt Resolution forum

Education

BCA

Savitribai Phule Pune University
Pune

HSC

SJV College
Pune

Skills

Python

SQL

PostgreSQL

Power BI

Tableau

Looker Studio

Scikit-Learn

Keras

PySpark

Pandas

NumPy

NLTK

ETL

AWS

GitHub

Docker

Kubernetes

Machine Learning

Deep Learning

LLM

Time Series

AWS SageMaker

AWS Glue

Certification

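Data Science Professional, 2018, Marsian Technologies
Full Stack Data Science Program, 2022, Almabetter
AWS Certified Solutions Architect Associate SAAC03, 2022, Udemy
AWS Certified Machine Learning Specialty, 2022, Udemy
PySpark & AWS (Data Pipeline), 2022, Udemy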

Accomplishments

  • Silver Badge in Python, HackerRank
  • Golden Badge in SQL, HackerRank

Publications

  • Medium Blogs on Data Science, 2022
  • Decision Trees for Classification and Regression, 2022

Projects

Credit Card Default Prediction 

  • Built a classification model using XGBoost to predict whether a customer will default on their credit card.
  • Benchmarked the XGBoost algorithm against Logistic Regression, Random Forest, and Support Vector Classification.
  • Compared performance metrics across the models and found XGBoost to be the best, with a recall of 85.6%.
  • Tools and techniques: Logistic Regression, Random Forest Classifier, XGBoost, Support Vector Classifier, SMOTE, Class Imbalance


NYC Taxi Trip Time Prediction

  • Built regression models using Gradient Boosting, Decision Tree Regressor, and XGBoost to predict taxi trip time in NYC.
  • Employed processing techniques such as Feature Engineering and Outlier Treatment.
  • Experimented with hyperparameter tuning techniques such as GridSearchCV and achieved an R2 score of 73.7% using Gradient Boosting.
  • Tools and techniques: Linear Regression, Decision Tree, Gradient Boosting, XGB Regressor


Netflix Movies and TV-Shows Clustering

  • Employed NLP techniques such as text cleaning, stemming, tokenization, and stopword removal. Vectorized the final text column using TF-IDF, followed by dimensionality reduction using PCA.
  • Performed data cleaning, preprocessing, and analysis, and built models to address business problems by finding data patterns.
  • Visualized the most used words in Title and Cast with WordCloud.
  • Evaluated the optimal number of clusters using the silhouette score and obtained 5 clusters; K-Means performed well on the dataset, with a silhouette score of 0.82.
  • Tools and techniques: K-Means, PCA, NLP, Clustering, Silhouette score, Elbow method


Timeline

Data Analyst

TwarIT Mobility Solutions Pvt. Ltd.
10.2024 - Current

Data Science Trainee

Almabetter
05.2021 - 06.2023
Data Science Professional, 2018, Marsian Technologies
Full Stack Data Science Program, 2022, Almabetter
AWS Certified Solutions Architect Associate SAAC03, 2022, Udemy
AWS Certified Machine Learning Specialty, 2022, Udemy
PySpark & AWS (Data Pipeline), 2022, Udemy

Data Analyst

Medulla Recruitment Services
07.2023 - 08.2024

BCA

Savitribai Phule Pune University

HSC

SJV College