Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Timeline
Generic

Dr. Anuj Bajpai

Data Scientist
Nagpur

Summary

Highly organized and motivated Data Scientist with expertise in gathering, cleaning, and organizing data for both technical and non-technical stakeholders. Advanced proficiency in statistical, algebraic, and analytical techniques, coupled with a significant background in machine learning algorithms, including theoretical and applied aspects. Skilled in computer vision and deep learning methodologies, adept at leveraging these technologies to extract insights and drive decision-making processes.

Overview

2
2
years of professional experience
5
5
years of post-secondary education
10
10
Certifications

Work History

Data Scientist

Konverge.AI
03.2022 - Current
  • Worked with stakeholders to develop quarterly roadmaps based on impact, effort and test coordinations.
  • Discovered stories told by data to present information to scientists and business managers.
  • Utilized advanced querying, visualization and analytics tools to analyze and process complex data sets.
  • Created and implemented new forecasting models to increase company productivity.
  • Applied statistical and algebraic techniques to interpret key points from gathered data.
  • Compiled, cleaned and manipulated data for proper handling.

Education

Ph.D. - Computational Sciences

IIT Kanpur
Uttar Pradesh
07.2012 - 11.2017

Skills

    Machine learning

undefined

Accomplishments

    Project 1: Sales Forecasting Using time series data

    Problem Statement : To suggest possible forecasting models for prediction of shipped units for a list of 110 products

  • Developed a precise and well-defined comprehension of the business requirement.
  • Figured out anomaly in the data using proper EDA techniques at the early stages of the project.
  • As data was having a superimposing random component and all statistical models were not working satisfactorily. In such a condition prepared an exhaustive list of experiments where most of the possible models have been tried for wide range of products.
  • Applied a variety of models, spanning from traditional to cutting-edge, in the realm of sequence modeling.
  • Establishing a firm performance criteria for different models such as to decide optimal performance of a model.
  • Achieved better forecasting accuracy compared to baseline models or industry-standard methods, showcasing the effectiveness of the chosen approach.
  • Tuned model hyperparameters systematically to enhance performance, demonstrating a strong grasp of model configuration.
  • Delivering results at stipulated time with well prepared documentation.
  • Projects 2: Hr Suite Attrition prediction.

    Problem Statement: The main aim of work was to develop an accelerator for prediction of attrition(turnover), termination reason, turnover reason and type as well as forecasting attrition using time series forecasting.

  • Learned as well as implemented various machine learning techniques like feature selection, k-fold cross validation techniques, SMOTE, ADASYN(data augmentation techniques).
  • Tried, tested and tuned hyper-parameters for different machine learning models like logistic regression, decision tree, SVM and marked the best possible.
  • Successfully accomplished 4 tasks out of 5 with commendable accuracy, and other parameters like precision, accuracy and F1 score. Also made use of kappa coefficient in case of ties between models.
  • Project 3: Optimization Singular Control Technologies

    Problem Statement : A well structured python program making use of efficient libraries for solving operation research problems is desired. Developed application should be equipped with capability of solving linear/nonlinear optimization problems as well as iterating over multiple solvers.

  • Expertly formulated the problem as a mathematical optimization model, translating real-world constraints and objectives into a format compatible with OR-Tools.
  • Designed custom objective functions that appropriately quantified the problem's goals, balancing multiple criteria to achieve the desired outcomes.
  • Made use of pulp library and goggle OR tools for solving for solving constrained optimization LPP ans QPP.
  • Project 4: Cohort analysis for ride business.

    Problem Statement: To perform cohort study, churn analysis/retention analysis/personalized customer experience.

  • To demonstrate our expertise for the bidding project, we will commence by generating the necessary data and then proceed with the full implementation of the analysis outlined above.
  • Created an extensive dataset containing trip records, customer profiles, and driver information, ensuring seamless interconnection and consistency among all the data components.
  • Designed an impressive dashboard spanning five pages that effectively presents analytics derived from trip, customer, and driver data.



Certification

Python for Data Science, AI & Development Course IBM Python for Data Science, AI & Development

Timeline

Fundamentals of the Databricks Lakehouse Platform Accreditation (V2)

09-2023

Databricks Certified Learning Associate Scalable Machine Learning with Apache Spark™ (V2)

09-2023

Custom Models, Layers, and Loss Functions with TensorFlow

08-2023

Data Collection and Processing with Python

06-2023

SQL for Data Science

02-2023

Convolutional Neural Networks in TensorFlow

12-2022

Python for Data Science, AI & Development Course IBM Python for Data Science, AI & Development

10-2022

Data Science Methodology

10-2022

Introduction to TensorFlow for Artificial Intelligence, Machine Learning, and Deep Learning

10-2022

Data Scientist

Konverge.AI
03.2022 - Current

High Perfomance Computing for Scientists and Engineers

11-2020

Ph.D. - Computational Sciences

IIT Kanpur
07.2012 - 11.2017
Dr. Anuj BajpaiData Scientist