
Mastered the concepts of statistics, Python and R programming, machine learning, and the Hadoop ecosystem and its deployment. Expert in several tools and techniques, including NumPy, Pandas, Matplotlib, Seaborn, Scikit-learn, Spark, NLP, and web scraping. Experienced at creating regression models, applying predictive modeling, and analyzing data mining algorithms to deliver insights and implement action-oriented solutions to complex business problems. Seeking an opportunity for career growth through continuous learning, leveraging machine learning and advanced analytics to help the firm's business thrive. Skilled in machine learning, data visualization, mind reading, and creative thinking.
Managing a six-member data team that handles all data-related activities for the services listed below:
104 call centre (Health Helpline)
MMU (Mobile Medical Units)
TMC (Tele Medical Units)
HWC (Health and Wellness Centre) - LHF
Bahmni (higher health facility)
Sakhi application (ASHA-related), etc.
The team produces two types of visualizations:
a) Visualization using Tableau.
b) Visualization using open-source tools (Python Dash and Streamlit), depending on stakeholder requirements.
Reference: dashboard for the 104 call centre covering all 4 locations we serve (built with open-source Python Dash):
http://hihl104.piramalswasthya.org:8090/
Some of the predictive modeling activities are listed below:
a) Forecasting disease case counts from the historical data received from our 104 call centres, MMUs, TMCs, and other services, and predicting the top 10 diseases and their algorithms using time series forecasting, so that our services can plan and act on the model outputs.
b) Predicting NCDs (non-communicable diseases) and CDs (communicable diseases) from data received from the Sakhi application (ASHA app), one of our services. Examples include predicting cancer, hypertension, and BP from historical data and inputs provided by our clinical domain team (doctors), using supervised machine learning techniques.
Projects Done:
Project Name: Industrial Clusters Identification
http://tsocmms.nic.in/TLNPCB/userMaster/consentCTEDetail1B
Project description: This project identifies the different types of industries, along with their status, scale, and other available details.
Synopsis:
· Extracted data from tsocmms: http://tsocmms.nic.in/TLNPCB/userMaster/getAppValueForCTOTotalA?id=115852&days1=-1&a=z
· Performed data acquisition, exploration, wrangling, and EDA, including handling missing values, cleaning, and visualizing the data.
· Built multiple models using supervised approaches (both classification and regression) and unsupervised approaches (K-Means clustering) to gain insights from the data and check model accuracy.
· Visualized the models using Matplotlib, Seaborn, and Tableau.
Project Name: Market Strength Analysis
https://www.mca.gov.in/mcafoportal/viewCompanyMasterData.do
Project description: To assess the market potential of different segments and gain insights from it.
Synopsis:
· Collected data from the MCA portal.
· Pre-processed the data into the required format.
· Performed feature engineering using different feature selection techniques.
· Applied PCA and LDA, built supervised learning models, and visualized the results.
Project Name: Industrial Segmentation
https://tspcbbmwmanifest.cgg.gov.in/Reports/ViewIndustries.aspx
Project description: To segment industries based on their CFO specifications and gain insights from the results.
Synopsis: Segmented industries based on their CFO: extracted and processed the data, built models, and derived insights to identify the percentage of different segments in the market and their capabilities.
Project Name: Healthcare - COVID-19 Survival Rate (Diabetes and Hypertension)
https://www.bharatbiotech.com/
Synopsis: Created an ML model to classify survival for COVID-19 patients with diabetes and hypertension, using supervised classification models.
Project Name: Loan Eligibility Prediction
Synopsis: Completed this project using machine learning models.
Python
Machine Learning
Tableau
Dash
Streamlit
Data Analytics
NLP
• 7+ years of experience as a Data Scientist
• 3+ years of experience as a Business Analyst and in Business Excellence, executing data-driven solutions to increase the efficiency, accuracy, and utility of internal data processing.
• Hands-on experience with various business and data analytics platforms.
• Responsible for applying data visualization techniques in Tableau to increase the usability and accessibility of data for the sales and marketing departments.
• Analyzed and processed complex data sets using advanced querying, visualization and analytics tools.
• Hands-on experience in applying core machine learning methodologies: Random Forest, Decision Trees, classification, clustering, predictive analytics, and regression.
• Used statistical techniques to validate data and interpretations.
ML Expert
Problem Solving
Decision Making
Business Communication