Summary
Overview
Work History
Education
Skills
Hobbies
Timeline
Generic
Shikta Sweekrita

Shikta Sweekrita

Aspiring Data Analyst
Mumbai

Summary

As a highly motivated and detail-oriented individual, I am passionate about working in the field of data analytics. I have a postgraduate degree in botany and I have developed a keen eye for patterns and a strong ability to analyze large amounts of data.


During my certification, I have developed a keen interest in data analytics and have gained experience in data cleaning, transformation, and analysis. I have worked with various data tools and software and am proficient in Microsoft Excel, Python, and SQL. I am also well-versed in statistical analysis and data visualization techniques.


As a fresher, I am eager to learn and grow in a dynamic work environment and am actively seeking opportunities in the field of data analytics. I believe that my strong analytical skills, attention to detail, and passion for working with data make me a great fit for any data analytics opportunity.

Overview

1
1
year of professional experience
8
8
years of post-secondary education
3
3
Languages

Work History

Capstone Project

Pima Indians Diabetes Database
02.2023 - 03.2023
  • Objective - The objective of this project is to use data science techniques to develop a model that can accurately classify whether an individual has diabetes or not. By analyzing various independent variables related to the individual's health and lifestyle, we aim to create a reliable model that can identify the presence of diabetes in individuals. The ultimate goal of this project is to contribute to the development of effective tools for diabetes diagnosis and prevention.
  • Methodology - The methodology for this project involves using exploratory data analysis techniques to analyze the relevant datasets and gain insights into the relationship between the independent variables and the presence of diabetes. We performed various data visualization techniques to identify correlations, outliers, and patterns in the data, which helped us to make informed decisions during the model selection process. Machine learning models such as Logistic Regression and Random Forest were applied to build a classification model that can predict whether an individual has diabetes or not. To evaluate the performance of the model, various metrics such as accuracy, precision, recall, and F1 score were used.
  • Results - The results of our project demonstrate the potential of data science techniques in analyzing complex datasets. Our model achieved high accuracy and performed well in identifying the presence of diabetes in individuals. This project highlights the importance of machine learning and data science techniques in the field of healthcare, particularly in developing effective tools for diabetes diagnosis and prevention.
  • Technical skills - Python, Pandas, Numpy, Matplotlib, Seaborn, scikit-learn, Microsoft Power BI, Power Point.

Academic Project 1

Iris Dataset
12.2022 - 12.2022
  • Objective - The objective of this project is to use the independent variables to predict the species of flowers in the dataset, which contains samples from three different species. By analyzing the special features of each flower and utilizing machine learning techniques, the goal is to create an accurate and reliable model for predicting the species of flowers based on the provided dataset.
  • Methodology - In this project, we applied data preprocessing techniques to clean and transform the dataset, including handling missing data, scaling the features, and encoding categorical variables. We then used a supervised machine learning model, specifically the Decision Tree algorithm, to classify the flowers based on their special features. To evaluate the performance of the model, we used several evaluation metrics such as accuracy, precision, recall, F1 score, and confusion matrix. We also conducted cross-validation. Overall, our methodology involved data preprocessing, machine learning model implementation, evaluation metrics, cross-validation, and feature selection, all of which were crucial steps in building an accurate and reliable model for predicting the species of flowers based on the given dataset.
  • Conclusion - The results of our project demonstrate the effectiveness of the Decision Tree algorithm in predicting the species of flowers based on their special features. The model achieved high accuracy and performed well in classifying the samples from the three different species. Our project also shows the importance of proper data preprocessing and evaluation metrics, in building an accurate and reliable machine learning model.

Academic Project 2

Credit Card Fraud Detection
11.2022 - 11.2023

Objective - The objective of this project is to develop a data science model for reducing the loss caused by credit card fraud. By analyzing transaction data and identifying patterns of fraudulent activities, we aim to create a reliable system that can detect and prevent fraudulent transactions.

Methodology - The methodology for this project involves pre-processing the transaction data and applying logistic regression to develop a model that can identify fraudulent transactions. The methodology included cleaning and transforming the data, which involved removing outliers and missing values. The model was trained on a labeled dataset that contains both fraudulent and non-fraudulent transactions. Regularization techniques were used to improve the model's generalization ability. To evaluate the performance of the model, various evaluation metrics such as accuracy, precision, recall, and F1 score were used.

Conclusion - Through our methodology, we were able to identify and analyze patterns in the transaction data that are associated with fraudulent activities, which can be used to further refine the model and improve its performance. Our project demonstrated the power of data analytics in detecting and preventing fraudulent transactions.

Education

Certification - Data Science And Business Analytics

Boston Institute of Analytics
Mumbai
08.2022 - 03.2023

Master of Science - Cytogenetics, Molecular Biology, Biotechnology

Thakur College of Science And Commerce
Mumbai
10.2020 - 04.2022

Bachelor of Science - Botany

Thakur College of Science And Commerce
Mumbai
07.2017 - 09.2020

HSC - Science

Thakur College of Science And Commerce
Mumbai
06.2015 - 06.2017

SSC - General

Ryan International School
Mumbai
04.2014 - 05.2015

Skills

    Python Programming

undefined

Hobbies

  • Dancing
  • Music
  • Travelling

Timeline

Capstone Project

Pima Indians Diabetes Database
02.2023 - 03.2023

Academic Project 1

Iris Dataset
12.2022 - 12.2022

Academic Project 2

Credit Card Fraud Detection
11.2022 - 11.2023

Certification - Data Science And Business Analytics

Boston Institute of Analytics
08.2022 - 03.2023

Master of Science - Cytogenetics, Molecular Biology, Biotechnology

Thakur College of Science And Commerce
10.2020 - 04.2022

Bachelor of Science - Botany

Thakur College of Science And Commerce
07.2017 - 09.2020

HSC - Science

Thakur College of Science And Commerce
06.2015 - 06.2017

SSC - General

Ryan International School
04.2014 - 05.2015
Shikta SweekritaAspiring Data Analyst