Data Scientist with a strong focus on managing complex datasets for machine learning applications. Proficient in natural language processing and a wide array of machine learning algorithms, with a keen ability to visualize and analyze data to uncover actionable insights.
Programming Languages: Python
Machine Learning Techniques: Decision Tree, SVM, Linear Regression, Ridge Regression, Lasso Regression, Logistic, KNN Regression, Random Forest, Naïve Bayes, K Means, Ensemble Techniques PCA, AdaBoost Grid Search CV, Random Search CV, Hypothesis testing, Predictive modelling
Python /ML Packages: Pandas, NumPy, Scikit learn
Natural Language Processing: Understanding, representation, classification & clustering nltk, Genism , text blob, lang detect, google trans ,BOW, TFIDF, word2vec, doc2vec,sent2vec ,key phrase extraction
Data Visualization: Matplotlib, Seaborn,
Database: SQL, MYSQL
Deep Learning: Neural Networks, Deep Learning, ANN, CNN, RNN, Back Propagation, Transfer Learning, TensorFlow 2x, Kera’s
Cloud Platform Services & Deployment: AWS, Flask, Postman
Version Control: GitHub, Git Bash
Project 1: Automated Document Verification System
Domain: Insurance
· Design and develop deep learning models for document verification, such as Optical Character Recognition (OCR), image classification, or anomaly detection models.
· Applied comprehensive data preprocessing methods, including image resizing for uniformity and normalization of pixel values.
· Employed data augmentation methods to enhance dataset variety and improve model adaptability.
· Optimize model performance for speed and accuracy.
· Committed to professional development by engaging in current front-end trends and technologies.
____________________________________________
Project 2: Sentiment Analysis of Organic Products.
Domain: Retail
· Developed and implemented a custom NLP-based sentiment analysis model to classify customer feedback on organic products into positive, neutral, or negative categories.
· Collaborated with cross-functional teams to define project goals, acquire and preprocess data from multiple sources, including product reviews, social media comments, and customer surveys.
· Conducted extensive data cleaning and text preprocessing, including tokenization, lemmatization, removal of stopwords and feature extraction.
· Applied machine learning algorithms to build robust sentiment classifiers and Optimized model performance using hyperparameter tuning and cross-validation.
· Worked with Data Engineers to build automated data pipelines, ensuring continuous collection and analysis of customer feedback.
· Presented project outcomes to stakeholders, demonstrating how sentiment trends informed product development in the organic product market.
____________________________________________
Project 3: Bank Marketing Campaign Optimization
Domain: Banking Financial Services